Having a list of political actions of interest poses a challenge as we need to extract the information regarding those actions from different sources. In this workshop, we will see two different approaches for applying text mining to a large news corpus aiming at extracting the set of news containing those political actions. We will start with a basic approach, and then compare these results against a more advanced approach using NLP - Natural Language Processing - . This workshop will also present the results obtained by these two different strategies, also assessing venues for improvement. ==
Nestor Alvaro is a Data Scientist at Deloitte Digital researching on machine learning (ML) and natural language processing (NLP).
== *Supported by JSPS KAKENHI, Project/Area Number 15KT0040 (Takeshi Wada, The University of Tokyo)
|