Taken the circumstances relating to Novel Coronavirus into consideration, please be noted that this conference is cancelled. (February 27, 2020.)

* This conference is dedicated to Charles Tilly, a pioneer of the event analytic method and our great teacher.

Dates:        March 2 (Monday) - March 6 (Friday), 2020
Time:         10:00 - 19:00
Language: English

Social scientists have used various forms of “event data”—a catalogue of political events such as civil violence, riots, international conflicts, and popular protests—to understand chronological changes, geospatial variations, and dynamics of political phenomena of interest. With advent of new methodological innovations including natural language processing (NLP) and big data approaches, an increasing number of scholars have started to integrate these innovations into their event data analysis. One of the main objectives of this conference is to share experiences of the scholars who are specialized in one or more of the following research areas and techniques: event analysis, NLP, big data analysis, computer sciences, computational linguistics, forecasting and early warnings, data visualization, and other relevant fields. We will explore potentials and limitations of event analysis based on the innovative methods. We welcome research papers and courses addressing the following topics:


To construct a database of political, economic, and social events, scholars have used mass media such as newspapers as data source and coded manually to extract information about political events from these sources. However, a variety of techniques associated with NLP and big data approaches may revolutionize the ways of event data development.
  • Do the NLP techniques enable new ways of building event data?
  • What are the advantages and disadvantages of the event databases constructed by using some of the NLP techniques in comparison with the ones built manually?
  • What is the prospect of using the SNS such as Twitter as a source of event data? What are the potentials and problems associated with the use of the SNS in event data analysis?
  • Event data that are useful for social science research typically include the information about (1) who (2) did what (3) to whom, (4) when, (5) where, and (6) why. In other words, scholars would like to extract information about (1) Actor, (2) Action, (3) Target, (4) Time, (5) Location, and (6) Claim/Goal of event. Which of the six types of information can the new methods based on NLP and big data approaches extract reliably and which not? What are the reasons of the difficulty of extracting the information in a consistent and reliable manner?
  • If event data projects succeed in forecasting the occurrence of political violence around the world in a timely manner, the success will improve significantly the ability of policymakers and academics to prevent such disasters in the future. To make timely predictions of political disasters such as violence and civil war, it is vital to convert a massive number of news reports instantaneously on a real time basis into a data set for subsequent analyses. What kind of event data system is more desirable for such a purpose?


  • What types of analysis can be done using event data that are developed using the NLP and big data approaches?
  • What kinds of research questions can be addressed by NLP-based event data, and what kind of questions that can be addressed by manually-build event data cannot be addressed by NLP-based event data?
  • How can we explore geographical information in event data and apply GIS (geographic information systems) methods and visualize the findings?
  • How can we explore relational information in event data and apply network analytic methods to the data?
  • How can we predict civil violence using event data? The outbreak of violent conflicts often catches us by surprise. Why do people who have lived peacefully side-by-side for years suddenly start to behave violently toward each other? Why does political violence tend to concentrate in large waves, leading to tragic outcomes ranging from riots, to ethnic cleansing, genocide, and civil wars? How to develop analytic models predicting political violence theoretically and methodologically?

Our goal is to compile proceedings from the conference as a LAINAC report booklet. Each contributor is expected to develop his/her contribution for publication based on the feedback he/she will get during the sessions.



March 2 (Monday) - March 6 (Friday)
2:30 p.m. - 5:30 p.m.
Media Laboratory #2
1st Floor, Building 18
UTokyo Komaba Campus (Map)

As one of the highlights of the conference, we welcome Roberto Franzosi, professor of sociology at Emory University, who will give a 5-day intensive seminar on the application of NLP to event analysis and visualization. Dr. Franzosi is one of the leading experts of the event analytic methods and has recently completed his own NLP-based event data systems. He teaches a graduate seminar on “Big/Small Data & Visualization” at Emory, and this intensive seminar is its condensed version with a special introduction of his own event analysis. The following topics will be covered (subject to change).

1. NLP (Natural Language Processing): What Is it all about?
2. Pre-processing tools (convert pdf, merge/split txt files, utf-8 compliance, document similarities)
3. Parsers, sentence splitters, lemmatizers, taggers (Postag, Deprel, NER-Named Entity Recognition)
4. The Stanford CoreNLP parser and the CoNLL table
5. Co-reference resolution (via Stanford CoreNLP and manual editing)
6. SVO (Subject-Verb-Object) extractor
7. Filtering S & O for social actors via WordNet
8. Geocoding of NER location (and date) values and GIS mapping in Google Earth Pro
9. SVOs network graphs in Gephi
10. Document annotator and extractor (via dictionary and dbpedia)

[NOTE] As for the 5-day intensive seminar by Dr. Roberto Franzosi, space is limited due to the classroom size. Priority is given to people (1) who can attend all five-day sessions, (2) who will present their work in the conference, and (3) who are undergraduate or graduate student of the University of Tokyo. If you would like to take the seminar, please submit your application through our online submission system (accessible from this website after January 21, 2020). The result will be notified by February 28, Friday, at the latest.

[Online Application for the Five-Day Intensive Seminar by Roberto Franzosi (CLOSED)]


* Venue: Media Laboratory #2, 1st Floor, Building 18, unless otherwise noted.
* The program is subject to change.

March 2, Monday

 10:00-11:15 Takeshi Wada (The University of Tokyo, Japan)
"Global Event Analysis: Issues in Data Development and Analysis"

 11:30-12:45 Yoshifumi Kawasaki (The University of Tokyo, Japan)
"Automatic Classification of News Articles in Spanish Using Distributed Representations of Words"

 14:30-17:15 Roberto Franzosi (Emory University, USA)
Intensive Seminar "Automatic Extraction and Visualization of the Ws of Narrative: Who, What, When, Where" Day #1

 17:30-20:00 Welcome Reception
Venue: Open Space, 4th Floor, Building 18

March 3, Tuesday

 10:00-11:15 Roudabeh Kishi (Armed Conflict Location & Event Data Project, USA)
"Ensuring a Robust Methodology in Monitoring Patterns of Conflict and Disorder"

 11:30-12:45 Antonio Ruiz Tinoco (Sophia University, Japan)
"Collecting Geolocated Tweets and Mapping Them Out with QGIS"

 14:30-17:15 Roberto Franzosi (Emory University)
Intensive Seminar "Automatic extraction and visualization of the Ws of narrative: Who, What, When, Where" Day #2

 17:30-18:45 Jason Franz (University of Heidelberg, Germany)
"The Best of Both Worlds? Combining Big Data with Large-Scale Collaborative Manual Coding in the Heidelberg Approach to Conflict Research"

March 4, Wednesday

 10:00-11:15 Doug Bond (Virtual Research Associates & Harvard University, USA)
"From Manual Encoding to Sparse Parsing to Field Monitoring: Three-Decades of Working with KEDS and VRA’s Proprietary Parsers, the PANDA and IDEA event Frameworks, and Supporting Field Data Monitoring Efforts for NGO & IGO Clients"
 11:30-12:45 Yoojin Koo (The University of Tokyo, Japan)
"Organizing Local Conservatives in Japan: Utilizing the Event Data"

 14:30-17:15 Roberto Franzosi (Emory University)
Intensive Seminar "Automatic extraction and visualization of the Ws of narrative: Who, What, When, Where" Day #3

 17:30-18:45 Nestor Alvaro (StratioBD, Spain)
"Corpus of Spanish news containing political actions"

March 5, Thursday

 10:00-11:15 Roudabeh Kishi (Armed Conflict Location & Event Data Project, USA)
"Using Tableau to Explore Trends in Conflict and Disorder using ACLED Data" Part 1
 11:30-12:45 Roudabeh Kishi (Armed Conflict Location & Event Data Project, USA)
"Using Tableau to Explore Trends in Conflict and Disorder using ACLED Data" Part 2

 14:30-17:15 Roberto Franzosi (Emory University)
Intensive Seminar "Automatic extraction and visualization of the Ws of narrative: Who, What, When, Where" Day #4

 17:30-18:45 Jason Franz (University of Heidelberg, Germany)
"Chasing Down Clausewitz' Chameleons. Approaching Political Conflict Through Network Analysis"

March 6, Friday

 10:00-11:15 Sean Yeo (Virtual Research Associates, USA) & Hiroyoshi Fujimoto (Fukuyama University)
"Research Update from a Cross-Comparison, Triangulation and Validity Check of Public Data Sets and Early Warning Field Reports"

 11:30-12:45 Nestor Alvaro (StratioBD, Spain)
"Machine Learning pipeline for identifying news containing political actions and for extracting the actions within these news"

 14:30-17:15 Roberto Franzosi (Emory University)
Intensive Seminar "Automatic extraction and visualization of the Ws of narrative: Who, What, When, Where" Day #5

 17:30-18:45 Farewell Party 
"TBA" in Shibuya


Conference registration: Free. 
Reception party: Free.

  • Néstor Alvaro (Service Delivery Manager, StratioBD, Spain)
  • Doug Bond (Director, Virtual Research Associates, VRA & Harvard University, USA)
  • Jason Franz (HIIK Conflict Barometer Methodology Group, Heidelberg Institute for International Conflict Research at the Institute for Political Science, University of Heidelberg, Germany)
  • Roberto Franzosi (Emory University, USA)
  • Yoshifumi Kawasaki (The University of Tokyo, Japan)
  • Roudabeh Kishi (Research Director, Armed Conflict Location & Event Data Project, ACLED, USA)
  • Yoojin Koo (The University of Tokyo, Japan)
  • Antonio Ruiz Tinoco (Emeritus Professor, Sophia University, Japan)
  • Takeshi Wada (The University of Tokyo, Japan)
  • Seunghwan Yeo (Program Manager, Virtual Research Associates, VRA)
* Scholars and practitioners who have confirmed his/her participation as of December 23, 2019.

  • UTokyo LAINAC
  • JSPS KAKENHI, Project/Area Number 15KT0040. ""International and interdisciplinary joint research to develop a system for predicting the outbreak of violent conflict."
  • JSPS KAKENHI, Porject/Area Number 18H00921. "International collaborative research on 'another world' proposed by social movements in the post-neoliberal era."
The conference is a part of the Global Event Data System (GEDS) Seminar Series "Big Data and Natural Language Processing in the Social Sciences & Humanities" (see the following website for more information: http://www.jp.lainac.c.u-tokyo.ac.jp/research/seminars/bdnlp)

This event is organized by the Institute for Advanced Global Studies, The University of Tokyo.

The conference will take place on March 2-6, 2020 at the University of Tokyo, Komaba Campus.


The submission deadline of abstract is February 3 (Monday), 2020 at 11:59 p.m. (Japan Standard Time).

Paper Title (15 words or less) 

Abstract (250 words or less)

How to Submit: All submissions must be made via the online submission system (accessible from this website starting from January 17, 2020). For the detail, follow the instructions on the submission system page.

[Online Abstract Proposal Submission Form (CLOSED)]


The results of application will be notified by email to the applicants on February 5 (Wednesday), 2020 (Japan Standard Time).


The submission deadline of paper is February 24 (Monday), 2020 at 11:59 p.m. (Japan Standard Time).

Length and Style: Only draft/working papers that are 15-35 pages, double-spaced, (including footnotes, tables, figures, and bibliographies), that represent a well-thought-out idea or topic will be considered. Lengthier versions are more suitable for subsequent publications than for oral presentations.

How to Submit: Submission information will be sent to those who are accepted for presentation.