OKFestival has ended
Back To Schedule
Wednesday, July 16 • 14:00 - 14:45
Data doesn’t grow in tables - dealing with large sets of documents

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

While we would all like for our journalistic evidence to be delivered to our doorsteps in nicely-formatted spreadsheets, more often than not that is not the case. Instead, information often comes as a large stash of (unstructured) documents. When these collections grow, reading through all of the documents stops being an option.

This workshop will discuss alternatives: what tools and technologies are available for the automated analysis of large document sets? How can you learn about the recurring topics of a document stash automatically? How can important concepts, people and companies be traced across the result of a leak?

avatar for Friedrich Lindenberg

Friedrich Lindenberg

Data Librarian, OpenSanctions
Friedrich Lindenberg is a coder and data journalist working on web technology for new narrative and investigative techniques. He's currently building OpenSanctions, a global database of persons of journalistic interest.

Wednesday July 16, 2014 14:00 - 14:45 CEST
Space K2 Kulturbrauerei

Attendees (0)