Loading…
OKFestival has ended
View analytic
Wednesday, July 16 • 14:00 - 14:45
Data doesn’t grow in tables - dealing with large sets of documents

Sign up or log in to save this to your schedule and see who's attending!

While we would all like for our journalistic evidence to be delivered to our doorsteps in nicely-formatted spreadsheets, more often than not that is not the case. Instead, information often comes as a large stash of (unstructured) documents. When these collections grow, reading through all of the documents stops being an option.

This workshop will discuss alternatives: what tools and technologies are available for the automated analysis of large document sets? How can you learn about the recurring topics of a document stash automatically? How can important concepts, people and companies be traced across the result of a leak?

Facilitators
avatar for Friedrich Lindenberg

Friedrich Lindenberg

Tech Coordinator, OCCRP
Friedrich Lindenberg is a coder and data journalist working on web technology for new narrative and investigative techniques. He was an 2014 ICFJ Knight International Journalism Fellow with Code for Africa, and a 2013 Knight-Mozilla OpenNews Fellow at Spiegel Online. Previously, he... Read More →

Wednesday July 16, 2014 14:00 - 14:45
Space K2 Kulturbrauerei

Attendees (0)