247 Hesburgh Library, Navari Family Center for Digital Scholarship
Learn how to automatically "read" & "analyze" an arbitrarily large corpora of textual materials. The Distant Reader, a locally written system, can accept as input large numbers of files of just about any type. It then creates a corpus from the input, converts it into plain text, does natural language processing against the plain text, and outputs sets of reports enabling you to use & understand the corpora to a greater degree.
Useful to anybody across campus who needs to read large volumes of materials, this hands-on workshop will help you take control of your content.
Related LibGuide: Text mining and natural language processing