HathiTrust Text Mining Workshops at UF on October 1-2, 2019

The Libraries at UF are partnering with HathiTrust to host text mining workshops on October 1 and 2 in the Scott Nygren Scholars Studio (Library West 212). Priority registration is for attendees from HathiTrust member institutions. Registration links and more details are below.

Please note that you can register as either a Librarian or a Researcher, not both. October 1 is the workshop for Librarians, and October 2 is the workshop for Researchers.

No experience with text mining is required, so if this is something you’re interested in exploring, please consider registering!

Registration Links

Librarian (train the trainer session; 10/1)


Researcher (folks using HathiTrust for research; 10/2)

HathiTrust Text Mining Workshop Details

Text Mining with HathiTrust: An Introduction for Librarians
October 1, 2019
9:00 am – 4:00 pm
Scott Nygren Scholars Studio, Humanities and Social Sciences Library West 212

This day-long workshop will introduce librarians to computational text and data mining, with hands-on examples drawn from HathiTrust data and HathiTrust Research Center analysis tools. Attendees will learn how scholars employ this research method, as well as the types of research questions that can be explored using it. Particular attention will be given to the librarian’s role in supporting text and data mining throughout the research process. Attendees can expect to leave with an understanding of:

  • The text data available from HathiTrust
  • The tools and services of the HathiTrust Research Center, including freely available datasets, plug-and-play tools, and secure computing environments for advanced researchers
  • The ways librarians can support text and data mining research

The workshop will be particularly useful for librarians who would like to become more familiar with HathiTrust research opportunities or who would like to gain a deeper understanding of the current state of academic text and data mining research. No experience is required! While the workshop will address technical topics, the content has been designed to be approachable for librarians from a variety of backgrounds.

HathiTrust’s Data and Analysis Tools for Text Mining Research
October 2, 2019
9:00 am – 12:00 pm
Scott Nygren Scholars Studio, Humanities and Social Sciences Library West 212

This half-day workshop will introduce attendees to the text data and computational tools of HathiTrust. HathiTrust operates a repository of over 17 million items digitized at a network of partner libraries. This massive collection of text is available for computational text mining primarily through the tools and services of the HathiTrust Research Center. Attendees of this workshop will gain hands-on experience with these data and tools in order to become more familiar with the opportunities for research HathiTrust makes available. The workshop will include:

  • A characterization of the data available
  • Hands-on activities with HTRC’s Extracted Features dataset and secure research environments

No experience is required, but prior exposure to text analysis concepts would be beneficial.