Skip to Main Content
It looks like you're using Internet Explorer 11 or older. This website works best with modern browsers such as the latest versions of Chrome, Firefox, Safari, and Edge. If you continue with this browser, you may see unexpected results.

Digital Humanities: Text Mining & Analysis

Getting Started

Advice on data mining, text mining, SAS Enterprise Miner and Text Miner: Damien Mather

Learn More

Do More

Text & corpus analysis
Topic Modelling

Topic models are a type of statistical model used as a text-mining tool to discover the hidden semantic structures ("topics") occurring in a collection of documents. They are useful for analyzing large collections of unlabeled text.


Further Tools to Create Beautiful Visualisations

see Visualisation 

A brief introduction to distant reading

We want your feedback!

This guide continues to evolve, and we really welcome your feedback so we can continue to improve it. Please let us know if you find:

  • Incorrect or irrelevant details, tools that don't work, dead links or otherwise unhelpful information
  • Helpful details, tools, links or information that you think need to be on the guide, but aren't currently.

We'd also love to hear from you if you want to have your project featured on the guide, or would like to be profiled on the Connect&Collaborate@Otago page. Email Alexander Ritchie, or Antje Lubcke with any comments or suggestions on how we can improve it.

Viva Digital Humanities!