LLMs and AI for Cultural Analytics

LLMs and AI for Cultural Analytics#

This section introduces Large Language Models (LLMs) and AI tools for humanities and social science research. The tutorials below are drawn from AI for Humanists, a project created by Melanie Walsh, David Mimno, and Matt Wilkens and designed to help humanities researchers and students understand, use, and critically engage with AI technologies.

What LLM Should I Use?#

See the AI for Humanists Guide to Models for help choosing the right model for your research task.

Tutorials#

The following tutorials are available as Google Colab notebooks, which means you can open and run them directly in your browser without any installation.

Local LLMs#

Working with Local LLMs (On Your Own Computer!) — Ollama and Llama 3 Run LLMs locally so your data never leaves your machine. Covers setting up Ollama, creating structured data from unstructured text, chatting with a local LLM, and generating document embeddings. 🚀 Open in Colab

Word and Document Embeddings#

Measuring Document Similarity with LLMs Use LLMs to find similar texts within a dataset, including comparing narrative versus non-narrative texts and analyzing poetry collections. 🚀 Open in Colab

Measuring Word Similarity with BERT Use a pre-trained BERT model to measure word similarity by finding semantically comparable words from poem collections. 🚀 Open in Colab

Measuring Word Similarity with BERT (Spanish) A demo showing how the word similarity approach works with a Spanish-language BERT model, illustrating that these techniques can be applied beyond English. 🚀 Open in Colab

Text Classification#

Zero-Shot Prompting with LLMs Classify texts without any training examples. Covers prompting strategies for book genre and narrative classification, and how to evaluate results. 🚀 Open in Colab

Training and Fine-Tuning BERT for Classification Train and fine-tune a BERT model to classify Goodreads reviews by book genre. 🚀 Open in Colab

Workshop Slides#

The AI for Humanists team has also given a number of workshops on LLMs and AI for humanities researchers. Slides from past workshops are available on the AI for Humanists Workshops page. A few highlights:

Artificial Intelligence: An Overview (NEH, December 2023)
Large Language Models for Humanists: A Hands-On Introduction (Simpson Center for the Humanities, November 2023)
A Hands-On Introduction to Large Language Models for FAccT Researchers (FAccT Conference, June 2023)
BERT for Social Scientists and Humanists (Bell Labs, July 2022)