Schedule

Introduction - Overview of NLP (Aug 27)

Content:

  • What is natural language processing?
  • What are the features of natural language?
  • What do we want to do with NLP?
  • What makes it hard?
  • Building a rule-based classifier
  • Training a bag-of-words classifier

Slides: Intro Slides

Code: Simple Text Classifiers

Reading Material

Representing Words (Aug 29)

Content:

  • Subword models
  • Continuous word embeddings
  • Training more complex models
  • Neural network basics
  • Visualizing word embeddings

Recitation (OH): PyTorch and SentencePiece

Slides: Word Representation and Text Classification Slides

Code: Subword Models, Text Classification

Reading Material

Language and Sequence Modeling (Sep 03)

Content:

  • Language Modeling Problem Definition
  • Count-based Language Models
  • Measuring Language Model Performance: Accuracy, Likelihood, and Perplexity
  • Log-linear Language Models
  • Neural Network Basics
  • Feed-forward Neural Network Language Models

Recitation (OH): N-Gram Language Model

Slides: Language Modeling Slides

Reading Material

Attention and Transformers (Sep 05)

Content:

  • Attention
  • Transformer Architecture
  • Multi-Head Attention
  • Positional Encodings
  • Layer Normalization
  • Optimizers and Training
  • LLaMa Architecture

Recitation (OH): Hugging Face Transformers, Annotated Transformer

Slides: Attention and Transformers Slides

Reading Material

Pre-training and Pre-trained LLMs (Sep 10)

Content:

  • Overview of pre-training
  • Pre-training objectives
  • Pre-training data
  • Open vs. closed models
  • Representative pre-trained models

Slides: Pretraining Slides

References

Instruction Tuning (Sep 12)

Co-Lecturer Xiang Yue

Content:

  • Multi-tasking
  • Fine-tuning and Instruction Tuning
  • Parameter Efficient Fine-tuning
  • Instruction Tuning Datasets
  • Synthetic Data Generation

Slides: Instruction Tuning Slides

Reading Material

Prompting and Complex Reasoning (Sep 17)

Content:

  • Prompting Methods
  • Sequence-to-sequence Pre-training
  • Prompt Engineering
  • Answer Engineering
  • Multi-prompt Learning
  • Prompt-aware Training Methods
  • Types of Reasoning
  • Chain-of-thought and Variants
  • Supervised Training for Reasoning

Recitation(OH): OpenAI API, LiteLLM

Slides: Prompting Slides

Reading Material

Reinforcement Learning (Sep 19)

Content:

  • Methods to Gather Feedback
  • Error and Risk
  • Reinforcement Learning
  • Stabilizing Reinforcement Learning

Slides: Reinforcement Learning and Human Feedback Slides

Experimental Design and Human Annotation (Sep 24)

Content:

  • Experimental Design
  • Data Annotation

Slides: Experimental Design Slides

References:

Retrieval and RAG (Sep 26)

Content:

  • Retrieval Methods
  • Retrieval Augmented Generation
  • Long-context Transformers

Recitation(OH): LangChain or LlamaIndex

Slides: Retrieval Augmented Generation Slides

References

Distillation, Quantization, and Pruning (Oct 01)

Co-Lecturer Vijay Viswanathan

Content:

  • Distillation
  • Quantization
  • Pruning

Slides (Spring 2024 version):: Distillation Slides

References

Domain Specific Modeling: Code and Math (Oct 03)

Co-Lecturer Xiang Yue Content:

Recitation(OH):

Slides :: References

Long Sequence Models (Oct 08)

Content:

Recitation (OH): Unlimiformer, Mamba

Other LMs (Oct 10)

Content:

Recitation(OH):

Slides :: References

Language Agents I - Frameworks (Oct 22)

Content:

Recitation(OH):

Slides :: References

Language Agents II - Applications (Oct 24)

Content:

  • Tool Use
  • Language Agents

Slides (Spring 2024 version):: Tool Use Slides

Slides (Spring 2024 version):: Language Agents Slides

References

Evaluation of LLMs (Oct 29)

Content:

Recitation(OH):

Slides :: References

Multimodal Models (Oct 31)

Co-Lecturer Xiang Yue

Content:

Recitation(OH):

Slides :: References

Linguistics and Computational Linguistics (Nov 07)

Knowledge Based QA (Nov 12)

Content:

  • Knowledge Based QA

Slides (Spring 2024 version):: Knowledge Based QA Slides

References

Ensembling and Mixture of Experts (Nov 14)

Content:

  • Ensembling
  • Model Merging
  • Sparse Mixture of Experts
  • Pipeline Models

Slides (Spring 2024 version):: Multi-model Slides

References:

Safety and Security: Bias, Fairness and Privacy (Nov 19)

Content:

Recitation(OH):

Slides :: References

Inference Algorithms - Sean Welleck (Nov 21)

Guest Lecturer Sean Welleck

Reading Material

Guest Lecture - Beidi Chen (Nov 26)

Guest Lecturer Beidi Chen