Schedule

Introduction - Overview of NLP (Aug 27)

Content:

  • What is natural language processing?
  • What are the features of natural language?
  • What do we want to do with NLP?
  • What makes it hard?
  • Building a rule-based classifier
  • Training a bag-of-words classifier

Slides: Intro Slides

Code: Simple Text Classifiers

Reading Material

Representing Words (Aug 29)

Content:

  • Subword models
  • Continuous word embeddings
  • Training more complex models
  • Neural network basics
  • Visualizing word embeddings

Recitation (OH): PyTorch and SentencePiece

Slides: Word Representation and Text Classification Slides

Code: Subword Models, Text Classification

Reading Material

Language and Sequence Modeling (Sep 03)

Content:

  • Language Modeling Problem Definition
  • Count-based Language Models
  • Measuring Language Model Performance: Accuracy, Likelihood, and Perplexity
  • Log-linear Language Models
  • Neural Network Basics
  • Feed-forward Neural Network Language Models

Recitation (OH): N-Gram Language Model

Slides: Language Modeling Slides

Reading Material

Attention and Transformers (Sep 05)

Content:

  • Attention
  • Transformer Architecture
  • Multi-Head Attention
  • Positional Encodings
  • Layer Normalization
  • Optimizers and Training
  • LLaMa Architecture

Recitation (OH): Hugging Face Transformers, Annotated Transformer

Slides: Attention and Transformers Slides

Reading Material

Pre-training and Pre-trained LLMs (Sep 10)

Content:

  • Overview of pre-training
  • Pre-training objectives
  • Pre-training data
  • Open vs. closed models
  • Representative pre-trained models

Slides: Pretraining Slides

References

Instruction Tuning (Sep 12)

Co-Lecturer Xiang Yue

Content:

  • Multi-tasking
  • Fine-tuning and Instruction Tuning
  • Parameter Efficient Fine-tuning
  • Instruction Tuning Datasets
  • Synthetic Data Generation

Slides: Instruction Tuning Slides

Reading Material

Prompting and Complex Reasoning (Sep 17)

Content:

  • Prompting Methods
  • Sequence-to-sequence Pre-training
  • Prompt Engineering
  • Answer Engineering
  • Multi-prompt Learning
  • Prompt-aware Training Methods
  • Types of Reasoning
  • Chain-of-thought and Variants
  • Supervised Training for Reasoning

Recitation(OH): OpenAI API, LiteLLM

Slides: Prompting Slides

Reading Material

Reinforcement Learning (Sep 19)

Content:

  • Methods to Gather Feedback
  • Error and Risk
  • Reinforcement Learning
  • Stabilizing Reinforcement Learning

Slides: Reinforcement Learning and Human Feedback Slides

Experimental Design and Human Annotation (Sep 24)

Content:

  • Experimental Design
  • Data Annotation

Slides: Experimental Design Slides

References:

Retrieval and RAG (Sep 26)

Content:

  • Retrieval Methods
  • Retrieval Augmented Generation
  • Long-context Transformers

Recitation(OH): LangChain or LlamaIndex

Slides: Retrieval Augmented Generation Slides

References

Distillation, Quantization, and Pruning (Oct 01)

Co-Lecturer Vijay Viswanathan

Content:

  • Distillation
  • Quantization
  • Pruning

Slides: Distillation Slides

References

Domain Specific Modeling: Code and Math (Oct 03)

Co-Lecturer Xiang Yue

Content:

  • Code Generation Models
  • Math Models

Slides: Code and Math Slides

References

Long Sequence Models (Oct 08)

Content:

  • Transformer compute/memory complexity
  • Extrapolation of trained models
  • Alternative transformer architectures
  • Non-attentional models
  • Evaluation of long-context models

Slides: Long Context Slides

Ensembling and Mixture of Experts (Oct 10)

Content:

  • Ensembling
  • Model Merging
  • Sparse Mixture of Experts
  • Pipeline Models

Slides: Ensembling and MOE Slides

References:

Tool Use and LLM Agent Basics (Oct 22)

Content:

  • Agent Basics
  • Agent Use Cases/Envrionments
  • Tool Use
  • Environment Representation
  • Environment Understanding
  • Reasoning and Planning
  • Multi-agent Systems

Slides: Tool Use and LLM Agent Basics

References

Agents for Software Development and Web Browsing (Oct 24)

Content:

  • Coding Agents
  • Web Browsing Agents

Slides: Software Development and Web Browsing Slides

References

Project Discussion (Oct 29)

Content:

  • Project Discussion

Multimodal Models (Oct 31)

Co-Lecturer Xiang Yue

Content:

  • Evaluation of NLP tasks and LMs
  • Multimodal

Slides: Evaluation and Multimodal Slides

References

Linguistics and Computational Linguistics (Nov 07)

Knowledge Based QA (Nov 12)

Content:

  • Knowledge Based QA

Slides: Knowledge Based QA Slides

References

Multilingual NLP (Nov 14)

Content:

  • Multilingual NLP

Slides Multilingual NLP Slides

References

Safety and Security: Bias, Fairness and Privacy (Nov 19)

Content:

  • Safety and Security: Bias, Fairness and Privacy

Slides Safety and Security: Bias, Fairness and Privacy Slides

Inference Algorithms - Sean Welleck (Nov 21)

Guest Lecturer Sean Welleck

Reading Material

Guest Lecture - Beidi Chen (Nov 26)

Guest Lecturer Beidi Chen