DATA8005: Advanced Natural Language Processing

Course Information

Instructor

Lecture

Course Description

Natural language processing (NLP) is the study of human language from a computational perspective. This course is an introductory graduate-level course on natural language processing aimed at students who are interested in doing cutting-edge research in the field. In this class, we will cover recent developments on core techniques and modern advances in NLP, especially in the era of large language models. We will also survey some recent NLP research topics including language grounding, agents, multimodality, interactivity, and interoperability for NLP. Students will gain the necessary skills and experience to understand, design, implement, and test large language models through a final project. We will also introduce cutting-edge research topics and learn how to conduct NLP research through paper readings and discussions. We will potentially also host invited speakers for talks.

Prerequisites

We require students to have prior knowledge undergraduate linear algebra, probability and statistics, machine learning, or deep learning. Familiarity with Python programming is required. Introduction to natural language processing is recommended.

Grading

Course Schedule

Date Topic Material Event Due
Week 1
Sep 3
Introduction (Tao Yu)
[slides]
Readings
Week 2
Sep 10
Introduction to LLMs (Tao Yu)
[slides]
Readings Others
Week 3
Sep 17
Introduction to LLMs (Tao Yu)
[slides]
Readings
Week 4
Sep 24
Canceled due to bad weather
Week 5
Oct 1
No class
Week 6
Oct 8
No class
Week 7
Oct 15
No class
Week 8
Oct 22
Model architectures (Tianbao Xie, Yitao Liu)
Mixture of experts (Xusen Xiao, Haolun Wang)
Readings Presentation and project registration Due
Week 9
Oct 29
GPUs, Kernels (Haofeng Xu, Qingwen Bu, Bangjun Wang)
Parallelism (Qingquan Lin, Haochen Luo)
Readings
Week 10
Nov 5
Scaling laws (Kaixuan Wang, Liuao Pei)
LLM Data (Baiyue He, Shujie Li)
Readings Project proposal Due
Week 11
Nov 12
Alignment - SFT/RLHF (Yi Ji, Yao Li)
Alignment - RL (Jiangxuan Long, Bo Chen)
Readings
Week 12
Nov 19
Multimodal LMs 1 - VLMs (Wenhao Yuan, Jian Chen)
Multimodal LMs 2 - diffusion models (Chen Xu, Zhihuan Jiang)
Readings
Week 13
Nov 26
LLM/VLMs as Agents (Junzhou Fang, Lingrui Xu)
Embodied AI - VLA (Jiwen Yu, Sijin Chen)
Readings
Week 14
Dec 3
No class