| Instructor | Lingpeng Kong |
| Teaching Assistant | |
| Syllabus | Natural language processing (NLP) is the study of human language from a computational perspective. The course will be focusing on machine learning and corpus-based methods and algorithms. We will cover syntactic, semantic and discourse processing models. We will describe the use of these methods and models in applications including syntactic parsing, information extraction, statistical machine translation, dialogue systems, and summarization. This course starts with language models (LMs), which are both front and center in natural language processing (NLP), and then introduces key machine learning (ML) ideas that students should grasp (e.g. feature-based models, log-linear models and then the neural models). We will land on modern generic meaning representation methods (e.g. BERT/GPT-3) and the idea of pretraining / finetuning. |
| Introduction by Professor | Natural language processing (NLP) is the study of human language from a computational perspective. The course will investigate modern NLP algorithms from the machine learning perspective. We will cover syntactic, semantic and other useful models for language. We will describe the use of these methods and models in applications including syntactic parsing, information extraction, statistical machine translation, dialogue systems, and summarization. This course starts with language models (LMs), which are both front and center in natural language processing (NLP), and then introduces key machine learning (ML) ideas that students should grasp (e.g., feature-based models, log-linear models and then the neural models). We will land on modern generic meaning representation methods (e.g., BERT/GPT-3) and the idea of pretraining / finetuning / prompt-based learning. |
| Learning Outcomes | |
| Pre-requisites | - |
| Compatibility | Nil |
| Prior knowledge expected | Basic knowledge about Machine Learning, Probability, Statistics, and Programming |
| Topics covered | |
| Assessment | |
| Course materials | Prescribed textbook: - Jurafsky, Daniel, and James H. Martin. "Speech and Language Processing."
|
| Session dates | |
| Add/drop | 1 September, 2025 - 14 September, 2025 |
| Maximum class size | |
| Moodle course website | |