Foundations of NLP

Fall 2024 · Mahindra University

Fall 2024 · Course CS3126 · ELT 1
Lecture · Topic · Resources
Lecture 0: Introduction to NLP and its applications
Lecture 1: Regular Expressions
  • Class Homework/Hands-on Activity
  • Resources related to Chapter
  • Regular Expressions (Textbook Chapter)
Lecture 2: Text Normalization and Tokenization
  • Class Homework/Hands-on Activity
  • Resources related to Chapter
  • Textbook Chapter
  • Code/Implementation
Lecture 3: Stop Words, Bag-of-Words, TF-IDF, POS Tagging, NER
  • Additional Resources/Suggested Readings
  • TF-IDF (Python documentation)
  • Textual Data (Implementation/Code)
  • Stemming/Lemmatization (Textbook Chapter)
  • Words and Vectors (Textbook Chapter)
  • Text Preprocessing (Implementation)
  • NLP Datasets
  • spaCy Library for NLP
Lecture 4: Semantics, Distributed Representations, Vector Embeddings, and Word2vec
  • Class Homework/Hands-on Activity
  • Resources related to Chapter
  • Vector Semantics (Textbook Chapter)
  • Logistic regression (Optional)
  • GloVe: Global Vectors for Word Representation (Implementation/Code)
  • fastText: Library for efficient text classification and representation learning (Implementation/Code)
Lecture 5: Language Models - Probabilistic Language Modeling
  • Class Homework/Hands-on Activity
  • Resources related to Chapter
  • N-gram Language models (Textbook Chapter)
  • PyTorch Basics
Lecture 6: Recurrent Neural Networks, LSTM
  • Class Homework/Hands-on Activity
  • Resources related to Chapter
  • RNNs and LSTMs (Textbook Chapter)
  • Language Models, RNN and LSTM [CS224n Lecture Notes]
  • RNN PyTorch Slides & Implementation
  • On the Difficulty of Training Recurrent Neural Networks [Research Paper] [Additional]
  • Vanishing and Exploding gradients (Colab Notebook)
  • Recurrent Neural Network [Implementation/Code]
  • Recurrent Neural Network [Blog]
Lecture 7: Attention
  • Class Homework/Hands-on Activity
  • Resources related to Chapter
  • Transformers (Textbook Chapter)
  • Attention Visualization [Implementation]
  • Attention Is All You Need [Research Paper]
  • Visual Attention paper [Research Paper]
  • WMT: The Conference on Machine Translation [Conference]
Lecture 8: Transformers
  • Class Homework/Hands-on Activity
  • Resources related to Chapter
  • Transformers (Textbook Chapter)
  • Transformer [Blog]
Lecture 9, Lecture 11: BLEU Score, Contextual Embeddings, BERT, MLM
  • Class Homework/Hands-on Activity
  • Resources related to Chapter
  • Masked Language Models (Textbook Chapter)
  • BERT [Implementation]
  • Fine-tuned BERT with NER [Implementation]
Lecture 12: Overview of Large Language Models
  • Class Homework/Hands-on Activity
  • Resources related to Chapter
  • Large Language Models (Textbook Chapter)
  • Unsloth - Finetune for Free (Additional Link)
  • A Comprehensive Survey on Pretrained Foundation Models: A History from BERT to ChatGPT [Research Paper]
  • A Survey of Large Language Models [Research Paper]
Lecture 13: Generative Modeling, Naive Bayes
  • Resources related to Chapter
  • Naive Bayes, Text Classification, and Sentiment (Textbook Chapter)
Lecture 14: POS, Markov Models, HMM, Viterbi
  • Resources related to Chapter
  • Sequence Labeling for Parts of Speech and Named Entities (Textbook Chapter)