Teaching assistant of NLP

Undergraduate course, ENSAE Paris, 2023

Teacher : Pierre Colombo

Introductory course to NLP, with practical sessions in Pytorch and HuggingFace.

Program

Courses:

  • The Basics of NLP : This session introduces what is NLP, why it is challenging and how we approach any NLP problem.
  • Representing text into vectors : This lecture covers representation learning techniques for Natural Language Processing.
  • Deep Learning Methods for NLP : This lecture presents Deep Learning techniques used in NLP. We cover the design and training principles. We present the Multi-Layer-Perceptron (MLP), Recurrent Architectures (RNN and LSTM) and the transformer architecture.
  • Language Modeling : This lecture introduces language modeling.
  • Sequence Labeling & Classification : Sequence Labeling & Classification with Deep Learning models such as RNN-based models and transformers.
  • Sequence Generation : Sequence Generation using Encoder-Decoders.

Labs :

  • Introduction to textual data with Python : This lab introduces basics processing operations required for any NLP experiments. After introducing preprocessing tools for data cleaning and tokenization, we compute some descriptive statistics on textual data.
  • Word Embeddings and their evaluation : This lab explores representation learning techniques for words and documents. It explores models like tf-idf and Word2vec and develop quantitative and qualtiative evaluation methods.
  • Sequence Labeling and Sequence Classification with Deep Learning Models: This lab implements, trains and evaluates sequence classification and labeling models based on Recurrent Neural Networks and transformer deep-learning architecture.
  • Machine Translation :This lab introduces basics processing operations required for any NLP experiments. After introducing preprocessing tools for data cleaning and tokenization, we compute some descriptive statistics on textual data.