Calculus [Online course from MIT]
Linear Algebra [CS6015 or equivalent] | [Online course from MIT]
Probability Theory [CS6015 or equivalent] | [Online course from MIT]
Non-linear Optimization [CS5020 or equivalent] | [First Course in Optimization by Prof. Soman (IITB) available on CDEEP]
Pattern Recognition and Machine Learning [CS5691 or equivalent] | [Andrew Ng's ML course]
Instructor: Mitesh M. Khapra
When: Jan-May 2024
Lectures: Slot H
Where: CS25
Teaching Assistants:
| Name | Lab | Office hours | Days | |
|---|---|---|---|---|
| Anushka Singh | AI4Bharat | cs22s015@smail.iitm.ac.in | 3-4 pm | Tuesday, Thursday |
| Oikantik Nath | AI4Bharat | cs22s013@smail.iitm.ac.in | 12-1 pm | Tuesday, Thursday |
| Bibhuti Majhi | AI4Bharat | cs22m031@smail.iitm.ac.in | 2-4 pm | Wednesday |
| Sarthak Naithani | AI4Bharat | cs22m078@smail.iitm.ac.in | 3-4 pm | Tuesday, Wednesday |
| Putta Sai Sree Ram | AI4Bharat | cs22m076@smail.iitm.ac.in | 3-4 pm | Monday, Friday |
| Ravi Prakash Singh | AI4Bharat | cs22m069@smail.iitm.ac.in | 1-2 pm | Tuesday, Friday |
| Guddeppagari Shathish Kumar Reddy | AI4Bharat | cs22m080@smail.iitm.ac.in | 3-4 pm | Monday, Friday |
| Poorbi Mukesh Dalal | - | cs22m064@smail.iitm.ac.in | 1-3 pm | Wednesday |
| Ashok R | - | cs22m023@smail.iitm.ac.in | 2-4 pm | Wednesday |
| Amit Kumar | - | cs22m010@smail.iitm.ac.in | 2-4 pm | Wednesday |
| Lecture# | Contents | Lecture pdf | Lecture Videos | Extra Reading Material |
|---|---|---|---|---|
| Lecture 1 | (Partial) History of Deep Learning, Deep Learning Success Stories | Slides | M1 | M2| M3 | M4 | M5 | Juergen Schmidhuber, Deep Learning in Neural Networks: An Overview. 2014 |
| Lecture 2 | McCulloch Pitts Neuron, Thresholding Logic, Perceptrons, Perceptron Learning Algorithm and Convergence, Multilayer Perceptrons (MLPs), Representation Power of MLPs | Slides | M1 | M2| M3 | M4 | M5| M6 | M7 | M8 | Chapters 1,2,3,4 from Neural Networks by Rojas |
| Lecture 3 | Sigmoid Neurons, Gradient Descent, Feedforward Neural Networks, Representation Power of Feedforward Neural Networks | Slides | M1 | M2| M3 | M4 | M5| M6 | M7| M8 | http://neuralnetworksanddeeplearning.com/chap4.html |
| Lecture 4 | Feedforward Neural Networks, Backpropagation | Slides | M1 | M2 | M3 | M4 | M5 | M6 | M7 | M8 | See Lecture 2 [Training Neural Networks] by Hugo Larochelle |
| Lecture 5 | Gradient Descent (GD), Momentum Based GD, Nesterov Accelerated GD, Stochastic GD, AdaGrad, RMSProp, Adam | Slides-1 | Slides-2 | M1 | M2 | M3 | M4 | M5 | M6 | M7 | M8 | M9 | M10 | M11 | M12 | M13 | M14| M15 | Lecture 6 | Bias Variance Tradeoff, L2 regularization, Early stopping, Dataset augmentation, Parameter sharing and tying, Injecting noise at input, Ensemble methods, Dropout | Slides | M1| M2 | M3 | M4| M5 | M6 | M7 | M8 | M9 | M10| M11|M12 |
| Lecture 7 | Greedy Layerwise Pre-training, Better activation functions, Better weight initialization methods, Batch Normalization | Slides | M1 | M2| M3 | M4 | M5| M6 | |
| Lecture 8 | Convolutional Neural Networks, LeNet, AlexNet, ZF-Net, VGGNet, GoogLeNet, ResNet | Slides | M1 | M2| M3 | M4 | M5| M6 | M7| M8 | |
| Lecture 9 | Visualizing Convolutional Neural Networks, Guided Backpropagation, Deep Dream, Deep Art, Fooling Convolutional Neural Networks | Slides | M1 | M2| M3 | M4 | M5| M6 | M7 | M8 | M9 | M10 |
|
| Lecture 10 | Learning Vectorial Representations Of Words | Slides | M1 | M2| M3 | M4 | M5| M6 | M7 | M8 | M9 | M8 | M9 | M10| M11 | Lecture 11 | Recurrent Neural Networks, Backpropagation Through Time (BPTT), Vanishing and Exploding Gradients, Truncated BPTT | Slides | M1 | M2| M3 | M4 | M5 |
| Lecture 12 | Gated Recurrent Units (GRUs), Long Short Term Memory (LSTM) Cells, Solving the vanidhing gradient problem with LSTMs | Slides | M1 | M2| M3 | M4 | |
| Lecture 13 | Encoder Decoder Models, Attention Mechanism, Attention over images, Hierarchical Attention | Slides | M1 | M2| M3 | |
| Lecture 14 | Transformers: Multi-headed Self Attention, Cross Attention | Slides | M1 | M2| M3 | M4 | M5 |
| Topics | Resources | Release Date | Submission Date | |
|---|---|---|---|---|
| Assignment 1 (Graded) | Feedforward Neural Networks | Link | 19-Feb-2024 | 10-Mar-2024 |
| Assignment 2 ( Ungraded ) | Convolutional Neural Networks | Link | 03-Mar-2024 | 03-Apr-2024 |
| Assignment 3 ( Ungraded ) | Recurrent Neural Networks | Link | 03-Mar-2024 | 03-Apr-2024 |
| Assignment 4 ( Ungraded ) | RBMs and GANs | Link | 03-Mar-2024 | 03-Mar-2024 |
| Assignment 5 ( Graded ) | Transformers | Link | 22-Mar-2024 | 22-Apr-2024 |
| Endsem | 11-May-2024 (Saturday) | -- |
| Topics | Resources | Release Date | Submission Date | |
|---|---|---|---|---|
| Tutorial 1 | Calculus | |||
| Tutorial 2 | Linear Algebra | |||
| Tutorial 3 | MP Neurons, Perceptrons | |||
| Tutorial 4 | Sigmoid Neurons, Gradient Descent | |||
| Tutorial 5 | Feedforward Neural Networks, Backpropagation |
Deep Learning for Computer Vision [from Stanford]
Deep Learning for NLP [from Stanford]