Back to Courses

Introduction to Data Analytics

Data Analytics is the science of analyzing data to convert information to useful knowledge. This knowledge could help us understand our world better, and in many contexts enable us to make better decisions. While this is the broad and grand objective, the last 20 years has seen steeply decreasing costs to gather, store, and process data, creating an even stronger motivation for the use of empirical approaches to problem solving. This course seeks to present you with a wide range of data analytic techniques and is structured around the broad contours of the different types of data analytics, namely, descriptive, inferential, predictive, and prescriptive analytics.

This course is offered jointly by Prof. Nandan Sudarsanam. The tables below enlists the courses materials for Week 1 to Week 9. Each topic has both YouTube link and VideoKen link.

Week 1 - Course Overview & Descriptive Statistics

  • Course overview - 1
  • Course overview - 2
  • Descriptive statistics - graphical approaches
  • Descriptive statistics - measures of central tendency
  • Descriptive statistics - measures of dispersion
  •    Assignment 1
  •    Solution 1
  • Week 2 - Probability Distributions & Inferential Statistics

  • Random variables & probability distributions
  • Probability distributions - 2
  • Probability distributions - 3
  • Inferential statistics - motivation
  • Inferential statistics - single sample tests
  •    Assignment 2
  •    Solution 2
  • Week 3 - Inferential Statistics

  • Two sample tests
  • Type I & type II errors
  • Confidence intervals
  • ANOVA and test of independence
  • Short introduction to regression
  •    Assignment 3
  •    Solution 3
  • Week 4 - Machine Learning

  • Introduction to machine learning
  • Supervised learning
  • Unsupervised learning
  • Ordinary least squares regression
  • Simple & multiple regression in Excel & Matlab
  • Regularisation/coefficient shrinkage
  • Data modelling & algorithmic modelling approaches
  •    Assignment 4
  •    Assignment 4 data
  •    Solution 4
  • Week 5 - Supervised Learning (Regression & Classification Techniques) I

  • Logistic regression
  • Training a logistic regression classifier
  • Classification & regression trees - 1
  • Classification & regression trees - 2
  • Bias variance dichotomy
  • Model assessment and selection
  • Support vector machines - 1
  • Support vector machines - 2
  • SVMs for non-linearly separable data
  • SVMs & kernel transformations
  •    Assignment 5
  •    Assignment 5 data
  •    Solution 5
  • Week 6 - Supervised Learning (Regression & Classification Techniques) II

  • Ensemble methods & random forests
  • Artificial neural networks - 1
  • Artificial neural networks - 2
  • Deep learning
  •    Assignment 6
  •    Solution 6
  • Week 7 Association Rule Mining & Big Data

  • Association rule mining - 1
  • Association rule mining - 2
  • Big data - a small introduction - 1
  • Big data - a small introduction - 2
  •    Assignment 7
  •    Solution 7
  • Week 8 - Clustering Analysis & Predictive Analytics

  • Clustering analysis - 1
  • Clustering analysis - 2
  • Introduction to experimentation & active learning - 1
  • Introduction to experimentation & active learning - 2
  • Introduction to online learning - reinforcement learning - 1
  • Introduction to online learning - reinforcement learning - 2
  •    Assignment 8
  •    Solution 8
  • Week 9 - Course Summary

    Course summary & insights into the final exam