How to Learn Machine Learning

Background
Introduction
Mathematics
Programming
Machine Learning Fundamentals
Machine Learning Subfields
Engineering - Machine Learning, Computer Science
answerAI

Background

This page is a list of everything that I have found to learn machine learning in the llm chatgpt era. I used a lot of these references during my undergrad and I discovered new resources and courses that helped me study for some of the challenging courses during my masters program. Kylie Ying, Chip Huyen, Josh Starmer were a huge inspiration for this page because of how they organized the content on their websites and youtube channels. I recommend following the materials starting from the Introduction to Machine Learning Fundamentals in order to understand the material in the Machine Learning Subfields section.

Lastly, I recommend that every person who wants to study machine learning read The Worlds I See: Curiosity, Exploration, and Discovery at the Dawn of AI by Fei-Fei Li.

Introduction

Mathematics

Calculus

Probability and Statistics

Advice

Probability and Statistics for Machine Learning

Stanford CS 109: Probability and Statistics for Computer Scientists
Stanford CS 109: Probability and Statistics for Computer Scientists - Lectures
Steven Brunton - Probability Bootcamp
Steven Brunton - Introduction to Statistics and Data Analysis
Bayesian Statistics

At KTH, one of the required ML courses is DD2434-Advanced Machine Learning which focuses solely on variational inference, one of the most mind-bending concepts in machine learning at the heart of today’s generative modeling. Variational Inference assumes a background in Bayesian Statistics which is usually taught as an advanced graduate statistics class. I came across this series by Ben Lambert after the class which attempts to teach this statistics assuming no statistics background and had I known about it during the class, it would have helped me better understand some of David Blei’s papers which we had to read and implement in this class. I highly recommend this series to understand some of the techniques and math behind variational autoencoders, KL Divergence, and Bayesian Deep Learning.
A student’s guide to Bayesian Statistics

Linear Algebra

Mathematics of Machine Learning, Data Science and Big Data

Programming

Python Data Science Handbook
Python for Data Analysis
Machine Learning with PyTorch and Scikit-Learn - follow this book if you want to learn PyTorch
Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow - follow this book if you want to learn TensorFlow which is still used heavily for production machine learning

Machine Learning Fundamentals

Statistical Learning

Statistical Learning With Python
Machine Learning
KTH DD1420 - Foundations of Machine Learning
University of Utah CS 6350 - Machine Learning
Probabilistic Machine Learning

Probabilistic Machine Learning is probably the most mind-bending machine learning out there. It’s so mind-bending that almost all teaching and reference materials reference Michael I. Jordan and David Blei who not only pioneered applying bayesian statistics to this area but the applications in healthcare, economics, topic modeling, machine learning interpretability and more. Since the area is very broad, programs emphasize different methods: Markov Chain Monte Carlo (MCMC), Metropolis Hastings (MH), Gibbs, Variational Inference. The KTH DD2434 focuses exclusively on variational inference because it underpins the variational-autoencoders (VAE) technique. The courses listed here focus on a mixture of all the methods but for most people who want to understand VAE, I would just recommend focusing on variational inference.
Probabilistic Machine Learning
University of Utah CS 6190 - Probabilistic Modeling
Stanford CS 228 - Probabilistic Graphical Models Notes
Probabilistic Programming and Bayesian Methods for Hackers
Natural Language Processing and Information Retrieval
Stanford CS 124 - From Languages to Information
Stanford CS 124 - From Languages to Information - Lectures
Stanford CS 124 - From Languages to Information - Course GitHub

Machine Learning Subfields

Neural Networks - Fundamentals

Interpretability

Alignment

ARENA

Data Centric Machine Learning

MIT Data Centric AI

Scalable Machine Learning + ML Systems

Deep Generative Modeling

Search Engines, Information Retrieval, Data Mining

Image Analysis and Computer Vision

Human-AI Interaction - Visualization, Creative Coding, Audio ML

Visualization

Engineering - Machine Learning, Computer Science

answerAI

answerAI was founded by Jeremy Howard and Eric Rees in 2023 as a new R&D lab focusing on fundamental research and the development of practical applications based on research breakthroughs. answerAI is the successor to fastAI which was founded by Jeremy Howard and Rachel Thomas. fastAI left a legacy with its machine learning research and highly regarded state of the art machine learning courses that aimed to make deep learning accessible to everyone regardless of their math background which was revolutionary at the time the courses were released. Today fastai’s materials are still used as an entry to the fields of deep learning and generative AI and at the University of San Francisco MS Data Science Program. Jeremy and his team still produce educational videos from time to time on youtube and continue to teach and innovate.

How to Learn Machine Learning

Table of Contents

Background

Introduction

Mathematics

Calculus

Probability and Statistics

Advice

Deep Dives

Probability and Statistics for Machine Learning

Bayesian Statistics

Linear Algebra

Mathematics of Machine Learning, Data Science and Big Data

Programming

Machine Learning Fundamentals

Statistical Learning

Machine Learning

Probabilistic Machine Learning

Natural Language Processing and Information Retrieval

Machine Learning Subfields

Neural Networks - Fundamentals

Neural Networks - LLMS

Interpretability

Alignment

Data Centric Machine Learning

Scalable Machine Learning + ML Systems

Deep Generative Modeling

Search Engines, Information Retrieval, Data Mining

Image Analysis and Computer Vision

Human-AI Interaction - Visualization, Creative Coding, Audio ML

Visualization

Creative Coding

Audio ML

Engineering - Machine Learning, Computer Science

answerAI

Materials

fastAI

answerAI