How to Learn Machine Learning
Table of Contents
- Background
- Introduction
- Mathematics
- Engineering
- Machine Learning Fundamentals
- Deep Learning
- Systems - AIML, Distributed, Computer
- Generative Modeling, Deep Generative Modeling and Synthesis
- Visual Computing
- Audio, MIR, DSP
- Data Centric Machine Learning
- Search Engines, Information Retrieval, Data Mining
- answerAI
Background
This page is a list of everything that I have found to learn machine learning in the llm chatgpt era. I used a lot of these references during my undergrad and I discovered new resources and courses that helped me study for some of the challenging courses during my masters program. Kylie Ying, Chip Huyen, Josh Starmer were a huge inspiration for this page because of how they organized the content on their websites and youtube channels. I recommend following the materials starting from the Introduction to Machine Learning Fundamentals in order to understand the material in the Machine Learning Subfields section.
Lastly, I recommend that every person who wants to study machine learning read The Worlds I See: Curiosity, Exploration, and Discovery at the Dawn of AI by Fei-Fei Li.
Introduction
- Chip Huyen A survivor’s guide to Artificial Intelligence
- Chip Huyen ML Ops Guide
- What is Machine Learning?
- How to learn Machine Learning
Mathematics
Calculus
- Utah MATH 1210 - Differential Calculus
- Utah MATH 1220 - Integral Calculus
- Utah MATH 2210 - Multivariable Calculus
- Intro to Derivatives Calculus for Machine Learning
- 3Blue1Brown Essence of Calculus
Probability and Statistics
Advice
- 5 tips for getting better at statistics
Deep Dives
- Random Variables
- Parameters
- Estimators
- Explaining Probability Distributions
- Parametric Families
- Probability Must-Knows for Machine Learning-Math for ML Part 1
- Statistics: Sampling and Describing Data-Math for ML Part 2
- Random Variables and Normal Distributions-Math for Machine Learning Part 3
- The better way to do statistics - Introduction to Bayesian Statistics
- 3Blue1Brown Bayes Theorem
- 3Blue1Brown Probability of Probabilities
- 3Blue1Brown Central Limit Theorem and the Gaussian Distribution
Probability and Statistics for Machine Learning
- Stanford CS 109: Probability and Statistics for Computer Scientists
- Stanford CS 109: Probability and Statistics for Computer Scientists - Lectures
- Steven Brunton - Probability Bootcamp
- Steven Brunton - Introduction to Statistics and Data Analysis
Bayesian Statistics
At KTH, one of the required ML courses is DD2434-Advanced Machine Learning which focuses solely on variational inference, one of the most mind-bending concepts in machine learning at the heart of today’s generative modeling. Variational Inference assumes a background in Bayesian Statistics which is usually taught as an advanced graduate statistics class. I came across this series by Ben Lambert after the class which attempts to teach this statistics assuming no statistics background and had I known about it during the class, it would have helped me better understand some of David Blei’s papers which we had to read and implement in this class. I highly recommend this series to understand some of the techniques and math behind variational autoencoders, KL Divergence, and Bayesian Deep Learning.
- A student’s guide to Bayesian Statistics
Linear Algebra
Data Analysis
- Utah CS 3190 - Foundations of Data Analysis
Bayesian Data Analysis
- Aalto CS-E5710 - Bayesian Data Analysis
Engineering
Programming Languages
- QuantEcon
- Python Data Science Handbook
- Python for Data Analysis
- Machine Learning with PyTorch and Scikit-Learn - follow this book if you want to learn PyTorch
Tools and Practices
- MIT - The Missing Semester of Your CS Education - this course is updating January 2026 to introduce best practices for AIML Engineering. Curtis, Anish, Jonas and company are genius teachers and engineers so this is definitely a must watch
- Utah MSD Git
- Chip Huyen AIE Book Reference
Potpurri
- Coding Challenges
- Advent of Code
Machine Learning Fundamentals
- StatQuest
Statistical Learning
- Statistical Learning With Python
Machine Learning
- Sebastian Raschka Machine Learning
- Cornell Applied Machine Learning
- KTH DD1420 - Foundations of Machine Learning
- University of Utah CS 6350 - Machine Learning
Probabilistic Machine Learning
Probabilistic Machine Learning is probably the most mind-bending machine learning out there. It’s so mind-bending that almost all teaching and reference materials reference Michael I. Jordan and David Blei who not only pioneered applying bayesian statistics to this area but the applications in healthcare, economics, topic modeling, machine learning interpretability and more. Since the area is very broad, programs emphasize different methods: Markov Chain Monte Carlo (MCMC), Metropolis Hastings (MH), Gibbs, Variational Inference. The KTH DD2434 focuses exclusively on variational inference because it underpins the variational-autoencoders (VAE) technique. The courses listed here focus on a mixture of all the methods but for most people who want to understand VAE, I would just recommend focusing on variational inference.
- Probabilistic Machine Learning
- University of Utah CS 6190 - Probabilistic Modeling
- Stanford CS 228 - Probabilistic Graphical Models Notes
- Probabilistic Programming and Bayesian Methods for Hackers
Natural Language Processing and Information Retrieval
- Stanford CS 124 - From Languages to Information
- Stanford CS 124 - From Languages to Information - Lectures
- Stanford CS 124 - From Languages to Information - Course GitHub
Deep Learning
Fundamentals
- Sebastian Raschka Deep Learning
- Neural Networks Zero to to Hero - Lectures 1-6
- 3Blue1Brown Neural Networks Lectures 1-4
LLMS
- Grant Sanderson Visualizing transformers and attention
- Andrej Karpathy 1 Hour Intro to Large Language Models
- Andrej Karpathy Deep Dive Into LLMs Like ChatGPT
- Neural Networks Zero to to Hero - Lectures 7, 9-10
- 3Blue1Brown Neural Networks - Lectures 5-8
NLP
- CS 224N Natural Language Processing With Deep Learning
- CS 224N Natural Language Processing With Deep Learning - Lectures
- Utah CS 6340 - Natural Language Processing
Systems - AIML, Distributed, Computer
- Made With ML
- KTH ID2221 - Data Intensive Computing Platforms
- KTH ID2223 - Scalable Machine Learning and Deep Learning
- Utah CS 4400 - Computer Systems
- MIT 6.584 Distributed Systems
Generative Modeling, Deep Generative Modeling and Synthesis
- Cornell - Deep Generative Models
- Stanford CS 236 - Deep Generative Models
- Stanford CS 236 - Deep Generative Models Lectures
- Stanford CS 236 - Deep Generative Model Models Notes
Visual Computing
Computer Vision + Image Analysis
- Berkeley INFO290T
- Stanford CS 231N
- Stanford CS 231N 2025 - Lectures
Visualization
- MIT 6/11.C85 - Interactive Data Visualization & Society
- Svelte + D3
Creative Coding
- Machine Learning for the Web
- Machine Learning for Physical Computing
- Harvard CS 73
- UChicago Creative Coding
- Golan Levin Experimental Capture Class
Audio, MIR, DSP
- Audio Signal Processing for Machine Learning
- Audio Data Augmentation
- PyTorch for Audio and Music Processing
- Deep Learning For Audio with Python
- Generating Sound with Neural Networks
- Melody Generation with RNN-LSTM
Data Centric Machine Learning
Search Engines, Information Retrieval, Data Mining
answerAI
answerAI was founded by Jeremy Howard and Eric Rees in 2023 as a new R&D lab focusing on fundamental research and the development of practical applications based on research breakthroughs. answerAI is the successor to fastAI which was founded by Jeremy Howard and Rachel Thomas. fastAI left a legacy with its machine learning research and highly regarded state of the art machine learning courses that aimed to make deep learning accessible to everyone regardless of their math background which was revolutionary at the time the courses were released. Today fastai’s materials are still used as an entry to the fields of deep learning and generative AI and at the University of San Francisco MS Data Science Program. Jeremy and his team still produce educational videos from time to time on youtube and continue to teach and innovate.
Materials
- Book
- Kaggle Notebooks
- PyTorch Image Models
- FastAI Library
fastAI
- Lesson 0: How to fastAI
- Part 1: Deep Learning for Coders
- Part 2: From Deep Learning to Stable Diffusion
- Computational Linear Algebra
- Code-First Intro to NLP
answerAI
- Let’s Build the GPT Tokenizer: A Complete Guide to Tokenization in LLMs
- A Hacker’s Guide to Language Models
- Getting Started with CUDA for Python Progammers
- Going Further with CUDA for Python Progammers
- GPU Programming in the Browser