How to Learn Machine Learning
Table of Contents
- Background
- Introduction
- Mathematics
- Programming
- Machine Learning Fundamentals
- Machine Learning Subfields
- Neural Networks - Fundamentals
- Neural Networks - LLMS
- Interpretability
- Alignment
- Data Centric Machine Learning
- Scalable Machine Learning + ML Systems
- Deep Generative Modeling
- Search Engines, Information Retrieval, Data Mining
- Image Analysis and Computer Vision
- Human-AI Interaction - Visualization, Creative Coding, Audio ML
- Engineering - Machine Learning, Computer Science
- answerAI
Background
This page is a list of everything that I have found to learn machine learning in the llm chatgpt era. I used a lot of these references during my undergrad and I discovered new resources and courses that helped me study for some of the challenging courses during my masters program. Kylie Ying, Chip Huyen, Josh Starmer were a huge inspiration for this page because of how they organized the content on their websites and youtube channels. I recommend following the materials starting from the Introduction to Machine Learning Fundamentals in order to understand the material in the Machine Learning Subfields section.
Lastly, I recommend that every person who wants to study machine learning read The Worlds I See: Curiosity, Exploration, and Discovery at the Dawn of AI by Fei-Fei Li.
Introduction
- Chip Huyen A survivor’s guide to Artificial Intelligence
- Chip Huyen ML Ops Guide
- What is Machine Learning?
- How to learn Machine Learning
Mathematics
Calculus
- Utah MATH 1210 - Differential Calculus
- Utah MATH 1220 - Integral Calculus
- Utah MATH 2210 - Multivariable Calculus
- Intro to Derivatives Calculus for Machine Learning
- 3Blue1Brown Essence of Calculus
Probability and Statistics
Advice
- 5 tips for getting better at statistics
Deep Dives
- Random Variables
- Parameters
- Estimators
- Explaining Probability Distributions
- Parametric Families
- Probability Must-Knows for Machine Learning-Math for ML Part 1
- Statistics: Sampling and Describing Data-Math for ML Part 2
- Random Variables and Normal Distributions-Math for Machine Learning Part 3
- The better way to do statistics - Introduction to Bayesian Statistics
- 3Blue1Brown Bayes Theorem
- 3Blue1Brown Probability of Probabilities
- 3Blue1Brown Central Limit Theorem and the Gaussian Distribution
Probability and Statistics for Machine Learning
- Stanford CS 109: Probability and Statistics for Computer Scientists
- Stanford CS 109: Probability and Statistics for Computer Scientists - Lectures
- Steven Brunton - Probability Bootcamp
- Steven Brunton - Introduction to Statistics and Data Analysis
Bayesian Statistics
At KTH, one of the required ML courses is DD2434-Advanced Machine Learning which focuses solely on variational inference, one of the most mind-bending concepts in machine learning at the heart of today’s generative modeling. Variational Inference assumes a background in Bayesian Statistics which is usually taught as an advanced graduate statistics class. I came across this series by Ben Lambert after the class which attempts to teach this statistics assuming no statistics background and had I known about it during the class, it would have helped me better understand some of David Blei’s papers which we had to read and implement in this class. I highly recommend this series to understand some of the techniques and math behind variational autoencoders, KL Divergence, and Bayesian Deep Learning.
- A student’s guide to Bayesian Statistics
Linear Algebra
Mathematics of Machine Learning, Data Science and Big Data
Programming
- Python Data Science Handbook
- Python for Data Analysis
- Machine Learning with PyTorch and Scikit-Learn - follow this book if you want to learn PyTorch
- Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow - follow this book if you want to learn TensorFlow which is still used heavily for production machine learning
Machine Learning Fundamentals
Statistical Learning
- Statistical Learning With Python
Machine Learning
- KTH DD1420 - Foundations of Machine Learning
- University of Utah CS 6350 - Machine Learning
Probabilistic Machine Learning
Probabilistic Machine Learning is probably the most mind-bending machine learning out there. It’s so mind-bending that almost all teaching and reference materials reference Michael I. Jordan and David Blei who not only pioneered applying bayesian statistics to this area but the applications in healthcare, economics, topic modeling, machine learning interpretability and more. Since the area is very broad, programs emphasize different methods: Markov Chain Monte Carlo (MCMC), Metropolis Hastings (MH), Gibbs, Variational Inference. The KTH DD2434 focuses exclusively on variational inference because it underpins the variational-autoencoders (VAE) technique. The courses listed here focus on a mixture of all the methods but for most people who want to understand VAE, I would just recommend focusing on variational inference.
- Probabilistic Machine Learning
- University of Utah CS 6190 - Probabilistic Modeling
- Stanford CS 228 - Probabilistic Graphical Models Notes
- Probabilistic Programming and Bayesian Methods for Hackers
Natural Language Processing and Information Retrieval
- Stanford CS 124 - From Languages to Information
- Stanford CS 124 - From Languages to Information - Lectures
- Stanford CS 124 - From Languages to Information - Course GitHub
Machine Learning Subfields
Neural Networks - Fundamentals
- Neural Networks Zero to to Hero - Lectures 1-6
- 3Blue1Brown Neural Networks Lectures 1-4
Neural Networks - LLMS
- Grant Sanderson Visualizing transformers and attention
- Andrej Karpathy 1 Hour Intro to Large Language Models
- Andrej Karpathy Deep Dive Into LLMs Like ChatGPT
- Neural Networks Zero to to Hero - Lectures 7, 9-10
- 3Blue1Brown Neural Networks - Lectures 5-8
Interpretability
- Utah CS 6966 - Local Explanations for Deep Learning Models
- Neural Networks are Weird! Neel Nanda
- Mechanistic Interpretability - Neel Nanda
- Neel Nanda Mechanistic Interpretability Materials
Alignment
Data Centric Machine Learning
Scalable Machine Learning + ML Systems
- Made With ML
- KTH ID 2221 - Data Intensive Computing Platforms
- KTH ID 2223 - Scalable Machine Learning and Deep Learning
- Serverless Machine Learning & Principles of MLOps
Deep Generative Modeling
- Stanford CS 236 - Deep Generative Models
- Stanford CS 236 - Deep Generative Models Lectures
- Stanford CS 236 - Deep Generative Model Models Notes
Search Engines, Information Retrieval, Data Mining
- CS 224N Natural Language Processing With Deep Learning
- CS 224N Natural Language Processing With Deep Learning - Lectures
- Utah CS 6340 - Natural Language Processing
- Utah CS 6140 - Data Mining Spring 2023
- Utah CS 6140 - Data Mining
Image Analysis and Computer Vision
Human-AI Interaction - Visualization, Creative Coding, Audio ML
Visualization
- MIT 6/11.C85 - Interactive Data Visualization & Society
- Svelte + D3
Creative Coding
- Machine Learning for the Web
- Machine Learning for Physical Computing
- Harvard CS 73
- UChicago Creative Coding
- Golan Levin Experimental Capture Class
Audio ML
- Audio Signal Processing for Machine Learning
Engineering - Machine Learning, Computer Science
- MIT - The Missing Semester of Your CS Education
- Coding Challenges
- Advent of Code
- Utah MSD Git
- Chip Huyen AIE Book Reference
answerAI
answerAI was founded by Jeremy Howard and Eric Rees in 2023 as a new R&D lab focusing on fundamental research and the development of practical applications based on research breakthroughs. answerAI is the successor to fastAI which was founded by Jeremy Howard and Rachel Thomas. fastAI left a legacy with its machine learning research and highly regarded state of the art machine learning courses that aimed to make deep learning accessible to everyone regardless of their math background which was revolutionary at the time the courses were released. Today fastai’s materials are still used as an entry to the fields of deep learning and generative AI and at the University of San Francisco MS Data Science Program. Jeremy and his team still produce educational videos from time to time on youtube and continue to teach and innovate.
Materials
- Book
- Kaggle Notebooks
- PyTorch Image Models
- FastAI Library
fastAI
- Lesson 0: How to fastAI
- Part 1: Deep Learning for Coders
- Part 2: From Deep Learning to Stable Diffusion
- Computational Linear Algebra
- Code-First Intro to NLP
answerAI
- A Hacker’s Guide to Language Models
- Getting Started with CUDA for Python Progammers
- Going Further with CUDA for Python Progammers
- GPU Programming in the Browser