Study Path Agent
Copy link
X / Twitter
Facebook
LinkedIn
WhatsApp
Generate Your Own
Machine Learning
128 topics across 7 chapters
Chapter 1
Math foundations for ML
1
Linear algebra essentials
2 subtopics
2
Vectors, matrices, and basic operations (practice exercises)
3
Eigenvalues/eigenvectors + SVD intuition (worked problems)
4
Calculus for optimization
2 subtopics
5
Derivatives, gradients, Jacobians (compute by hand)
6
Chain rule & backprop intuition (manual derivations)
7
Probability & statistics
3 subtopics
8
Random variables, expectations, variance (problem set)
9
Common distributions + sampling (simulate in Python)
10
Bayes rule, MLE/MAP, bias-variance (short derivations)
11
Information theory (useful core ideas)
2 subtopics
12
Entropy, cross-entropy, KL divergence (compute examples)
13
Why log-loss works (binary + multiclass toy examples)
14
Numerical methods & stability
2 subtopics
15
Floating point, overflow/underflow, normalization tricks (mini-lab)
16
Gradient descent behavior: step sizes & conditioning (experiments)
Chapter 2
Programming & data tooling
17
Python for ML (core idioms)
2 subtopics
18
Functions, classes, typing, packaging basics (small exercises)
19
Performance basics: vectorization, profiling (micro-benchmarks)
20
NumPy, pandas, and data manipulation
2 subtopics
21
Indexing, broadcasting, reshape (NumPy drills)
22
Joins, groupby, time series basics (pandas mini-projects)
23
Data cleaning & feature engineering basics
2 subtopics
24
Missing data, leakage, outliers (checklist + practice)
25
Scaling, encoding, text basics (build sklearn pipelines)
26
Visualization & exploratory data analysis (EDA)
2 subtopics
27
Distributions, correlation pitfalls, target leakage detection (EDA lab)
28
Plotting with matplotlib/seaborn/plotly (recreate 10 plots)
29
Reproducibility & experiment tracking
2 subtopics
30
Random seeds, determinism, environment capture (practice)
31
Train/val/test splits, CV setup, logging runs (template project)
Chapter 3
Supervised learning algorithms
32
Problem framing: regression vs classification
2 subtopics
33
Define targets, loss, metrics; baseline models (worksheet)
34
Imbalanced classes and thresholding (hands-on notebook)
35
Linear models
4 subtopics
36
Ordinary least squares + diagnostics (implement from scratch)
37
Logistic regression + calibration (implement + evaluate)
38
Regularization: L1/L2/elastic net (experiments)
39
Feature interactions & polynomial features (compare models)
40
Tree-based models
3 subtopics
41
Decision trees: splitting criteria, pruning (from-scratch toy)
42
Random forests: bagging, OOB error, feature importance (lab)
43
Gradient boosting (XGBoost/LightGBM/CatBoost) model tuning (lab)
44
Kernel methods & SVMs
2 subtopics
45
Margins, hinge loss, soft margin (derive + visualize)
46
Kernel trick: RBF vs polynomial; scaling effects (experiments)
47
Model evaluation & selection
4 subtopics
↗
Train/val/test splits, CV setup, logging runs (template project)
(see Chapter 2)
48
Metrics by task: RMSE/MAE, ROC-AUC/PR-AUC, F1 (cheat-sheet + code)
49
Cross-validation patterns (K-fold, stratified, time series) (lab)
50
Hyperparameter search: random, Bayesian, early stopping (apply)
51
Interpretability & debugging
3 subtopics
52
Permutation importance, PDP/ICE (interpret a tabular model)
53
SHAP basics (explain predictions; sanity checks)
54
Error analysis: slice metrics, confusion matrix deep-dive (template)
Chapter 4
Unsupervised learning
55
Clustering
3 subtopics
56
K-means: objective, init, scaling; evaluate with silhouette (lab)
57
DBSCAN/HDBSCAN: density intuition; parameter sweeps (lab)
58
Gaussian Mixture Models + EM (fit & compare to k-means)
59
Dimensionality reduction
3 subtopics
60
PCA: variance, whitening, reconstruction error (implement + apply)
61
t-SNE/UMAP: what plots mean (and don’t) (guided experiments)
62
Autoencoders for embeddings (train a small one)
63
Anomaly detection
2 subtopics
64
Isolation Forest + one-class SVM (compare on a dataset)
65
Evaluation without labels + alert thresholds (playbook)
66
Recommendation basics (unsupervised & implicit feedback)
2 subtopics
67
Similarity search: cosine, ANN indexes (FAISS-like) (lab)
68
Matrix factorization for implicit data (implement ALS/BPR toy)
Chapter 5
Deep learning
69
Neural network basics
4 subtopics
↗
Chain rule & backprop intuition (manual derivations)
(see Chapter 1)
70
Activations, initialization, normalization (ablation experiments)
71
Optimization: SGD, momentum, Adam; LR schedules (training runs)
72
Regularization: dropout, weight decay, augmentation (lab)
73
Computer vision
3 subtopics
74
CNN building blocks (conv/pool) + receptive field (exercises)
75
Transfer learning with pretrained CNNs (fine-tune a classifier)
76
Detection/segmentation overview (YOLO/U-Net) (implement a small demo)
77
Natural language processing (NLP)
3 subtopics
78
Tokenization + embeddings (train/compare)
79
Transformers: attention, positional encoding (trace shapes in code)
80
Fine-tuning a small transformer (classification or NER) (lab)
81
Generative models (core ideas)
3 subtopics
82
Autoregressive modeling + sampling strategies (top-k/top-p) (lab)
83
Diffusion models: noise schedule intuition (guided notebook)
84
VAEs vs GANs: what each optimizes (compare on MNIST/CIFAR)
85
Deep learning engineering & scaling
3 subtopics
86
GPU basics + mixed precision + memory debugging (practical checklist)
87
Data loaders, batching, throughput tuning (profile and improve)
88
Distributed training overview (DDP) + gradient accumulation (demo)
Chapter 6
ML systems & MLOps
89
Data pipelines & feature stores
2 subtopics
90
Batch vs streaming; backfills; idempotency (design exercises)
91
Feature store concepts: training/serving parity (case study)
92
Model serving
3 subtopics
93
Batch scoring vs online inference (choose architecture)
94
Latency, throughput, caching; vector search serving (mini-design)
95
Packaging models: APIs, Docker, versioning (ship a demo service)
96
Monitoring & continuous evaluation
3 subtopics
97
Data drift, concept drift, label delay (monitoring plan)
98
Model quality dashboards + alerting thresholds (build a small one)
99
Offline-to-online evaluation + A/B testing basics (checklist)
100
Experiment management & model registry
2 subtopics
101
Track parameters/metrics/artifacts; compare runs (set up tooling)
102
Model registry: stages, approvals, reproducible promotion (workflow)
103
Testing & reliability for ML code
2 subtopics
104
Unit/integration tests for data + models (write 10 tests)
105
CI/CD for ML: training pipelines + deployment gates (design)
106
Security, privacy, and governance
3 subtopics
107
PII handling, minimization, retention (policy checklist)
108
Adversarial ML + prompt injection awareness (threat modeling)
109
Model governance: documentation, audits, model cards (write one)
Chapter 7
Ethics, fairness, and responsible ML
110
Fairness concepts & metrics
2 subtopics
111
Demographic parity, equalized odds, calibration tradeoffs (toy cases)
112
Sampling bias and measurement bias (diagnose in a dataset)
113
Privacy-preserving ML basics
2 subtopics
114
Differential privacy intuition (epsilon experiments)
115
Federated learning overview (design tradeoffs)
116
Robustness & safety
2 subtopics
117
OOD detection & robustness testing (stress test a model)
118
Human-in-the-loop + red teaming for ML systems (process draft)
119
Communication & documentation
2 subtopics
↗
Model governance: documentation, audits, model cards (write one)
(see Chapter 6)
120
Write a model report: goal, data, metrics, risks, next steps (template)