Study Path Agent Study Path Agent
Generate Your Own
Machine Learning
130 topics across 6 chapters
Chapter 1
Math Foundations for ML
1
Linear Algebra Essentials
3 subtopics
2
Vectors, matrices, and operations (dot, transpose, inverse intuition)
3
Eigenvalues/eigenvectors and SVD intuition (why they matter in ML)
4
Matrix calculus basics (gradients for linear models)
5
Calculus & Optimization Basics
3 subtopics
6
Derivatives, partial derivatives, and the chain rule
7
Gradient descent and learning rate behavior
8
Convexity intuition and common loss functions
9
Probability & Statistics for ML
4 subtopics
10
Random variables and common distributions (Gaussian, Bernoulli, Poisson)
11
Expectation, variance, covariance, correlation
12
Bayes’ rule and conditional probability (posterior intuition)
13
Statistical estimation (MLE/MAP) and confidence vs. credible intervals
14
Numerical Computing Skills
3 subtopics
15
Floating point, stability, and conditioning (why training can diverge)
16
Vectorization mindset (broadcasting and avoiding Python loops)
17
Automatic differentiation basics (what backprop is computing)
Chapter 2
Core ML Concepts & Workflow
18
Problem framing and success metrics
3 subtopics
19
Choose task type (classification, regression, ranking, forecasting)
20
Define baseline and target metric (accuracy, F1, AUROC, RMSE, etc.)
21
Set constraints (latency, memory, interpretability, cost)
22
Data splitting and evaluation hygiene
3 subtopics
23
Train/validation/test splits and leakage patterns
24
Cross-validation (when to use it and pitfalls)
25
Calibration and thresholding for decision-making
26
Generalization, bias–variance, and regularization
3 subtopics
27
Underfitting vs overfitting (diagnosis with learning curves)
28
Regularization methods (L1/L2, early stopping, dropout intuition)
29
Hyperparameter tuning basics (search spaces and budgets)
30
Feature engineering and preprocessing
4 subtopics
31
Scaling/normalization and handling missing values
32
Encoding categorical variables (one-hot, target, embeddings overview)
33
Text/vector representations overview (bag-of-words to embeddings)
34
Feature selection and dimensionality reduction basics
35
Metrics and error analysis
3 subtopics
36
Confusion matrix, precision/recall, ROC/PR curves
37
Residual analysis for regression and heteroscedasticity clues
38
Slice-based evaluation (subgroups, rare cases, long tail)
39
Reproducibility and experiment tracking
2 subtopics
40
Random seeds, deterministic ops, and versioning data/code
41
Run logging (configs, metrics) and comparing experiments
Chapter 3
Supervised Learning
42
Linear Models
3 subtopics
43
Linear regression (loss, closed form vs GD)
44
Logistic regression (decision boundary, calibration)
45
Regularized models (Ridge/Lasso/Elastic Net)
46
Tree-Based Methods
3 subtopics
47
Decision trees (splits, impurity, overfitting controls)
48
Random forests (bagging, OOB error, feature importance caveats)
49
Gradient boosting (XGBoost/LightGBM/CatBoost concepts)
50
Kernel Methods and Margin-Based Models
2 subtopics
51
SVM basics (margin, kernels, C and gamma intuition)
52
Kernel ridge regression and Gaussian processes (high-level)
53
Nearest Neighbors and Similarity
2 subtopics
54
kNN (distance metrics, scaling sensitivity)
55
Metric learning intuition (why embeddings help)
56
Model Interpretation for Supervised Learning
3 subtopics
57
Permutation importance and partial dependence (when they mislead)
58
SHAP/LIME overview and practical pitfalls
59
Interpreting linear/logistic models (coefficients, odds ratios)
Chapter 4
Unsupervised & Self-Supervised Learning
60
Clustering
3 subtopics
61
k-means (initialization, scaling, choosing k)
62
Hierarchical clustering and dendrogram interpretation
63
Density-based methods (DBSCAN/HDBSCAN intuition)
64
Dimensionality Reduction
3 subtopics
65
PCA (variance, SVD connection)
66
t-SNE/UMAP (visualization-focused caveats)
67
Autoencoders basics (representation learning)
68
Representation Learning (Self-Supervised)
2 subtopics
69
Contrastive learning intuition (positives/negatives, collapse)
70
Pretext tasks and masked prediction (high-level)
71
Generative Modeling Basics
3 subtopics
72
Likelihood-based models (autoregressive overview)
73
VAEs (encoder/decoder, KL term intuition)
74
Diffusion models intuition (noise-to-data idea)
Chapter 5
Deep Learning
75
Neural Network Fundamentals
4 subtopics
76
Perceptron, MLPs, activations (ReLU, sigmoid, GELU)
77
Forward/backprop and computation graphs (conceptual + simple derivation)
78
Initialization, normalization (BatchNorm/LayerNorm) basics
79
Optimizers (SGD+momentum, Adam) and training stability
80
Convolutional Neural Networks (Vision)
3 subtopics
81
Convolutions, padding/stride, receptive fields
82
Modern CNN blocks (residual connections, depthwise separable conv)
83
Data augmentation and transfer learning for vision
84
Sequence Models
3 subtopics
85
RNN/LSTM/GRU intuition (vanishing gradients)
86
Attention mechanism basics
87
Transformers overview (encoder/decoder, positional encoding)
88
Training Practice & Debugging
4 subtopics
89
Overfit a small batch and sanity checks
90
Diagnose gradient issues (vanishing/exploding, clipping)
91
Data pipeline bugs (shapes, labels, normalization mismatches)
92
Compute planning (batch size, mixed precision, throughput)
93
Deep Learning for Text (NLP)
3 subtopics
94
Tokenization (BPE/WordPiece overview) and embeddings
95
Fine-tuning vs prompt-based usage (high-level tradeoffs)
96
Evaluation for NLP (BLEU/ROUGE vs task-specific + human eval)
97
Generative Deep Learning
3 subtopics
VAEs (encoder/decoder, KL term intuition) (see Chapter 4)
Diffusion models intuition (noise-to-data idea) (see Chapter 4)
98
GANs (generator/discriminator game, mode collapse)
Chapter 6
ML Engineering, Deployment, and Responsible AI
99
Data & Pipelines
3 subtopics
100
Dataset creation/labeling strategy and quality checks
101
Data versioning and lineage (schemas, snapshots)
102
Feature stores concept (offline/online consistency)
103
Serving and Deployment
3 subtopics
104
Batch vs online inference; latency/throughput tradeoffs
105
Model packaging and inference optimization (ONNX, quantization overview)
106
CI/CD for ML (testing data, models, and pipelines)
107
Monitoring and Maintenance
3 subtopics
108
Data drift and concept drift (detection signals)
109
Model performance monitoring and alerting
110
Retraining strategies and rollout (shadow, canary, A/B tests)
111
Scalable Training and Systems
3 subtopics
112
GPU basics and data loading bottlenecks
113
Distributed training concepts (DDP, data/model parallel)
114
Checkpointing and fault tolerance
115
Responsible AI and Safety
4 subtopics
116
Fairness basics (bias sources, group metrics)
117
Privacy and security (PII handling, membership inference intuition)
118
Robustness to distribution shift and adversarial examples (overview)
119
Model cards and documentation (communicating limitations)
120
Productization and Stakeholders
3 subtopics
121
Translating business goals to ML objectives and KPIs
122
Human-in-the-loop systems (labeling feedback, review queues)
123
Communicating results (uncertainty, demos, failure modes)