Study Path Agent Study Path Agent
Generate Your Own
Machine Learning
75 topics across 7 chapters
Chapter 1
Prerequisites: Math, Programming, and Data
1
Linear Algebra Essentials for ML
2
Calculus Essentials for Optimization
3
Probability Fundamentals
4
Statistics for Inference and Estimation
5
Python for Data Science (NumPy, Pandas)
6
Data Visualization and Communication (Matplotlib/Seaborn)
7
Data Cleaning and Feature Basics
Chapter 2
Core ML Concepts and Workflow
8
Problem Framing: Inputs, Outputs, and Objectives
9
Training vs. Validation vs. Test Splits
10
Loss Functions and Empirical Risk Minimization
11
Optimization Basics: Gradient Descent and Variants
12
Overfitting, Underfitting, Bias–Variance
13
Data Leakage and Causality Pitfalls (Intro)
Chapter 3
Supervised Learning
14
Linear Regression and Regularization (Ridge/Lasso)
15
Logistic Regression and Linear Classifiers
16
k-Nearest Neighbors (kNN)
17
Decision Trees and Random Forests
18
Gradient Boosting (XGBoost/LightGBM/CatBoost)
19
Support Vector Machines (SVMs)
20
Feature Engineering for Tabular Data
Chapter 4
Unsupervised Learning
21
Clustering: k-Means and Hierarchical Clustering
22
Dimensionality Reduction: PCA and t-SNE/UMAP (Concepts)
23
Density Estimation and Anomaly Detection (Intro)
24
Topic Modeling (LDA Basics)
25
Association Rules (Apriori Basics)
26
Recommender Systems (Collaborative Filtering Intro)
Chapter 5
Deep Learning
27
Neural Network Foundations (Perceptron → MLP)
28
Backpropagation and Automatic Differentiation
29
Training Deep Nets: Initialization, Optimizers, Regularization
30
Convolutional Neural Networks (CNNs) for Vision
31
Sequence Models: RNN/LSTM/GRU Basics
32
Transformers (Attention Basics)
Chapter 6
Model Evaluation, Tuning, and MLOps
33
Evaluation Metrics for Regression and Classification
34
Cross-Validation and Robust Model Selection
35
Hyperparameter Tuning (Grid/Random/Bayesian)
36
Calibration, Thresholding, and Decision Costs
37
Experiment Tracking and Reproducibility
38
Deployment Basics: Batch, Real-time, and Edge
39
Monitoring and Drift Detection (Data/Concept Drift)
Chapter 7
Applied ML: Projects, Domains, and Ethics
40
End-to-End Project: Tabular Prediction (Baseline → Production-ish)
3 subtopics
41
Build a clean training pipeline with scikit-learn Pipelines
42
Create a strong baseline and compare to boosting models
43
Package the model + preprocessing for consistent inference
44
Natural Language Processing (Applied)
3 subtopics
45
Text preprocessing and vectorization (TF-IDF, n-grams)
46
Fine-tune a transformer for text classification
47
Evaluate NLP systems (F1, calibration, error analysis)
48
Computer Vision (Applied)
3 subtopics
49
Image data pipelines and augmentation
50
Transfer learning with pretrained CNNs
51
Evaluate vision models (accuracy, confusion, robustness checks)
52
Time Series and Forecasting (Applied)
3 subtopics
53
Train/validation splits for temporal data (leakage-safe)
54
Classical forecasting baselines (ARIMA/ETS)
55
Feature-based ML for forecasting (lags, rolling stats)
56
Interpretability and Debugging Models
3 subtopics
57
Permutation importance and partial dependence (PDP/ICE)
58
SHAP basics for tabular models
59
Systematic error analysis (slices, counterfactual tests)
60
Fairness, Privacy, and Responsible ML
3 subtopics
61
Fairness metrics and trade-offs (group parity basics)
62
Privacy basics (PII handling, differential privacy intro)
63
Model cards, documentation, and risk assessment
64
Capstone: Portfolio, Write-ups, and Interviews
3 subtopics
65
Write project reports with problem framing, metrics, and lessons learned
66
Create a portfolio (GitHub repos, demos, reproducible notebooks)
67
Interview prep: ML system design + modeling questions