ml-debug/docs/evidence/cs229_ml_advice.md at a602ea5a0e7bf30154b69ec4c9cd5b2b2a6d81e0

mirror of https://github.com/wassname/ml-debug.git synced 2026-06-27 18:24:28 +08:00

wassname 4393cceefd initial: ML debugging folklore skill

Deep research to uplift LLMs for ML debugging, opinionated by source
selection. Distilled from Schulman, Jones, Rahtz, Goodfellow, CS231n,
FSDL, and more. Includes runnable diagnostic scripts and LLM-specific
anti-patterns.

Author: wassname (https://github.com/wassname)

2026-03-06 10:11:30 +08:00

Source: https://cs229.stanford.edu/materials/ML-advice.pdf

Title: CS229 - Advice for Applying Machine Learning (Andrew Ng)

Fetched-via: bash -c 'uvx "markitdown[pdf]" https://cs229.stanford.edu/materials/ML-advice.pdf'

Fetch-status: verbatim

Advice for applying Machine Learning

Andrew Ng

Stanford University

Andrew Y. Ng

Today’s Lecture

• Advice on how to get learning algorithms to work in different applications.

• Most of today’s material is not very mathematical. But it’s also some of the hardest material in this class to understand.

• Some of what I’ll say today is debatable.

• Some of what I’ll say is not good advice for doing novel machine learning research.

• Key ideas:

– Diagnostics for debugging learning algorithms.
– Error analyses and ablative analysis.
– How to get started on a machine learning problem.
  – Premature (statistical) optimization.

Debugging Learning Algorithms

Debugging learning algorithms

Motivating example:

• Anti-spam. You carefully choose a small set of 100 words to use as features. (Instead of using all 50000+ words in English.)

• Bayesian logistic regression, implemented with gradient descent, gets 20% test error, which is unacceptably high.

• What to do next?

Fixing the learning algorithm

• Bayesian logistic regression:
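
The objective shown on the slide did not survive PDF extraction. A plausible reading, assuming the usual MAP formulation with a Gaussian prior on θ (an assumption, not recovered from this text) and consistent with the λ mentioned in the list of fixes, would be:

$$
\max_{\theta} \; \sum_{i=1}^{m} \log p\left(y^{(i)} \mid x^{(i)}, \theta\right) - \lambda \lVert \theta \rVert^{2}
$$

Here m (the number of training examples) is notation introduced for this sketch, and λ sets the strength of the regularization.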

• Common approach: Try improving the algorithm in different ways.

– Try getting more training examples.
– Try a smaller set of features.
– Try a larger set of features.
– Try changing the features: Email header vs. email body features.
– Run gradient descent for more iterations.
– Try Newton’s method.
– Use a different value for λ.
– Try using an SVM.

• This approach might work, but it’s very time-consuming, and largely a matter of luck whether you end up fixing what the problem really is.

Diagnostic for bias vs. variance

Better approach:

– Run diagnostics to figure out what the problem is.
– Fix whatever the problem is.

Bayesian logistic regression’s test error is 20% (unacceptably high).

Suppose you suspect the problem is either:

– Overfitting (high variance).
– Too few features to classify spam (high bias).

Diagnostic:

– Variance: Training error will be much lower than test error.
– Bias: Training error will also be high.
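
The diagnostic above can be sketched as a small check on two measured error rates. This is a minimal illustration, not code from the lecture: the function name `diagnose`, the `target_error` you consider acceptable, and the `gap_tolerance` that decides what counts as "much lower" are all hypothetical choices.

```python
def diagnose(train_error, test_error, target_error, gap_tolerance=0.05):
    """Heuristic bias/variance read-out from training and test error rates.

    target_error and gap_tolerance are hypothetical thresholds chosen for
    illustration; the lecture does not give numeric cut-offs.
    """
    findings = []
    # Bias: the model cannot even fit the training set to the desired level.
    if train_error > target_error:
        findings.append("high bias")
    # Variance: training error is much lower than test error.
    if test_error - train_error > gap_tolerance:
        findings.append("high variance")
    return findings or ["no bias/variance problem detected"]


if __name__ == "__main__":
    # Test error is 20% as in the spam example; the training errors are made up.
    print(diagnose(train_error=0.02, test_error=0.20, target_error=0.05))
    print(diagnose(train_error=0.18, test_error=0.20, target_error=0.05))
```

With a 20% test error, a 2% training error points to variance, while an 18% training error points to bias. Both conditions can hold at once, which is why the function returns a list rather than a single label.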
