mirror of
https://github.com/wassname/ml-debug.git
synced 2026-06-27 17:49:08 +08:00
Update loss_surface.md
This commit is contained in:
committed by
GitHub
parent
6e9a3ca633
commit
1ad74e14c6
@@ -68,3 +68,5 @@ When your loss is a product of factors A*B and one factor can be near zero:
|
|||||||
```
|
```
|
||||||
|
|
||||||
General principle: if you want gradient to flow independently through two factors, decompose multiplicatively in log space.
|
General principle: if you want gradient to flow independently through two factors, decompose multiplicatively in log space.
|
||||||
|
|
||||||
|
You can also design surrogate losses that are better behaved but move in the right direction in a better behaved well.
|
||||||
|
|||||||
Reference in New Issue
Block a user