These two papers do not necessarily contradict each other, though my description was perhaps a bit sloppy.

Sagun et al. (and derivative works) look only at the Hessian along the trajectory followed by gradient descent, while Li et al. take a broader look at the loss surface as a whole.
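To make the distinction concrete, here is a rough toy sketch (mine, not code from either paper). The 2-parameter loss, the finite-difference Hessian, and the function names are all hypothetical stand-ins; real analyses work on actual networks and use far more careful machinery. The first view records Hessian eigenvalues only at points gradient descent visits; the second evaluates the loss on a 2D slice around a point, regardless of where the optimizer went.

```python
import numpy as np

def loss(w):
    # Hypothetical non-convex toy loss with minima at w0 = +1 and w0 = -1.
    return (w[0] ** 2 - 1.0) ** 2 + 0.5 * w[1] ** 2

def grad(w, eps=1e-5):
    # Central finite-difference gradient (an assumption made for brevity).
    g = np.zeros_like(w)
    for i in range(w.size):
        e = np.zeros_like(w)
        e[i] = eps
        g[i] = (loss(w + e) - loss(w - e)) / (2 * eps)
    return g

def hessian(w, eps=1e-4):
    # Finite-difference Hessian built from the gradient, then symmetrized.
    n = w.size
    H = np.zeros((n, n))
    for i in range(n):
        e = np.zeros(n)
        e[i] = eps
        H[:, i] = (grad(w + e) - grad(w - e)) / (2 * eps)
    return 0.5 * (H + H.T)

def hessian_along_trajectory(w0, lr=0.05, steps=200, every=50):
    # View 1: Hessian eigenvalues sampled only where gradient descent goes.
    w = np.array(w0, dtype=float)
    spectra = []
    for t in range(steps + 1):
        if t % every == 0:
            spectra.append(np.linalg.eigvalsh(hessian(w)))
        w = w - lr * grad(w)
    return w, spectra

def loss_surface_slice(center, d1, d2, radius=2.5, n=21):
    # View 2: loss values on a 2D grid spanned by directions d1 and d2.
    alphas = np.linspace(-radius, radius, n)
    return np.array([[loss(center + a * d1 + b * d2) for b in alphas]
                     for a in alphas])

w_final, spectra = hessian_along_trajectory([0.3, 1.0])
grid = loss_surface_slice(w_final, np.array([1.0, 0.0]), np.array([0.0, 1.0]))
```

On this toy problem the trajectory view starts with a negative eigenvalue and ends at a single minimum, while the slice also covers the second basin near w0 = -1 that the optimizer never enters. The two views answer different questions, which is why results from them need not conflict.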


