Explore >> Select a destination

You are here: neuralnetworksanddeeplearning.com

iclr-blogposts.github.io (10.9 parsecs away)
The product between the Hessian of a function and a vector, the Hessian-vector product (HVP), is a fundamental quantity for studying the variation of a function. It is ubiquitous in traditional optimization and machine learning. However, the computation of HVPs is often considered prohibitive in the context of deep learning, driving practitioners to use proxy quantities to evaluate the loss geometry. Standard automatic differentiation theory predicts that the computational complexity of an HVP is of the same order of magnitude as the complexity of computing a gradient. The goal of this blog post is to provide a practical counterpart to this theoretical result, showing that modern automatic differentiation frameworks, JAX and PyTorch, allow for efficient computation of these HVPs in standard deep learning cost functions.
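
The claim that an HVP costs about as much as a gradient is easy to see in code. The following is a minimal JAX sketch, not taken from the linked post: the toy `loss` function and shapes are placeholders, and forward-over-reverse differentiation is just one standard recipe for the computation.

```python
import jax
import jax.numpy as jnp


def loss(params):
    # Toy stand-in for a deep learning cost function.
    return jnp.sum(jnp.tanh(params) ** 2)


def hvp(f, x, v):
    # Forward-over-reverse: push the tangent v through the gradient of f.
    # Costs a small constant factor times one gradient evaluation and
    # never materializes the full Hessian.
    return jax.jvp(jax.grad(f), (x,), (v,))[1]


x = jnp.ones(5)
v = jnp.arange(5.0)
print(hvp(loss, x, v))  # the Hessian of `loss` at x, applied to v
```

Other compositions (for example reverse-over-forward) give the same result with the same asymptotic cost.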

sriku.org (12.8 parsecs away)

michael-lewis.com (11.4 parsecs away)
This is a short summary of some of the terminology used in machine learning, with an emphasis on neural networks. I've put it together primarily to help my own understanding, phrasing it largely in non-mathematical terms. As such, it may be of use to others who come from more of a programming than a mathematical background.

www.depthfirstlearning.com (62.0 parsecs away)