Outer Web | Explore

Explore >> Select a destination

You are here		jingnanshi.com Automatic Differentiation: Forward and Reverse
\|	\|	comsci.blog Understanding and Building Neural Networks: A Step-by-Step Guide from Scratch \| ML and robotics notes	10.4 parsecs away Travel
\|	\|	In this tutorial, we will learn two different methods to implement neural networks from scratch using Python: Extremely simple method: Finite difference Still a very simple method: Backpropagation	10.4 parsecs away Travel
\|	\|	matbesancon.xyz Differentiating the discrete: Automatic Differentiation meets Integer Optimization \| ??	9.4 parsecs away Travel
\|	\|	What can automated gradient computations bring to mathematical optimizers, what does it take to compute?	9.4 parsecs away Travel
\|	\|	iclr-blogposts.github.io How to compute Hessian-vector products? \| ICLR Blogposts 2024	8.0 parsecs away Travel
\|	\|	The product between the Hessian of a function and a vector, the Hessian-vector product (HVP), is a fundamental quantity to study the variation of a function. It is ubiquitous in traditional optimization and machine learning. However, the computation of HVPs is often considered prohibitive in the context of deep learning, driving practitioners to use proxy quantities to evaluate the loss geometry. Standard automatic differentiation theory predicts that the computational complexity of an HVP is of the same order of magnitude as the complexity of computing a gradient. The goal of this blog post is to provide a practical counterpart to this theoretical result, showing that modern automatic differentiation frameworks, JAX and PyTorch, allow for efficient computation of these HVPs in standard deep learning cost functions.	8.0 parsecs away Travel
\|	\|	dennybritz.com Recurrent Neural Networks Tutorial, Part 3 - Backpropagation Through Time and Vanishing Gradients · Denny's Blog	42.2 parsecs away Travel
\|		This the thirdpart of the Recurrent Neural Network Tutorial.	42.2 parsecs away Travel