Explore >> Select a destination


You are here

sebastianraschka.com
| | dennybritz.com
2.3 parsecs away

Travel
| | All the code is also available as an Jupyter notebook on Github.
| | jxmo.io
1.8 parsecs away

Travel
| | A primer on variational autoencoders (VAEs) culminating in a PyTorch implementation of a VAE with discrete latents.
| | neuralnetworksanddeeplearning.com
2.9 parsecs away

Travel
| | [AI summary] The text provides an in-depth explanation of the backpropagation algorithm in neural networks. It starts by discussing the concept of how small changes in weights propagate through the network to affect the final cost, leading to the derivation of the partial derivatives required for gradient descent. The explanation includes a heuristic argument based on tracking the perturbation of weights through the network, resulting in a chain of partial derivatives. The text also touches on the historical context of how backpropagation was discovered, emphasizing the process of simplifying complex proofs and the role of using weighted inputs (z-values) as intermediate variables to streamline the derivation. Finally, it concludes with a citation and licens...
| | programmathically.com
11.8 parsecs away

Travel
| Sharing is caringTweetIn this post, we develop an understanding of why gradients can vanish or explode when training deep neural networks. Furthermore, we look at some strategies for avoiding exploding and vanishing gradients. The vanishing gradient problem describes a situation encountered in the training of neural networks where the gradients used to update the weights []