|
You are here |
sebastianraschka.com | ||
| | | | |
dennybritz.com
|
|
| | | | | All the code is also available as an Jupyter notebook on Github. | |
| | | | |
jxmo.io
|
|
| | | | | A primer on variational autoencoders (VAEs) culminating in a PyTorch implementation of a VAE with discrete latents. | |
| | | | |
neuralnetworksanddeeplearning.com
|
|
| | | | | [AI summary] The text provides an in-depth explanation of the backpropagation algorithm in neural networks. It starts by discussing the concept of how small changes in weights propagate through the network to affect the final cost, leading to the derivation of the partial derivatives required for gradient descent. The explanation includes a heuristic argument based on tracking the perturbation of weights through the network, resulting in a chain of partial derivatives. The text also touches on the historical context of how backpropagation was discovered, emphasizing the process of simplifying complex proofs and the role of using weighted inputs (z-values) as intermediate variables to streamline the derivation. Finally, it concludes with a citation and licens... | |
| | | | |
programmathically.com
|
|
| | | Sharing is caringTweetIn this post, we develop an understanding of why gradients can vanish or explode when training deep neural networks. Furthermore, we look at some strategies for avoiding exploding and vanishing gradients. The vanishing gradient problem describes a situation encountered in the training of neural networks where the gradients used to update the weights [] | ||