Explore >> Select a destination


You are here

windowsontheory.org
| | marcospereira.me
2.3 parsecs away

Travel
| | In this post we summarize the math behind deep learning and implement a simple network that achieves 85% accuracy classifying digits from the MNIST dataset.
| | theorydish.blog
1.8 parsecs away

Travel
| | The chain rule is a fundamental result in calculus. Roughly speaking, it states that if a variable $latex c$ is a differentiable function of intermediate variables $latex b_1,\ldots,b_n$, and each intermediate variable $latex b_i$ is itself a differentiable function of $latex a$, then we can compute the derivative $latex \frac{{\mathrm d} c}{{\mathrm d} a}$ as...
| | newvick.com
2.0 parsecs away

Travel
| | I've been working my way through Andrej Karpathy's 'spelled-out intro to backpropagation', and this post is my recap of how backpropagation works.
| | swethatanamala.github.io
14.6 parsecs away

Travel
| The authors developed a straightforward application of the Long Short-Term Memory (LSTM) architecture which can solve English to French translation.