You are here: windowsontheory.org

marcospereira.me
In this post we summarize the math behind deep learning and implement a simple network that achieves 85% accuracy classifying digits from the MNIST dataset.

theorydish.blog
The chain rule is a fundamental result in calculus. Roughly speaking, it states that if a variable $c$ is a differentiable function of intermediate variables $b_1,\ldots,b_n$, and each intermediate variable $b_i$ is itself a differentiable function of $a$, then we can compute the derivative $\frac{\mathrm{d} c}{\mathrm{d} a}$ as...

newvick.com
I've been working my way through Andrej Karpathy's 'spelled-out intro to backpropagation', and this post is my recap of how backpropagation works.

swethatanamala.github.io
The authors developed a straightforward application of the Long Short-Term Memory (LSTM) architecture that can perform English-to-French translation.