|
You are here |
swethatanamala.github.io | ||
| | | | |
jalammar.github.io
|
|
| | | | | Translations: Chinese (Simplified), French, Japanese, Korean, Persian, Russian, Turkish, Uzbek Watch: MIT's Deep Learning State of the Art lecture referencing this post May 25th update: New graphics (RNN animation, word embedding graph), color coding, elaborated on the final attention example. Note: The animations below are videos. Touch or hover on them (if you're using a mouse) to get play controls so you can pause if needed. Sequence-to-sequence models are deep learning models that have achieved a lot of success in tasks like machine translation, text summarization, and image captioning. Google Translate started using such a model in production in late 2016. These models are explained in the two pioneering papers (Sutskever et al., 2014, Cho et al., 2014)... | |
| | | | |
bdtechtalks.com
|
|
| | | | | The transformer model has become one of the main highlights of advances in deep learning and deep neural networks. | |
| | | | |
dennybritz.com
|
|
| | | | | Recurrent Neural Networks (RNNs) are popular models that have shown great promise in manyNLP tasks. | |
| | | | |
datadan.io
|
|
| | | Linear regression and gradient descent are techniques that form the basis of many other, more complicated, ML/AI techniques (e.g., deep learning models). They are, thus, building blocks that all ML/AI engineers need to understand. | ||