|
You are here |
teddykoker.com | ||
| | | | |
sigmoidprime.com
|
|
| | | | | An exploration of Transformer-XL, a modified Transformer optimized for longer context length. | |
| | | | |
vxlabs.com
|
|
| | | | | I have recently become fascinated with (Variational) Autoencoders and with PyTorch. Kevin Frans has a beautiful blog post online explaining variational autoencoders, with examples in TensorFlow and, importantly, with cat pictures. Jaan Altosaar's blog post takes an even deeper look at VAEs from both the deep learning perspective and the perspective of graphical models. Both of these posts, as well as Diederik Kingma's original 2014 paper Auto-Encoding Variational Bayes, are more than worth your time. | |
| | | | |
harvardnlp.github.io
|
|
| | | | | [AI summary] The provided code is a comprehensive implementation of the Transformer model, including data loading, model architecture, training, and visualization. It also includes functions for decoding and visualizing attention mechanisms across different layers of the model. The code is structured to support both training and inference, with examples provided for running the model and visualizing attention patterns. | |
| | | | |
www.jerpint.io
|
|
| | | A collection of anything and everything. | ||