Explore >> Select a destination


You are here

blog.briankitano.com
| | comsci.blog
6.5 parsecs away

Travel
| | In this tutorial, we will implement transformers step-by-step and understand their implementation. There are other great tutorials on the implementation of transformers, but they usually dive into the complex parts too early, like they directly start implementing additional parts like masks and multi-head attention, but it is not very intuitional without first building the core part of the transformers.
| | nlp.seas.harvard.edu
6.3 parsecs away

Travel
| | The Annotated Transformer
| | www.nicktasios.nl
5.4 parsecs away

Travel
| | In the Latent Diffusion Series of blog posts, I'm going through all components needed to train a latent diffusion model to generate random digits from the MNIST dataset. In this first post, we will tr
| | neptune.ai
63.1 parsecs away

Travel
| Insights and strategies for selecting the best LLM and conducting efficient fine-tuning, even when resources are constrained.