Outer Web | Explore

Explore >> Select a destination

You are here		sigmoidprime.com Transformer-XL: A Memory-Augmented Transformer
\|	\|	teddykoker.com NLP from Scratch: Annotated Attention \| Teddy Koker	9.3 parsecs away Travel
\|	\|	This post is the first in a series of articles about natural language processing (NLP), a subfield of machine learning concerning the interaction between computers and human language. This article will be focused on attention, a mechanism that forms the backbone of many state-of-the art language models, including Googles BERT (Devlin et al., 2018), and OpenAIs GPT-2 (Radford et al., 2019).	9.3 parsecs away Travel
\|	\|	peterbloem.nl Transformers from scratch \| peterbloem.nl	9.4 parsecs away Travel
\|	\|		9.4 parsecs away Travel
\|	\|	blog.briankitano.com Llama from scratch (or how to implement a paper without crying) \| Brian Kitano	6.4 parsecs away Travel
\|	\|	Llama from scratch I want to provide some tips from my experience implementing a paper. I'm going to cover my tips so far from implementing a dramatically sc...	6.4 parsecs away Travel
\|	\|	www.danieldjohnson.com Composing Music With Recurrent Neural Networks \| Daniel D. Johnson	56.7 parsecs away Travel
\|		Writeup for my first major machine learning project.	56.7 parsecs away Travel