|
You are here |
comsci.blog | ||
| | | | |
teddykoker.com
|
|
| | | | | This post is the first in a series of articles about natural language processing (NLP), a subfield of machine learning concerning the interaction between computers and human language. This article will be focused on attention, a mechanism that forms the backbone of many state-of-the art language models, including Googles BERT (Devlin et al., 2018), and OpenAIs GPT-2 (Radford et al., 2019). | |
| | | | |
sigmoidprime.com
|
|
| | | | | An exploration of Transformer-XL, a modified Transformer optimized for longer context length. | |
| | | | |
blog.briankitano.com
|
|
| | | | | Llama from scratch I want to provide some tips from my experience implementing a paper. I'm going to cover my tips so far from implementing a dramatically sc... | |
| | | | |
www.jeremymorgan.com
|
|
| | | Want to learn about PyTorch? Of course you do. This tutorial covers PyTorch basics, creating a simple neural network, and applying it to classify handwritten digits. | ||