Explore >> Select a destination


You are here

harvardnlp.github.io
| | teddykoker.com
1.3 parsecs away

Travel
| | This post is the first in a series of articles about natural language processing (NLP), a subfield of machine learning concerning the interaction between computers and human language. This article will be focused on attention, a mechanism that forms the backbone of many state-of-the art language models, including Googles BERT (Devlin et al., 2018), and OpenAIs GPT-2 (Radford et al., 2019).
| | nlp.seas.harvard.edu
0.0 parsecs away

Travel
| | The Annotated Transformer
| | sigmoidprime.com
2.0 parsecs away

Travel
| | An exploration of Transformer-XL, a modified Transformer optimized for longer context length.
| | blog.fastforwardlabs.com
13.6 parsecs away

Travel
| This article is available as a notebook on Github. Please refer to that notebook for a more detailed discussion and code fixes and updates. Despite all the recent excitement around deep learning, neural networks have a reputation among non-specialists as complicated to build and difficult to interpret. And while interpretability remains an issue, there are now high-level neural network libraries that enable developers to quickly build neural network models without worrying about the numerical details of floating point operations and linear algebra.