haifengl.wordpress.com
Generative artificial intelligence (GenAI), especially ChatGPT, has captured everyone's attention. Transformer-based large language models (LLMs), trained on vast quantities of unlabeled data, demonstrate the ability to generalize to many different tasks. To understand why LLMs are so powerful, this post takes a deep dive into how they work. LLM Evolutionary Tree...
swethatanamala.github.io
The authors developed a straightforward application of the Long Short-Term Memory (LSTM) architecture that can translate English to French.
d2l.ai
marcospereira.me
In this post we summarize the math behind deep learning and implement a simple network that achieves 85% accuracy classifying digits from the MNIST dataset.