Explore >> Select a destination


You are here

minish.ai
| | www.mosaicml.com
5.2 parsecs away

Travel
| | Introducing MPT-30B, a new, more powerful member of our Foundation Series of open-source models, trained with an 8k context length on NVIDIA H100 Tensor Core GPUs.
| | deepmind.google
4.5 parsecs away

Travel
| | We ask the question: "What is the optimal model size and number of training tokens for a given compute budget?" To answer this question, we train models of various sizes and with various numbers...
| | bdtechtalks.com
5.0 parsecs away

Travel
| | The transformer model has become one of the main highlights of advances in deep learning and deep neural networks.
| | blogditifet.com
10.7 parsecs away

Travel
|