Explore >> Select a destination


You are here

davidmytton.blog
| | simonwillison.net
8.4 parsecs away

Travel
| | I think it's now possible to train a large language model with similar functionality to GPT-3 for $85,000. And I think we might soon be able to run the resulting ...
| | deepmind.google
8.1 parsecs away

Travel
| | We ask the question: "What is the optimal model size and number of training tokens for a given compute budget?" To answer this question, we train models of various sizes and with various numbers...
| | lambda.ai
9.1 parsecs away

Travel
| | Benchmarks on NVIDIA's Transformer Engine, which boosts FP8 performance by an impressive 60% on GPT3-style model testing on NVIDIA H100 Tensor Core GPUs.
| | www.sbrebrown.com
11.2 parsecs away

Travel
|