Outer Web | Explore

Explore >> Select a destination

You are here		davidmytton.blog Expect more overestimates of AI energy consumption
\|	\|	simonwillison.net Could you train a ChatGPT-beating model for $85,000 and run it in a browser?	8.4 parsecs away Travel
\|	\|	I think it's now possible to train a large language model with similar functionality to GPT-3 for $85,000. And I think we might soon be able to run the resulting ...	8.4 parsecs away Travel
\|	\|	deepmind.google An empirical analysis of compute-optimal large language model training - Google DeepMind	8.1 parsecs away Travel
\|	\|	We ask the question: "What is the optimal model size and number of training tokens for a given compute budget?" To answer this question, we train models of various sizes and with various numbers...	8.1 parsecs away Travel
\|	\|	lambda.ai Unleashing the power of Transformers with NVIDIA Transformer Engine	9.1 parsecs away Travel
\|	\|	Benchmarks on NVIDIA's Transformer Engine, which boosts FP8 performance by an impressive 60% on GPT3-style model testing on NVIDIA H100 Tensor Core GPUs.	9.1 parsecs away Travel
\|	\|	www.sbrebrown.com Inks \| Hey there!	11.2 parsecs away Travel
\|			11.2 parsecs away Travel