|
You are here |
davidmytton.blog | ||
| | | | |
simonwillison.net
|
|
| | | | | I think it's now possible to train a large language model with similar functionality to GPT-3 for $85,000. And I think we might soon be able to run the resulting ... | |
| | | | |
deepmind.google
|
|
| | | | | We ask the question: "What is the optimal model size and number of training tokens for a given compute budget?" To answer this question, we train models of various sizes and with various numbers... | |
| | | | |
lambda.ai
|
|
| | | | | Benchmarks on NVIDIA's Transformer Engine, which boosts FP8 performance by an impressive 60% on GPT3-style model testing on NVIDIA H100 Tensor Core GPUs. | |
| | | | |
www.sbrebrown.com
|
|
| | | |||