You are here: dynomight.net

windowsontheory.org
[Yet another "philosophizing" post, but one with some actual numbers. See also this follow up. --Boaz] Recently there have been many debates on "artificial general intelligence" (AGI) and whether or not we are close to achieving it by scaling up our current AI systems. In this post, I'd like to make this debate a bit...

www.alignmentforum.org
On March 29th, DeepMind published a paper, "Training Compute-Optimal Large Language Models", that shows that essentially everyone -- OpenAI, DeepMind...

deepmind.google
We ask the question: "What is the optimal model size and number of training tokens for a given compute budget?" To answer this question, we train models of various sizes and with various numbers...

trishagee.com
Find out where to catch Trisha Gee in the autumn of 2024