|
You are here |
www.alignmentforum.org | ||
| | | | |
blog.moonglow.ai
|
|
| | | | | Parameters and data. These are the two ingredients of training ML models. The total amount of computation ("compute") you need to do to train a model is proportional to the number of parameters multiplied by the amount of data (measured in "tokens"). Four years ago, it was well-known that if | |
| | | | |
www.lesswrong.com
|
|
| | | | | Our alignment research aims to make artificial general intelligence (AGI) aligned with human values and follow human intent. We take an iterative, em... | |
| | | | |
www.lesswrong.com
|
|
| | | | | On March 29th, DeepMind published a paper, "Training Compute-Optimal Large Language Models", that shows that essentially everyone -- OpenAI, DeepMind... | |
| | | | |
simonwillison.net
|
|
| | | A month ago I asked Could you train a ChatGPT-beating model for $85,000 and run it in a browser?. $85,000 was a hypothetical training cost for LLaMA 7B plus Stanford ... | ||