|
You are here |
vkrakovna.wordpress.com | ||
| | | | |
www.lesswrong.com
|
|
| | | | | Introduction You open the Alignment Forum one day, and a new post stares at you. By sheer luck you have some time, so you actually read it. And then... | |
| | | | |
joecarlsmith.com
|
|
| | | | | From a talk at Anthropic in April 2025. | |
| | | | |
www.greaterwrong.com
|
|
| | | | | Find all Alignment Newsletter resources here. In particular, you can sign up, or look through this spreadsheet of all summaries that have ever been in the newsletter. I'm always happy to hear feedback; you can send it to me by replying to this email. Audio version here (may not be up yet). Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model (Julian Schrittwieser et al) (summarized by Nicholas): Up until now, model-free RL approaches have been state of the art at visually rich domains such as Atari, while model-based RL has excelled for games which require planning many steps ahead, such as Go, chess, and shogi. This paper attains state of the art performance on Atari using a model-based approach, MuZero, while matching AlphaZero (AN #36) at... | |
| | | | |
www.lesswrong.com
|
|
| | | I sometimes hear people asking: "What is the plan for avoiding a catastrophe from misaligned AI?" ... | ||