Outer Web | Explore

Explore >> Select a destination

You are here		vkrakovna.wordpress.com Paradigms of AI alignment: components and enablers \| Victoria Krakovna
\|	\|	www.lesswrong.com Epistemological Framing for AI Alignment Research - LessWrong	2.1 parsecs away Travel
\|	\|	Introduction You open the Alignment Forum one day, and a new post stares at you. By sheer luck you have some time, so you actually read it. And then...	2.1 parsecs away Travel
\|	\|	joecarlsmith.com Video and transcript of talk on automating alignment research - Joe Carlsmith	4.4 parsecs away Travel
\|	\|	From a talk at Anthropic in April 2025.	4.4 parsecs away Travel
\|	\|	www.greaterwrong.com [AN #75]: Solving Atari and Go with learned game models, and thoughts from a MIRI employee - LessWrong 2.0 viewer	3.9 parsecs away Travel
\|	\|	Find all Alignment Newsletter resources here. In particular, you can sign up, or look through this spreadsheet of all summaries that have ever been in the newsletter. I'm always happy to hear feedback; you can send it to me by replying to this email. Audio version here (may not be up yet). Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model (Julian Schrittwieser et al) (summarized by Nicholas): Up until now, model-free RL approaches have been state of the art at visually rich domains such as Atari, while model-based RL has excelled for games which require planning many steps ahead, such as Go, chess, and shogi. This paper attains state of the art performance on Atari using a model-based approach, MuZero, while matching AlphaZero (AN #36) at...	3.9 parsecs away Travel
\|	\|	www.lesswrong.com A Playbook for AI Risk Reduction (focused on misaligned AI) - LessWrong	30.8 parsecs away Travel
\|		I sometimes hear people asking: "What is the plan for avoiding a catastrophe from misaligned AI?" ...	30.8 parsecs away Travel