Outer Web | Explore


Click any link that interests you, or select a planet on the right, to continue exploring the Outer Web.

You are here		www.lesswrong.com On "first critical tries" in AI alignment - LessWrong
\|	\|	blog.redwoodresearch.org Notes on cooperating with unaligned AIs	4.7 parsecs away Travel
\|	\|	More thoughts on making deals with schemers	4.7 parsecs away Travel
\|	\|	www.greaterwrong.com What is it to solve the alignment problem? (Notes) - LessWrong 2.0 viewer	2.3 parsecs away Travel
\|	\|	(I originally wrote this post as some rough notes on defining the alignment problem, with the intention of turning them into something more polished later. I've now started doing that, as part of a broader series introduced here. In particular, the first post in that series covers some of the same ground as section 1 of this post. It also has the same title. And some of the essays in the series will draw on these notes as well.) People often talk about "solving the alignment problem." But what is it to do such a thing? I wanted to clarify my thinking about this topic, so I wrote up some notes.	2.3 parsecs away Travel
\|	\|	www.alignmentforum.org AGI safety from first principles: Introduction - AI Alignment Forum	2.1 parsecs away Travel
\|	\|	Richard Ngo lays out the core argument for why AGI could be an existential threat: we might build AIs that are much smarter than humans, that act aut...	2.1 parsecs away Travel
\|	\|	www.index.dev Understand all the LLM Models in this Guide	30.1 parsecs away Travel
\|		Learn all about Large Language Models (LLMs) in our comprehensive guide. Understand their capabilities, applications, and impact on various industries.	30.1 parsecs away Travel