Outer Web | Explore

Explore >> Select a destination

You are here		www.lesswrong.com Anthropic: Core Views on AI Safety: When, Why, What, and How - LessWrong
www.alignmentforum.org The Importance of AI Alignment, explained in 5 points - AI Alignment Forum	1.9 parsecs away
This piece gives an overview of the alignment problem and makes the case for AI alignment research. It is crafted both to be broadly accessible to th...

www.greaterwrong.com The Open Agency Model - LessWrong 2.0 viewer	2.1 parsecs away
Eric Drexler, Centre for the Governance of AI, University of Oxford. This document argues for "open agencies" - not opaque, unitary agents - as the appropriate model for applying future AI capabilities to consequential tasks that call for combining human guidance with delegation of planning and implementation to AI systems. This prospect reframes and can help to tame a wide range of classic AI safety challenges, leveraging alignment techniques in a relatively fault-tolerant context.

www.alignmentforum.org Discovering Language Model Behaviors with Model-Written Evaluations - AI Alignment Forum	2.3 parsecs away
"Discovering Language Model Behaviors with Model-Written Evaluations" is a new Anthropic paper by Ethan Perez et al. that I (Evan Hubinger) also coll...

gilkalai.wordpress.com A Visit to the Israeli Quantum Computing Center (IQCC) | Combinatorics and more	27.4 parsecs away
Two weeks ago I was invited together with my colleague Shay Mozes to visit the Israeli Quantum Computing Center located near the Tel Aviv University quite close to my home. That morning my wife told me not to be disappointed if I happened to see some quantum computers there :) , and I assured her...