|
You are here |
www.lesswrong.com | ||
| | | | |
joecarlsmith.com
|
|
| | | | | It's really important; we have a real shot; there are a lot of ways we can fail. | |
| | | | |
www.alignmentforum.org
|
|
| | | | | Evan et al argue for developing "model organisms of misalignment" - AI systems deliberately designed to exhibit concerning behaviors like deception o... | |
| | | | |
www.greaterwrong.com
|
|
| | | | | TL;DR:Strong problem-solving systems can be built from AI systems that play diverse roles, LLMs can readily play diverse roles in role architectures, and AI systems based on role architectures can be practical, safe, and effective in undertaking complex and consequential tasks. This article explores the practicalities and challenges of aligning large language models (LLMs[1]) to play central roles in performing tasks safely and effectively. It highlights the potential value of Open Agency and related role architectures in aligning AI for general applications while mitigating risks. | |
| | | | |
www.lesswrong.com
|
|
| | | The most hyped event of the week, by far, was the Manus Marketing Madness. Manus wasn't entirely hype, but there was very little there there in that... | ||