You are here: www.superannotate.com

neptune.ai
Reinforcement learning from human feedback has turned out to be the key to unlocking the full potential of today's LLMs.

www.v7labs.com
Here's the list of the most prominent applications of Reinforcement Learning shaping the future of Artificial Intelligence.

www.lesswrong.com
"Discovering Language Model Behaviors with Model-Written Evaluations" is a new Anthropic paper by Ethan Perez et al. that I (Evan Hubinger) also coll...

www.greaterwrong.com
Eric Drexler, Centre for the Governance of AI, University of Oxford. This document argues for "open agencies" - not opaque, unitary agents - as the appropriate model for applying future AI capabilities to consequential tasks that call for combining human guidance with delegation of planning and implementation to AI systems. This prospect reframes and can help to tame a wide range of classic AI safety challenges, leveraging alignment techniques in a relatively fault-tolerant context.