|
You are here |
www.lesswrong.com | ||
| | | | |
joecarlsmith.com
|
|
| | | | | From a talk at UT Austin in September 2025. | |
| | | | |
www.alignmentforum.org
|
|
| | | | | Introduction Imagine you are tasked with curing a disease which hasn't appeared yet. Setting aside why you would know about such a disease's emergen... | |
| | | | |
joecarlsmith.com
|
|
| | | | | Let's be the sort of species that aliens wouldn't fear the way we fear paperclip maximizers. | |
| | | | |
www.alignmentforum.org
|
|
| | | Executive Summary * The Google DeepMind mechanistic interpretability team has made a strategic pivot over the past year, from ambitious reverse-engi... | ||