Explore >> Select a destination


You are here

thezvi.wordpress.com
| | scottaaronson.blog
3.8 parsecs away

Travel
| | Two weeks ago, I gave a lecture setting out my current thoughts on AI safety, halfway through my year at OpenAI. I was asked to speak by UT Austin's Effective Altruist club. You can watch the lecture on YouTube here (I recommend 2x speed). The timing turned out to be weird, coming immediately after the...
| | www.theverge.com
3.7 parsecs away

Travel
| | The head of ChatGPT at OpenAI on AI attachment, ads, and whata's next for chatbots.
| | www.lesswrong.com
4.1 parsecs away

Travel
| | Comment by Ethan Perez - Evan and others on my team are working on non-mechanistic-interpretability directions primarily motivated by inner alignment: 1. Developing model organisms for deceptive inner alignment, which we may use to study the risk factors for deceptive alignment 2. Conditioning predictive models as an alternative to training agents. Predictive models may pose fewer inner alignment risks, for reasons discussed here 3. Studying the extent to which models exhibit likely pre-requisites to deceptive inner alignment, such as situational awareness (a very preliminary exploration is in Sec. 5 in our paper on model-written evaluations) 4. Investigating the extent to which externalized reasoning (e.g. chain of thought) is a way to gain transparency int...
| | www.heady.io
26.5 parsecs away

Travel
| Heady's Product Management team expertly guides you through the entire digital product lifecycle using agile processes and experienced product managers.