|
You are here |
www.greaterwrong.com | ||
| | | | |
scottaaronson.blog
|
|
| | | | | Two weeks ago, I gave a lecture setting out my current thoughts on AI safety, halfway through my year at OpenAI. I was asked to speak by UT Austin's Effective Altruist club. You can watch the lecture on YouTube here (I recommend 2x speed). The timing turned out to be weird, coming immediately after the... | |
| | | | |
www.exxactcorp.com
|
|
| | | | | [AI summary] The text provides an in-depth overview of Deep Reinforcement Learning (DRL), focusing on its key components, challenges, and applications. It explains how DRL combines reinforcement learning (RL) with deep learning to handle complex decision-making tasks. The article discusses the limitations of traditional Q-learning, such as the need for a Q-table and the issue of unstable target values. It introduces Deep Q-Networks (DQNs) as a solution, highlighting the use of experience replay and target networks to stabilize training. Additionally, the text highlights real-world applications like AlphaGo, Atari game playing, and oil and gas industry use cases. It concludes by emphasizing DRL's potential for scalable, human-compatible AI systems and its rol... | |
| | | | |
www.lesswrong.com
|
|
| | | | | A collection of shorter posts by LessWrong user paulfchristiano | |
| | | | |
www.robinwaite.com
|
|
| | | AI tutors personalise learning and support teachers, but the emotional connection and creativity of human educators remain irreplaceable. | ||