Outer Web | Explore

Explore >> Select a destination

You are here		distill.pub The Paths Perspective on Value Learning
\|	\|	blog.otoro.net Collective Intelligence for Deep Learning: A Survey of Recent Developments \| ???	18.1 parsecs away Travel
\|	\|	We survey ideas from complex systems such as swarm intelligence, self-organization, and emergent behavior that are gaining traction in ML. (Figure: Emergence...	18.1 parsecs away Travel
\|	\|	louiskirsch.com MetaGenRL: Improving Generalization in Meta Reinforcement Learning \| Louis Kirsch	23.1 parsecs away Travel
\|	\|	Biological evolution has distilled the experiences of many learners into the general learning algorithms of humans. Inspired by this process, MetaGenRL distills the experiences of many complex agents to meta-learn a low-complexity neural objective function that affects how future individuals will learn. Unlike recent meta-RL algorithms, MetaGenRL can generalize to new environments that are entirely different from those used for meta-training. In some cases, it even outperforms human-engineered RL algorit...	23.1 parsecs away Travel
\|	\|	machinethoughts.wordpress.com Reinterpreting AlphaZero \| Machine Thoughts	25.8 parsecs away Travel
\|	\|	While teaching reinforcement learning I kept asking myself what AlphaZero teaches us about RL. That question has lead to this post. This post generalizes AlphaZero to a larger class of RL algorithms by reinterpreting AlphaZero's policy network as a belief function --- as the probability that is the optimal action at state . This gives...	25.8 parsecs away Travel
\|	\|	www.markrjohnsongames.com Some Games I Played in 2022 and What I Thought Of Them \| Dr Mark R Johnson	166.2 parsecs away Travel
\|		Welcome to 2022's edition of "Some Games I Played in [Year] and What I Thought Of Them"! This year I've played relatively few games compared to recent years, but that'...	166.2 parsecs away Travel