You are here |
distill.pub | ||
| | | |
blog.otoro.net
|
|
| | | | We survey ideas from complex systems such as swarm intelligence, self-organization, and emergent behavior that are gaining traction in ML. (Figure: Emergence... | |
| | | |
louiskirsch.com
|
|
| | | | Biological evolution has distilled the experiences of many learners into the general learning algorithms of humans. Inspired by this process, MetaGenRL distills the experiences of many complex agents to meta-learn a low-complexity neural objective function that affects how future individuals will learn. Unlike recent meta-RL algorithms, MetaGenRL can generalize to new environments that are entirely different from those used for meta-training. In some cases, it even outperforms human-engineered RL algorit... | |
| | | |
machinethoughts.wordpress.com
|
|
| | | | While teaching reinforcement learning I kept asking myself what AlphaZero teaches us about RL. That question has lead to this post. This post generalizes AlphaZero to a larger class of RL algorithms by reinterpreting AlphaZero's policy network as a belief function --- as the probability that is the optimal action at state . This gives... | |
| | | |
www.markrjohnsongames.com
|
|
| | Welcome to 2022's edition of "Some Games I Played in [Year] and What I Thought Of Them"! This year I've played relatively few games compared to recent years, but that'... |