| 
	     You are here  | 
        louiskirsch.com | ||
| | | | | 
            
              niklasriewald.com
             | 
        |
| | | | | The last post was about playing optimal Pokemon games by calculating the value function V(x) using Bellmann equations. Unfortunately, this was impractical, so we need another way to calculate the likelihood of winning games. One solution to this problem is called Value Function Approximation. The idea behind it is to not calculate the value function... | |
| | | | | 
            
              pytorch.org
             | 
        |
| | | | | In this blogpost we describe the recently proposed Stochastic Weight Averaging (SWA) technique [1, 2], and its new implementation in torchcontrib. SWA is a simple procedure that improves generalization in deep learning over Stochastic Gradient Descent (SGD) at no additional cost, and can be used as a drop-in replacement for any other optimizer in PyTorch. SWA has a wide range of applications and features: | |
| | | | | 
            
              www.assemblyai.com
             | 
        |
| | | | | In this video, we learn about Reinforcement Learning and (Deep) Q-Learning. | |
| | | | | 
            
              matbesancon.xyz
             | 
        |
| | | Learning by doing: predicting the outcome. | ||