

You are here

live-simons-blog.pantheonsite.io
francisbach.com
14.1 parsecs away

francisbach.com
12.8 parsecs away

mycqstate.wordpress.com
10.1 parsecs away

Last Spring I took part in the Simons Institute's semester on Quantum Hamiltonian Complexity. The semester was a great success, with an excellent batch of long-term participants and many fruitful interactions. The Institute asked me to write a short "Research Vignette" presenting, to a broad audience, an example scientific outcome of the programme. You can...
iclr-blogposts.github.io
97.8 parsecs away

Reinforcement Learning from Human Feedback (RLHF) is pivotal in the modern application of language modeling, as exemplified by ChatGPT. This blog post delves into an in-depth exploration of RLHF, attempting to reproduce the results from OpenAI's inaugural RLHF paper, published in 2019. Our detailed examination provides valuable insights into the implementation details of RLHF, which often go unnoticed.