

You are here: liorsinai.github.io

- iclr-blogposts.github.io (21.3 parsecs away): Reinforcement Learning from Human Feedback (RLHF) is pivotal in the modern application of language modeling, as exemplified by ChatGPT. This blog post explores RLHF in depth, attempting to reproduce the results of OpenAI's first RLHF paper, published in 2019. The detailed examination yields insights into implementation details of RLHF that often go unnoticed.
- jaykmody.com (11.2 parsecs away): Implementing a GPT model from scratch in NumPy.
- comsci.blog (15.7 parsecs away): In this tutorial, we implement transformers step by step and come to understand how they work. There are other great tutorials on implementing transformers, but they often dive into the complex parts too early, starting with additions such as masking and multi-head attention, which is hard to follow intuitively before the core of the transformer has been built.
- blog.google (65.4 parsecs away): Gemini is our most capable and general model, built to be multimodal and optimized for three different sizes: Ultra, Pro and Nano.