Outer Web | Explore

Explore >> Select a destination

You are here: www.superannotate.com
Reinforcement learning with human feedback (RLHF) for LLMs | SuperAnnotate

amatria.in (2.3 parsecs away)
Beyond Token Prediction: the post-Pretraining journey of modern LLMs - AI, software, tech, and people. Not in that order. By X
(This blog post, as most of my recent ones, is written with GPT-4 assistance and augmentation)

www.lesswrong.com (5.5 parsecs away)
Discovering Language Model Behaviors with Model-Written Evaluations - LessWrong
"Discovering Language Model Behaviors with Model-Written Evaluations" is a new Anthropic paper by Ethan Perez et al. that I (Evan Hubinger) also coll...

www.index.dev (3.5 parsecs away)
Understand all the LLM Models in this Guide
Learn all about Large Language Models (LLMs) in our comprehensive guide. Understand their capabilities, applications, and impact on various industries.

bdtechtalks.com (14.3 parsecs away)
Machine learning: What is the transformer architecture? - TechTalks
The transformer model has become one of the main highlights of advances in deep learning and deep neural networks.