You are here: transformer-circuits.pub

thesephist.com
[AI summary] The text provides an in-depth overview of research on sparse autoencoders (SAEs) applied to embeddings for automated interpretability. It discusses methods for analyzing and manipulating embeddings, including feature extraction, gradient-based optimization, and visualization tools. The work emphasizes the importance of understanding model representations to improve human-computer interaction with information systems. Key components include: 1) Automated interpretability prompts for generating feature labels, 2) Feature gradients implementation for optimizing embeddings to match desired feature dictionaries, and 3) Visualizations of feature spaces and embedding transformations. The text also includes FAQs addressing the use of embeddings over lan...

cset.georgetown.edu
Place to find CSET's publications, reports, and people

goodfire.ai
Goodfire is an AI research company building practical interpretability tools for safe and reliable generative models.

www.onlandscape.co.uk
[AI summary] The article discusses the privacy and cookie policies of the online magazine 'On Landscape', which focuses on landscape photography, and includes information about its registration and social media presence.