Sparse Autoencoder

Formulation of Feature Circuits with Sparse Autoencoders in LLM
Large Language Models

Large language models (LLMs) have witnessed impressive progress and these large models can do a…

Shuyang

February 19, 2025

6 min read

Open the Artificial Brain: Sparse Autoencoders for LLM Inspection
Artificial Intelligence

A deep dive into LLM visualization and interpretation using sparse autoencoders

Salvatore Raieli

November 16, 2024

15 min read

Towards Monosemanticity: A Step Towards Understanding Large Language Models
Machine Learning

Understanding the mechanistic interpretability research problem and reverse-engineering these large language models

Anish Dubey

July 11, 2024

12 min read