neuronpedia

Here are 2 public repositories matching this topic...

peppinob-ol / attribution-graph-probing

Automates attribution-graph analysis via probe prompting: circuit-trace a prompt, auto-generate concept probes, profile feature activations, and cluster supernodes.

graph-analysis sparse-autoencoders mechanistic-interpretability llm-interpretability research-tooling circuit-tracing attribution-graphs probe-prompting prompt-probing neuronpedia feature-activation supernodes cross-layer-transcoder

Updated Apr 24, 2026
Python

nulone / sae-consciousness-steering-pitfalls

Reproducible case study of pitfalls in contrastive SAE discovery and steering for "consciousness" features (GemmaScope SAEs, Gemma 3 4B/12B): reconstruction confound, delta-steering fix, matched controls, and false-positive scaling law vs dataset size.

gemma sae sparse-autoencoder contrastive-learning mechanistic-interpretability feature-steering neuronpedia null-result gemmascope delta-steering

Updated Feb 26, 2026
Python
