Coin

Chain of Interpretability

First, we adapt the original MAIA implementation from the paper (https://arxiv.org/abs/2404.14394) to run with Google Gemini. Second, we observe that some neurons are polysemantic, so we train sparse autoencoders (SAEs) to resolve this polysemanticity into monosemantic neurons.
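To make the SAE step concrete, here is a minimal sketch of a sparse autoencoder trained on a matrix of neuron activations. It is not the code used in this repository: the function names (`init_sae`, `sae_forward`, `sae_loss`, `train_step`), the L1 sparsity penalty, the hyperparameters, and the use of plain NumPy with hand-written gradients are all assumptions made for illustration. The random matrix in the demo stands in for real activations.

```python
import numpy as np


def init_sae(d_in, d_hidden, seed=0):
    """Create parameters for a one-hidden-layer sparse autoencoder (hypothetical layout)."""
    rng = np.random.default_rng(seed)
    w_enc = rng.normal(0.0, 0.1, size=(d_in, d_hidden))
    return {
        "w_enc": w_enc,
        "b_enc": np.zeros(d_hidden),
        "w_dec": w_enc.T.copy(),  # decoder starts as the encoder transpose
        "b_dec": np.zeros(d_in),
    }


def sae_forward(params, x):
    """Encode activations x into non-negative latents z, then reconstruct x."""
    pre = x @ params["w_enc"] + params["b_enc"]
    z = np.maximum(pre, 0.0)  # ReLU keeps latents non-negative
    x_hat = z @ params["w_dec"] + params["b_dec"]
    return z, x_hat


def sae_loss(params, x, l1_coeff=1e-3):
    """Mean squared reconstruction error plus an L1 penalty on the latents."""
    z, x_hat = sae_forward(params, x)
    recon = np.mean(np.sum((x_hat - x) ** 2, axis=1))
    sparsity = np.mean(np.sum(z, axis=1))  # z >= 0, so sum(z) equals the L1 norm
    return recon + l1_coeff * sparsity


def train_step(params, x, lr=0.01, l1_coeff=1e-3):
    """One full-batch gradient descent step; updates params in place."""
    n = x.shape[0]
    z, x_hat = sae_forward(params, x)
    d_x_hat = 2.0 * (x_hat - x) / n                      # gradient of the reconstruction term
    d_z = d_x_hat @ params["w_dec"].T + l1_coeff / n     # add gradient of the L1 term
    d_pre = d_z * (z > 0)                                # backprop through the ReLU
    grads = {
        "w_dec": z.T @ d_x_hat,
        "b_dec": d_x_hat.sum(axis=0),
        "w_enc": x.T @ d_pre,
        "b_enc": d_pre.sum(axis=0),
    }
    for name, grad in grads.items():
        params[name] -= lr * grad
    return sae_loss(params, x, l1_coeff)


if __name__ == "__main__":
    # Random data stands in for recorded neuron activations (assumption).
    activations = np.random.default_rng(1).normal(size=(256, 8))
    sae = init_sae(d_in=8, d_hidden=16)
    for step in range(500):
        loss = train_step(sae, activations)
    print("final loss:", loss)
```

The hidden layer is wider than the input (16 latents for 8 neurons in the demo) so that, under the sparsity penalty, each latent is encouraged to fire for a narrower set of inputs than the original neurons do. Whether the resulting latents are actually monosemantic has to be checked separately, for example by running them through the MAIA pipeline.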

More information on the implementation can be found in the maia_gemini folder, which was copied over from the original codebase (https://github.com/multimodal-interpretability/maia).

nitishmital/AutoInterpret

Folders and files

Latest commit

History

Repository files navigation

Coin

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages