I am doing AI Safety research, like prototyping interfaces, and passionate about meaningful human-AI collaboration
Autointerpretability agent
MCP, Mechanistic interpretability, Agentic workflows
GPT 2
Transformer architecture and optimizations, no PyTorch, comprehensive documentation
Attention-visualizer
GPT2 and Bert basic attention visualization + streamlit application


