Building production-grade AI systems — from RAG pipelines and agentic workflows to local LLM tooling. I work at the intersection of Generative AI, NLP, and MLOps, with a focus on shipping things that actually run.
- Day job: AI Engineer, building LLM-powered products in production
- Current interests: Agentic architectures, RAG with real retrieval quality (deduplication, ranking, citations), and local LLM setups with quantized models
- How I work: I prefer running things locally first — LM Studio, quantized Llama/Granite models — before scaling to cloud APIs
- Community: I write on Medium about things I wish were better documented, and I deploy experiments to Hugging Face Spaces
| Category | Tools |
|---|---|
| Languages | |
| LLM Orchestration | |
| Models & Embeddings | |
| Backend & Data | |
| Cloud & Infra | |
| UI | |
- Briefcast: How I Built a Personal AI Intelligence Agent That Reads the Entire AI Ecosystem — For ~$10/Month
- What's new with OpenAI's gpt-4o-mini
- Deciphering the power of Vision Language Models
- AgentForge: A simple AI Agent with Web Search and Image Generation
- ProGAN, StyleGAN, StyleGAN2: Exploring NVIDIA's breakthroughs
- Oracle Certified Generative AI Professional
- Google Cloud Professional Data Engineer



