Siddharth Surange SID-SURANGE

Siddharth Surange · AI Engineer

Building production-grade AI systems — from RAG pipelines and agentic workflows to local LLM tooling. I work at the intersection of Generative AI, NLP, and MLOps, with a focus on shipping things that actually run.

About Me

Day job: AI Engineer, building LLM-powered products in production
Current interests: Agentic architectures, RAG with real retrieval quality (deduplication, ranking, citations), and local LLM setups with quantized models
How I work: I prefer running things locally first — LM Studio, quantized Llama/Granite models — before scaling to cloud APIs
Community: I write on Medium about things I wish were better documented, and I deploy experiments to Hugging Face Spaces

🛠️ Tech Stack

Category	Tools
Languages
LLM Orchestration
Models & Embeddings
Backend & Data
Cloud & Infra
UI

🌟 Featured Projects

Briefcast

Automated AI research briefing agent — monitors Google AI, DeepMind, OpenAI, Anthropic, arXiv and more, then delivers a curated daily digest to Telegram with follow-up Q&A over a 14-day rolling knowledge base.

Stack: FastAPI · PostgreSQL + pgvector · LangChain LCEL · OpenRouter (Gemini, Claude) · APScheduler · Railway

Highlights: dual-layer deduplication (SHA-256 + cosine similarity), tiered source ranking, RAG answers with citations, ~$8/month to run in production

ResumeParser

HR-focused resume analysis tool that runs entirely on local LLMs — no API keys required. Extracts structured data from PDFs, flags missing sections, generates tailored interview questions, and visualizes resume content.

Stack: FastAPI · Gradio · PyTorch · IBM Docling · LM Studio (quantized Llama 3.1/3.2, IBM Granite)

Highlights: fully offline, 8-bit quantized model support, spell-check analysis, word cloud generation

PageSense · AgentForge · AI-Sandbox

Monorepo of production-grade experiments. PageSense: Chrome extension + FastAPI/Qdrant backend for semantic search over your browsing history. AgentForge: deployed agentic app with web search and image generation.

Stack: FastAPI · Qdrant · Gradio · smolagents · LlamaIndex · JavaScript (extension)

✍️ Writing

🏆 Certifications

Oracle Certified Generative AI Professional
Google Cloud Professional Data Engineer

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Siddharth Surange SID-SURANGE

Achievements