Skip to content
View ashutosh-utsav's full-sized avatar
🐒
🐒

Block or report ashutosh-utsav

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ashutosh-utsav/README.md

πŸ‘‹ Hi, I'm Ashutosh Utsav

Speech AI / Machine Learning Engineer focused on building real-world voice and conversational systems.

I work on end-to-end pipelines β€” from speech recognition and voice interfaces to scalable backend infrastructure powering AI applications.

Building voice-first systems for real-world environments where typing is not the interface.


βš™οΈ What I Work On

  • Speech-to-Text systems (ASR, Whisper fine-tuning, VAD)
  • Voice-first applications for multilingual users
  • Real-time AI pipelines (streaming, async workers, queues)
  • LLM-powered systems (RAG, agents, conversational workflows)
  • Scalable backend systems (FastAPI, microservices, cloud deployment)

πŸ› οΈ Tech Stack

Languages

Technologies / Frameworks

ML / LLM

Deployment


πŸ“Œ Highlights

  • Improved ASR performance for medical speech (WER ↓ significantly)
  • Built production-ready async pipelines handling real-time audio workloads
  • Experience deploying AI systems used in real environments

πŸ† Achievements

  • πŸ₯‡ Winner β€” National-level BharatGen AI Hackathon for developing JanVaani, an AI voice bot that automates grievance form submission
  • πŸ“Š Top 3 β€” Machine Learning Kaggle Competition
  • πŸ₯ˆ Runner-up β€” Hacksprint Hackathon for developing a sustainability-focused web application

🎭 Beyond Code

  • Hosted, directed, and wrote a mime play at a college cultural event attended by 2,000+ members

πŸ“« Connect

Pinned Loading

  1. Ambient-Listning Ambient-Listning Public

    Real-time clinical ambient listening system that transcribes and summarizes live audio into structured medical formats. Built with FastAPI and WebSockets, using Azure Queues for stream processing a…

    Python 1

  2. finance-assistance finance-assistance Public

    A multi-agent AI assistant delivering spoken market briefs and portfolio analysis. Features dynamic portfolio input, real-time data, news scraping with RAG, stateful analysis, and intent routing. B…

    Python 1

  3. LLL-From-Scratch LLL-From-Scratch Public

    Building a Large Language Model (LLM) from scratch using Python and PyTorch. This project explores the data handling, math, and Transformer (GPT) architecture to understand how LLMs work.

    Jupyter Notebook

  4. DishaAI DishaAI Public

    Forked from azeebneuron/DishaAI

    Vue