Skip to content
View Prateekbala's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report Prateekbala

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Prateekbala/README.md

Prateek Bala

LinkedIn Twitter Email

CSE undergrad drawn to the hard parts of backend engineering — distributed consensus, low-latency I/O, systems that hold up under pressure. I work closest to the metal when I can: lock-free queues in C++, Raft from scratch in Go, SSE over polling because polling is lazy.

Interested in fintech infrastructure, distributed systems, and MLOps tooling. Currently building microservices at Flexzistay and contributing to KServe (CNCF).


What I'm into

Building backend systems that hold up under pressure — high-throughput queues, low-latency pipelines, distributed coordination. And increasingly, making AI actually work in production: inference routing, agent orchestration, LLMs that do real things instead of demoing well.

The intersection is where it gets interesting — autonomous systems with financial logic, agents that don't just respond but act. That's the space I want to build in.

Projects

RiftMQ — Kafka-inspired queue in Go. Raft consensus, WAL, consistent hashing. 17,900 msg/sec, 100% delivery under 200 producers.

FIX Protocol Trading Gateway — C++20 order engine. Lock-free queues, async I/O. P50: 350µs, P99: 633µs at 250K orders/sec.

LLM Inference Router — OpenAI-compatible gateway across vLLM nodes. Circuit breakers, Prometheus/Grafana observability.


Open Source — KServe (CNCF)

#142 · Namespace filtering for multi-tenant K8s
#161 · InferenceGraph routing for multi-model pipelines


Pinned Loading

  1. Distributed-system Distributed-system Public

    A production-ready, high-performance distributed message broker system inspired by Apache Kafka, built from scratch in Go with advanced features like automatic partition rebalancing, rack-aware rep…

    Go

  2. eth-api-payments eth-api-payments Public

    Forked from MananSinghal123/eth-api-payments

    💰 Start charging for your api integrating our sdk without having to expose sensitive data or wait for days for settling payments. 🛡️ Trusted Payments though zk-verified usage .

    TypeScript

  3. Fix-Protocol-Tranding-Engine Fix-Protocol-Tranding-Engine Public

    A high-performance FIX (Financial Information eXchange) Protocol 4.4 trading gateway implemented in modern C++20. This system provides a complete order management and execution infrastructure with …

    C++

  4. Distributed-LLM-Router Distributed-LLM-Router Public

    Distributed LLM Router: an OpenAI-compatible FastAPI gateway that intelligently routes requests across multiple vLLM nodes with load balancing, failover retries, backpressure, and real-time Prometh…

    Python