Hi, I’m Austin —
I hate hard problems. I hate easy problems even more.
- Building Knowmi AI
- Working on Rust-based LLM inference (vllm-hb)
- Contributing to OSS (transformers, vllm, litellm, dotflow)
- Building operation-reassurance (structural memory for LLMs)
- Machine Learning Systems
- Inference runtimes & schedulers
- Agent / RAG architecture
- Performance + infra
Python • Rust • CUDA • FastAPI • Postgres • SQLite • Vector DBs
Designing systems that:
- understand structure (not just tokens)
- operate at runtime efficiently
- make better decisions with context


