β¨ "I've always been passionate about computers and the ability to make them solve problems for us. I value encountering challenging situations, enjoy analysing & solving problems and seeing the results. Code is a byproduct of this phenomenon."
I build things across the full stack β from low-level systems optimisation and distributed infrastructure, to ML platforms, recommender engines, clinical NLP services, and multi-agent GenAI systems. I care about making things work reliably at scale, in regulated environments, with measurable impact.
For over a decade I've navigated highly regulated domains β Healthcare, Energy, Finance β specialising in chipping away the "Hidden Technical Debt in Machine Learning".
Strong believer in ππΎ "AI is the new Electricity".
I want to use whatever skills and resources I have to help build a fairer world for everyone. That means fighting for climate action, standing against racism and inequality, spreading awareness about rational thinking and empathy, and advancing humanity's collective understanding through science and exploration. I'm actively looking for opportunities where technology β especially ML systems β can drive equitable outcomes in underserved communities, support public policy decisions with better data, and help us make progress on the problems that actually matter.
When I'm not debugging distributed systems, I'm probably reading about the intersection of responsible AI and public policy, or figuring out how to make technology work for people who need it most.
|
ποΈ Platform & Engineering
|
π§ͺ Product & Commercial
|
|
π Privacy & Compliance
|
π― NLP & GenAI
|
CS Fundamentals & Distributed SystemsPrimitives: C/C++, Big-O Tuning, LSM/B-Trees, PostgreSQL, Advanced SQL (Window Functions, CTEs), Bash/Shell, Perl. Python (Advanced internals, Asyncio, Metaprogramming).Systems: Rigorous cProfile analysis crossing CPython GIL/GC boundaries using pure stdlib buffers (heaps, deques). Architecting on CAP/PACELC theorem bounds with Paxos/Raft consensus, Event-Driven Architecture, Domain-Driven Design, and Gang of Four patterns.
Mathematics, Classical ML & Causal InferencePrimitives: Linear Algebra (SVD, PCA), Matrix Calculus, Bayesian Inference, Optimisation (SGD, Adam), Clustering. Execution: Determining when not to use Deep Learning. Wielding Causal Inference (Do-Calculus, Propensity matching), XGBoost/LightGBM, and specialised domain models: Recommender Systems (Two-Tower, Collaborative Filtering), Time-Series Forecasting (ARIMA, Prophet, DeepAR). Deep Learning Internals & Platform MLOpsDL Primitives: Re-deriving PyTorch Autograd/Transformer blocks from scratch. PEFT (LoRA/QLoRA), RLHF alignment (PPO/DPO). TensorFlow, Keras, Scikit-Learn.Platform: Strict FTI (Feature-Training-Inference) isolation via multi-stage OCI-compliant containerisation. Orchestrating with Kubeflow, ClearML, TFx, MLflow, DVC, Airflow, Feature Stores (Feast/Tecton). Enforcing data contracts, schema validation, and drift monitoring.
GenAI, HPC Optimisation & FinOps StrategyHPC: Optimising hardware utilisation via vLLM (PagedAttention), FlashAttention, Quantisation (INT8/FP4), GPU Profiling (torch.profiler).Agentic: Multi-Agent Orchestration via LangChain, LangGraph, DSPy, MCP, A2A. GraphRAG (Knowledge Graphs + LLMs). Guardrails, Prompt Engineering, and evals. Federated Learning and Differential Privacy guarantees. Strategy: Cross-functional leadership, RFC authorship, and FinOps architecture driving cost reductions. |
π₯ Healthcare & Clinical NLPAutomated consultation notes (32% time reduction), Care Quality assessments (70% of reviews automated), structured medical data extraction (93% accuracy, outperforming Google Healthcare NLP by 14%). Personalised prescription recommenders for 100k+ patients with 71.3% clinician adoption. Privacy-by-Design with 0.87+ F1 PII redaction. β‘ Energy & Time-Series ForecastingHigh-availability energy forecasting systems β regional price, load and solar generation forecasts for 10k+ sites, ensuring grid stability and 40% avg cost optimisation at Amber. ARIMA, Prophet, DeepAR for production anomaly detection across high-frequency streaming data with Kafka. π Recommender Systems & Consumer TechLed personalised consumer experiences at Linktree through end-to-end recommender systems (links & profiles) utilising collaborative filtering and embedding-based search β 33% gains in Link adoption & 19% profile subscription uplift. Patient segmentation driving 18β23% repeat order uplift. π¦ Enterprise ML & Document IntelligenceEnterprise ID Fraud Detection Platform with Kubeflow on GKE at ANZ. Synthetic data generation, font detection CNNs, and red flag identification in ID documents. Customer request triaging chatbot at HammondCare. ML infrastructure and SageMaker platform extensions at nib Group. "Thea" document mining framework at Eliiza. Won 2nd place in Cricket Australia DataJam 2020. π¬ Advanced ParadigmsPEFT (LoRA/QLoRA), RLHF alignment (PPO/DPO), A2A, MCP orchestration, evals & optimisation. Federated Learning, differential privacy guarantees. Agentic workflows β LangChain/LangGraph, Multi-Agent Systems, RAG, DSPy agents, guardrails. |
|
π€ GenAI & Agentic Systems
|
ποΈ Systems & Infrastructure
|
|
π§ͺ ML & Data Science
|
π§ Data Engineering & Tooling
|
π Full Catalogue
| Repo | Description | |
|---|---|---|
| blog | Personal blog | |
| suryaavala.github.io | Portfolio website | |
| powr | Energy data analysis | |
| Stellartube | Python project | |
| TimeStamper | Timestamping tool | |
| textparsing | Text parsing | |
| network | Network programming | |
| utilities | Reusable Python tooling | |
| CS231n | Stanford CNNs for Visual Recognition | |
| 18s1-9417 | UNSW ML (COMP9417) | |
| 17s1-cs9417 | UNSW ML (COMP9417) | |
| 16s2-comp2041-ass1 | UNSW Perl scripting β1 | |
| 16s2-comp2041-ass2 | UNSW Python scripting β1 | |
| 16s2-comp2041-labs | UNSW scripting labs | |
| egl_test | Technical assessment | |
| finder_test | Technical assessment | |
| tcal | Telugu movie releases β Google Calendar |
tensorflow/tfx (#3813) β Resolved strict dependency pinning conflicts for TF-Hub.
kubeflow/pipelines (#4702) β Fixed GCP inverse proxy URL routing priorities.
kubeflow/pipelines (#4706) β Reconciled SDK linting conflicts between pylint and yapf.
dask/dask (#5828) β Fixed multi-dimensional array ValueErrors in delayed map reductions.
iterative/katacoda-scenarios β DVC learning scenarios.





