Skip to content
View SuperMarioYL's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report SuperMarioYL

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
SuperMarioYL/README.md

Hi, I'm Leo 👋

Typing SVG

AI Infrastructure Inference Acceleration Cloud Native

Building high-performance AI systems · LLM optimization · Multimodal inference · Scalable ML infrastructure


Blog Profile Views



🎯 Core Expertise

🏗️ AI Infrastructure

Building scalable ML systems and training pipelines

PyTorch TensorFlow CUDA Triton

⚡ Inference Acceleration

Optimizing model serving and reducing latency

vLLM TensorRT ONNX Quantization

☁️ Cloud Native

Deploying and orchestrating at scale

Kubernetes Docker Ray Istio



🛠️ Tech Stack

Deep Learning Frameworks

Python PyTorch TensorFlow JAX

Inference & Optimization

CUDA TensorRT ONNX Runtime vLLM Triton Inference Server

Cloud Native & DevOps

Kubernetes Docker Ray Helm ArgoCD Prometheus

Languages & Tools

C++ Go Rust Git Linux

📊 View Detailed Language Statistics →
Language Stats


📈 GitHub Stats

GitHub Contribution Graph


💭 About Me

Passionate about pushing the boundaries of AI performance.
When not optimizing inference pipelines, you'll find me cycling, exploring photography, or traveling.



© 2025 Leo · Powered by passion for AI and open source

Pinned Loading

  1. trouve trouve Public

    trouve : A built-in integrated service discovery, service registration, and service forwarding general component for Spring projects

    Java 29 9

  2. Bison Bison Public

    Enterprise GPU Resource Billing & Multi-Tenant Management Platform 企业级 GPU 资源计费与多租户管理平台

    TypeScript 2