MachineLearningSystem

All

750 repositories

LLMRouter
Public
LLMRouter: An Open-Source Library for LLM Routing
Python
•
MIT License
•65•0•0•0•Updated Jan 1, 2026Jan 1, 2026
DistCA
Public
Efficient Long-context Language Model Training by Core Attention Disaggregation
Python
•4•0•0•0•Updated Dec 20, 2025Dec 20, 2025
cornserve
Public
Easy, Fast, and Scalable Multimodal AI
Python
•
Apache License 2.0
•6•0•0•0•Updated Dec 17, 2025Dec 17, 2025
asystem-astate
Public
C++
•
Apache License 2.0
•5•0•0•0•Updated Dec 9, 2025Dec 9, 2025
fastrl
Public
Efficient Reinforcement Learning for Language Models
Python
•
Apache License 2.0
•8•0•0•0•Updated Nov 21, 2025Nov 21, 2025
miles
Public
Python
•
Apache License 2.0
•69•0•0•0•Updated Nov 20, 2025Nov 20, 2025
NexRL
Public
NexRL is an ultra-loosely-coupled LLM post-training framework.
Python
•
Apache License 2.0
•4•0•0•0•Updated Nov 18, 2025Nov 18, 2025
NexVenusCL
Public
Nex Venus Communication Library
C++
•
Apache License 2.0
•6•0•0•0•Updated Nov 17, 2025Nov 17, 2025
26NSDI-Project-Ava
Public
An implementation of Paper "Empowering Agentic Video Analytics Systems with Video Language Models"
Python
•
MIT License
•3•1•0•0•Updated Nov 5, 2025Nov 5, 2025
DynaPipe
Public
Python
•1•0•0•0•Updated Oct 23, 2025Oct 23, 2025
HamiltonAttention
Public
Python
•2•0•0•0•Updated Oct 15, 2025Oct 15, 2025
streaming-vlm-
Public
StreamingVLM: Real-Time Understanding for Infinite Video Streams
Python
•
MIT License
•51•0•0•0•Updated Oct 15, 2025Oct 15, 2025
streaming-vlm
Public
StreamingVLM: Real-Time Understanding for Infinite Video Streams
Python
•
MIT License
•51•0•0•0•Updated Oct 13, 2025Oct 13, 2025
LongLive
Public
LongLive: Real-time Interactive Long Video Generation
Python
•
Other
•65•0•0•0•Updated Oct 13, 2025Oct 13, 2025
25NeurIPS_SpaceServe
Public
Python
•
Apache License 2.0
•3•0•0•0•Updated Oct 11, 2025Oct 11, 2025
25SOSPmercury_artifact
Public
Python
•
MIT License
•6•0•0•0•Updated Oct 1, 2025Oct 1, 2025
26NSDI-hydraserve
Public
Python
•
Apache License 2.0
•3•0•0•0•Updated Sep 26, 2025Sep 26, 2025
25SC-BurstEngine
Public
BurstEngine is an efficient framework designed to train LLMs on long-sequence data.
Python
•3•0•0•0•Updated Sep 25, 2025Sep 25, 2025
BulletServe
Public
Boosting GPU utilization for LLM serving via dynamic spatial-temporal prefill & decode orchestration
Python
•
Apache License 2.0
•2•0•0•0•Updated Sep 24, 2025Sep 24, 2025
26Eurosys-lorafusion
Public
LoRAFusion: Efficient LoRA Fine-Tuning for LLMs
Python
•2•0•0•0•Updated Sep 23, 2025Sep 23, 2025
OSDI25-blitz-scale
Public
The official implementation of OSDI'25 paper BlitzScale
Rust
•2•0•0•0•Updated Sep 20, 2025Sep 20, 2025
25OSDI-blitz-scale
Public
The official implementation of OSDI'25 paper BlitzScale
Rust
•2•1•0•0•Updated Sep 20, 2025Sep 20, 2025
25SC-gLLM
Public
gLLM: Global Balanced Pipeline Parallelism System for Distributed LLM Serving with Token Throttling
Python
•
Apache License 2.0
•4•0•0•0•Updated Sep 15, 2025Sep 15, 2025
25SOSP-pie2
Public
Rust
•
Apache License 2.0
•11•0•0•0•Updated Sep 10, 2025Sep 10, 2025
cheriot-rtos
Public
The RTOS components for the CHERIoT research platform
C++
•
MIT License
•59•0•0•0•Updated Sep 8, 2025Sep 8, 2025
NSDI26-FastServe
Public
Jupyter Notebook
•3•0•0•0•Updated Sep 6, 2025Sep 6, 2025
Gorgeous
Public
C++
•
Other
•3•0•0•0•Updated Sep 2, 2025Sep 2, 2025
26SIGMOD-Harmony
Public
C++
•
MIT License
•2•0•0•0•Updated Sep 2, 2025Sep 2, 2025
25SOSP-mage-artifact
Public
Artifact for SOSP 25 paper: Scalable Far Memory: Balancing Faults and Evictions
C
•1•0•0•0•Updated Aug 31, 2025Aug 31, 2025
25APsys-hypergen
Public
Python
•1•0•0•0•Updated Aug 27, 2025Aug 27, 2025