Repositories
Showing 6 of 6 repositories
- 1Cat-vLLM Public
vLLM fork for Tesla V100 (SM70) with AWQ 4-bit support, CUDA 12.8 build flow, and validated Qwen3.5 27B/35B deployment on multi-GPU V100.
- 1cat-llmtest Public
- lmdeploy Public (forked from InternLM/lmdeploy)
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.