Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[IMPROVEMENT] Change MistralReasoningParser behavior
#30391 opened Dec 10, 2025 by juliendenize Loading…
3 tasks done
Standardise get_rope to use rope_parameters["partial_rotary_factor"], not rotary_dim deepseek Related to DeepSeek models gpt-oss Related to GPT-OSS models llama Related to Llama models performance Performance-related issues qwen Related to Qwen models
#30389 opened Dec 10, 2025 by hmellor Loading…
[Docs] Generate full list of metrics in user docs documentation Improvements or additions to documentation
#30388 opened Dec 10, 2025 by markmc Loading…
[Core] Whisper support torch.compile v1
#30385 opened Dec 10, 2025 by NickLucche Loading…
[Fix]fix import error from lmcache kv-connector
#30376 opened Dec 10, 2025 by wz1qqx Loading…
5 tasks
Implement LMDB-based multi-modal cache ci/build multi-modality Related to multi-modality (#4194) v1
#30373 opened Dec 10, 2025 by petersalas Loading…
5 tasks
[Fix] Add default rope theta for qwen1 model qwen Related to Qwen models
#30369 opened Dec 10, 2025 by iwzbi Loading…
5 tasks
fix(gguf): Auto-select compatible dtype for GGUF models on Blackwell
#30365 opened Dec 9, 2025 by kitaekatt Loading…
4 tasks done
[Bugfix] awq_gemm: fix argument order swap
#30364 opened Dec 9, 2025 by mgehre-amd Loading…
Remove all2all backend envvar ci/build documentation Improvements or additions to documentation
#30363 opened Dec 9, 2025 by elizabetht Loading…
5 tasks
[WIP] Bump dockerfile to cuda 13.0.2 (for testing) ci/build nvidia
#30362 opened Dec 9, 2025 by dougbtv Loading…
2 tasks done
[Attention][AMD] Make flash-attn optional rocm Related to AMD ROCm speculative-decoding v1
#30361 opened Dec 9, 2025 by mgehre-amd Loading…
Upstream fp8 with static scales gpt oss gpt-oss Related to GPT-OSS models needs-rebase
#30357 opened Dec 9, 2025 by maleksan85 Draft
[CI][DeepSeek] Add nightly DeepSeek R1 lm_eval tests on H200 ci/build deepseek Related to DeepSeek models ready ONLY add when PR is ready to merge/full CI is needed
#30356 opened Dec 9, 2025 by MatthewBonanni Loading…
2 of 5 tasks
Remove virtual engine handling codex kv-connector needs-rebase qwen Related to Qwen models tpu Related to Google TPUs v1
#30350 opened Dec 9, 2025 by WoosukKwon Loading…
Fix gigachat3 parser + update tests frontend tool-calling
#30338 opened Dec 9, 2025 by ajpqs Loading…
3 of 5 tasks
ProTip! Exclude everything labeled bug with -label:bug.