-
-
Notifications
You must be signed in to change notification settings - Fork 11.9k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[IMPROVEMENT] Change MistralReasoningParser behavior
#30391
opened Dec 10, 2025 by
juliendenize
Loading…
3 tasks done
fix: Update json features supported by xGrammar
structured-output
v1
#30390
opened Dec 10, 2025 by
johannesflommersfeld
Loading…
Standardise Related to DeepSeek models
gpt-oss
Related to GPT-OSS models
llama
Related to Llama models
performance
Performance-related issues
qwen
Related to Qwen models
get_rope to use rope_parameters["partial_rotary_factor"], not rotary_dim
deepseek
#30389
opened Dec 10, 2025 by
hmellor
Loading…
[Docs] Generate full list of metrics in user docs
documentation
Improvements or additions to documentation
#30388
opened Dec 10, 2025 by
markmc
Loading…
adding constraint updates of cos-sin to improve mrope performance
#30377
opened Dec 10, 2025 by
wujinyuan1
Loading…
[Fix]fix import error from lmcache
kv-connector
#30376
opened Dec 10, 2025 by
wz1qqx
Loading…
5 tasks
Implement LMDB-based multi-modal cache
ci/build
multi-modality
Related to multi-modality (#4194)
v1
#30373
opened Dec 10, 2025 by
petersalas
Loading…
5 tasks
[Fix] Add default rope theta for qwen1 model
qwen
Related to Qwen models
#30369
opened Dec 10, 2025 by
iwzbi
Loading…
5 tasks
[Bug Fix] Fix Kimi-Linear model initialization crash due to missing 'indexer_rotary_emb' arg
#30366
opened Dec 10, 2025 by
yonasTMC
Loading…
fix(gguf): Auto-select compatible dtype for GGUF models on Blackwell
#30365
opened Dec 9, 2025 by
kitaekatt
Loading…
4 tasks done
Remove all2all backend envvar
ci/build
documentation
Improvements or additions to documentation
#30363
opened Dec 9, 2025 by
elizabetht
Loading…
5 tasks
[WIP] Bump dockerfile to cuda 13.0.2 (for testing)
ci/build
nvidia
#30362
opened Dec 9, 2025 by
dougbtv
Loading…
2 tasks done
[Attention][AMD] Make flash-attn optional
rocm
Related to AMD ROCm
speculative-decoding
v1
#30361
opened Dec 9, 2025 by
mgehre-amd
Loading…
Upstream fp8 with static scales gpt oss
gpt-oss
Related to GPT-OSS models
needs-rebase
#30357
opened Dec 9, 2025 by
maleksan85
•
Draft
[CI][DeepSeek] Add nightly DeepSeek R1 Related to DeepSeek models
ready
ONLY add when PR is ready to merge/full CI is needed
lm_eval tests on H200
ci/build
deepseek
#30356
opened Dec 9, 2025 by
MatthewBonanni
Loading…
2 of 5 tasks
[Fix] Handle multiple tool calls in Qwen3-MTP tool parser
frontend
qwen
Related to Qwen models
tool-calling
#30353
opened Dec 9, 2025 by
ArkVex
Loading…
Remove virtual engine handling
codex
kv-connector
needs-rebase
qwen
Related to Qwen models
tpu
Related to Google TPUs
v1
#30350
opened Dec 9, 2025 by
WoosukKwon
Loading…
[Core] Major fix catch backend grammar exceptions (xgrammar, outlines, etc) in scheduler
v1
#30346
opened Dec 9, 2025 by
blancsw
Loading…
[Bugfix] Fix HunyuanOCR cross-image contamination in batch processing
#30344
opened Dec 9, 2025 by
anker-c2
Loading…
3 of 5 tasks
[CI] refine more logic when generating and using nightly wheels & indices
ci/build
#30341
opened Dec 9, 2025 by
Harry-Chen
Loading…
3 of 5 tasks
Fix gigachat3 parser + update tests
frontend
tool-calling
#30338
opened Dec 9, 2025 by
ajpqs
Loading…
3 of 5 tasks
Previous Next
ProTip!
Exclude everything labeled
bug with -label:bug.