jd-opensource / xllm Public

Notifications You must be signed in to change notification settings
Fork 91
Star 807

Code
Issues 42
Pull requests 18
Discussions
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Pull requests: jd-opensource/xllm

Labels 14 Milestones 0

New pull request New

18 Open 403 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

refactor: remove MTP-specific function requirement from non-MTP models.

#509 opened Dec 9, 2025 by yingxudeng

Loading…

feat: vlm support binary mm input.

#507 opened Dec 9, 2025 by xiao-yu-chen

Loading…

bugfix: support multiple models or multiple model version with independent instance.

#505 opened Dec 9, 2025 by liujinguang0125

Loading…

bugfix: fix the issue of missing MMData input during engine ->worker transfer via brpc format.

#501 opened Dec 8, 2025 by magicheng0816

Loading…

refactor: implement Programmatic Dependent Launch (PDL) support in Device class for cuda device.

#500 opened Dec 8, 2025 by XuZhang99

Loading…

feat: add moe all2all kernels and deep ep layer.

#497 opened Dec 8, 2025 by a120092009

Loading…

bugfix: fix the issue of ineffective input embedding transmission.

#490 opened Dec 5, 2025 by magicheng0816

Loading…

refactor: separate the weight loading in the npu layer class.

#489 opened Dec 5, 2025 by Clement-Wang26

Loading…

feat: optimize prefetch from kv cache store.

#486 opened Dec 4, 2025 by Kang-Meng

Loading…

feat: add wrappers for ATB and ACLNN fused operators.

#474 opened Dec 2, 2025 by yingxudeng

Loading…

feat: add mm embedding model and its factory.

#471 opened Dec 2, 2025 by dongxianzhe

Loading…

refactor: separate mlu and cuda version Qwen model implementation. cuda

#468 opened Dec 1, 2025 by XuZhang99

Loading…

feat: support deepseek mtp on mlu. mlu

#454 opened Nov 28, 2025 by a120092009

Loading…

refactor: optimize unique token count preparation of batch input builder.

#449 opened Nov 27, 2025 by RobbieLeung

Loading…

[WIP] feat: support loading model weights and forward overlap.

#441 opened Nov 26, 2025 by Clement-Wang26

Loading…

feat: support Qwen2-VL & GME-Qwen2-VL model on npu device.

#399 opened Nov 18, 2025 by xanecdotex

Loading…

feat: enable torch_npu graph mode for Qwen-3 dense with TP support.

#325 opened Nov 6, 2025 by yingxudeng

Loading…

【WIP】feat: add rec framwork.

#305 opened Oct 31, 2025 by DragonFive

Loading…

3 of 5 tasks

ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!