llama : add simple option to enable CPU for MoE weights (--cpu-moe) by slaren · Pull Request #14992 · ggml-org/llama.cpp

slaren · 2025-07-31T16:10:36Z

This is intended to be a simple and curated way to use the CPU for the MoE weights. Internally, it is just setting up the appropriate tensor overrides, but this should be easier to use.
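As a rough illustration of what "setting up the appropriate tensor overrides" means, the sketch below models an override as a (regex, buffer type) pair applied to tensor names. The pattern `\.ffn_(up|down|gate)_exps`, the tensor names, and the helper `assign_buffers` are assumptions made for this example; the thread does not show the regex the option actually installs.

```python
import re

# Assumed pattern for MoE expert weights; the exact regex installed by
# --cpu-moe is not shown in this thread.
MOE_EXPERT_PATTERN = r"\.ffn_(up|down|gate)_exps"


def assign_buffers(tensor_names, overrides, default="GPU"):
    """Map each tensor name to the buffer type of the first matching override.

    Hypothetical helper: tensors that match no override keep the default.
    """
    placement = {}
    for name in tensor_names:
        placement[name] = default
        for pattern, buffer_type in overrides:
            if re.search(pattern, name):
                placement[name] = buffer_type
                break
    return placement


# Illustrative tensor names for a made-up two-layer MoE model.
tensors = [
    "blk.0.attn_q.weight",
    "blk.0.ffn_up_exps.weight",
    "blk.0.ffn_gate_inp.weight",
    "blk.1.ffn_down_exps.weight",
]

placement = assign_buffers(tensors, [(MOE_EXPERT_PATTERN, "CPU")])
for name, buf in placement.items():
    print(f"{name} -> {buf}")
```

Under these assumptions only the expert weight tensors are placed on the CPU; attention weights and the router tensor keep the default placement.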

jacekpoplawski · 2025-07-31T19:16:04Z

Am I correct that this is on/off? It would be better to have an option for the number of layers (similar to -ngl).

slaren · 2025-07-31T21:23:37Z

I am not convinced that it would be worth it. The goal here is to have a very simple option that works well enough for most people. If you want to min-max, you can still use the --override-tensor option to customize it in any way you want.
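For anyone who wants the per-layer control asked about above, one way to approach it with --override-tensor is to generate one `pattern=buffer` entry per layer. This is only a sketch: the `blk.<i>.` tensor naming, the expert-weight regex, and the helper names are assumptions, and the resulting comma-separated string should be checked against the tensor names of the actual model before use.

```python
import re


def layer_moe_overrides(n_layers, buffer_type="CPU"):
    """Build one 'pattern=buffer' override for each of the first n_layers.

    Hypothetical helper; the tensor naming scheme is an assumption.
    """
    return [
        rf"blk\.{i}\.ffn_(up|down|gate)_exps={buffer_type}"
        for i in range(n_layers)
    ]


def matches_any(name, overrides):
    """True if a tensor name matches the pattern part of any override."""
    return any(re.search(o.rsplit("=", 1)[0], name) for o in overrides)


overrides = layer_moe_overrides(2)
# Comma-separated form, intended as the value of --override-tensor.
print(",".join(overrides))
```

Because each pattern ends the layer index with an escaped dot, the entry for layer 1 does not also match layer 10 or 11.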

jacekpoplawski · 2025-08-01T04:18:24Z

Yes, I understand. And now I have an idea for my experiments :)

llama : add simple option to enable CPU for MoE weights (--cpu-moe)

8833f22

ggerganov approved these changes Jul 31, 2025

Comment thread common/arg.cpp

slaren merged commit a06ed5f into master Jul 31, 2025
47 checks passed

slaren deleted the sl/moe-switch branch July 31, 2025 18:15

Nexesenex pushed a commit to Nexesenex/croco.cpp that referenced this pull request Aug 1, 2025

llama : add simple option to enable CPU for MoE weights (--cpu-moe) (g…

630e9a6

…gml-org#14992)

slaren mentioned this pull request Aug 4, 2025

llama : add --n-cpu-moe option #15077

Merged

blime4 referenced this pull request in blime4/llama.cpp Feb 5, 2026

llama : add simple option to enable CPU for MoE weights (--cpu-moe) (…

26d0416

…#14992)

Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request Apr 26, 2026

llama : add simple option to enable CPU for MoE weights (--cpu-moe) (g…

ae273eb

…gml-org#14992)

slaren merged 1 commit into master from sl/moe-switch

3 participants
