Adding LLMKube to Infrastructure list on README #20212

Merged
ggerganov merged 1 commit into ggml-org:master from Defilan:docs/adding_llmkube
Mar 8, 2026
@Defilan Defilan commented Mar 7, 2026

LLMKube is a Kubernetes operator for llama.cpp. It handles model downloads, GPU scheduling (NVIDIA CUDA and Apple Silicon Metal), health probes, and Prometheus metrics through Model and InferenceService CRDs.

Related: #6546

@Defilan Defilan requested a review from ggerganov as a code owner March 7, 2026 19:58
@ggerganov ggerganov merged commit a950479 into ggml-org:master Mar 8, 2026
1 check passed
bartowski1182 pushed a commit to bartowski1182/llama.cpp that referenced this pull request Mar 10, 2026
Ethan-a2 pushed a commit to Ethan-a2/llama.cpp that referenced this pull request Mar 20, 2026
Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request Apr 26, 2026
rsenthilkumar6 pushed a commit to rsenthilkumar6/llama.cpp that referenced this pull request May 1, 2026