prismml

Here are 3 public repositories matching this topic...

carlosfundora / llama.cpp-1-bit-turbo

HIP/ROCm fork of llama.cpp optimized for AMD gfx1030/RDNA2 architecture with support for PrismML's Bonsai Q1_0_G128 '1-bit' models, TurboQuant TQ3_0 KV cache, RotorQuant (iso and planar), and EAGLE3 speculative decoding.

hip quantization bonsai rocm amd-gpu llama-cpp gguf rdna2 turboquant prismml gfx1030

Updated Apr 7, 2026
C++

carlosfundora / sglang-1-bit-turbo

Star

ROCm/HIP fork of SGLang with TurboQuant tq2/tq3/tq4 KV cache, RotorQuant (iso and planar), Triton and radix-cache serving, EAGLE3 speculative decoding, P-EAGLE checkpoint support, and PrismML Bonsai 1-bit GGUF compatibility on gfx1030/RDNA2.

triton hip bonsai rocm amd-gpu gguf speculative-decoding sglang rdna2 eagle3 turboquant prismml gfx1030 p-eagle radix-cache

Updated Apr 9, 2026
Python

Kxrbx / BonsaiDesk

Star

Local Prism-powered chat app for Bonsai GGUF models

react windows prism bonsai chat-ui vite fastapi llama-cpp local-ai gguf prismml

Updated Apr 4, 2026
TypeScript

Improve this page

Add a description, image, and links to the prismml topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the prismml topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly