Official PyTorch implementation of "GuidedQuant: Large Language Model Quantization via Exploiting End Loss Guidance" (ICML 2025)
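As a rough illustration of the end-loss-guidance idea (not GuidedQuant's actual algorithm; every name below is hypothetical), a quantizer can weight each parameter's rounding error by the squared gradient of the end loss, so loss-critical weights are rounded more carefully:

```python
import torch

def loss_guided_error(w: torch.Tensor, w_q: torch.Tensor, grad: torch.Tensor) -> torch.Tensor:
    """Quantization error weighted by squared end-loss gradients.

    A diagonal, Fisher-style proxy: parameters with large dL/dw contribute
    more to the objective. Hypothetical illustration, not the GuidedQuant
    objective itself.
    """
    return ((grad ** 2) * (w - w_q) ** 2).sum()

# Toy usage: compare bit-widths under the guided objective.
torch.manual_seed(0)
w = torch.randn(256)
grad = torch.randn(256)  # stand-in for dL/dw from a calibration batch
for bits in (2, 3, 4):
    scale = w.abs().max() / (2 ** (bits - 1) - 1)
    w_q = (w / scale).round().clamp(-(2 ** (bits - 1)), 2 ** (bits - 1) - 1) * scale
    print(bits, loss_guided_error(w, w_q, grad).item())
```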
[CAAI AIR'24] Minimize Quantization Output Error with Bias Compensation
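The title points at the classic bias-correction trick: weight quantization shifts a layer's mean output, and that shift can be folded back into the layer bias. A minimal sketch of that generic trick (not necessarily the paper's exact method), assuming a linear layer and a batch of calibration inputs:

```python
import torch

def bias_compensation(W: torch.Tensor, W_q: torch.Tensor,
                      b: torch.Tensor, x_calib: torch.Tensor) -> torch.Tensor:
    """Fold the mean output error of weight quantization into the bias.

    For y = x @ W.T + b, quantizing W shifts the output mean by
    mean(x) @ (W_q - W).T; subtracting that shift from b removes it.
    """
    x_mean = x_calib.mean(dim=0)          # E[x] over calibration data
    return b - x_mean @ (W_q - W).T

# Toy check: the compensated layer matches the original output mean.
torch.manual_seed(0)
W = torch.randn(8, 16); b = torch.randn(8); x = torch.randn(1024, 16)
scale = W.abs().max() / 7                 # 4-bit symmetric grid
W_q = (W / scale).round().clamp(-8, 7) * scale
b_c = bias_compensation(W, W_q, b, x)
print((x @ W.T + b).mean(0) - (x @ W_q.T + b_c).mean(0))  # ~0
```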
Open quantization tooling for TurboQuant-style low-bit LLM releases, stock GGUF deployment, and Apple Silicon runtime experiments.
Enable expert-level, multi-step diagnostic reasoning in Claude Code with an easy-to-use skill for clear and explainable AI diagnosis.
Deeper research into TurboQuant algorithms
Shift-based post-training quantization analysis for LLMs (ShiftQuant paper)
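Shift-based quantization usually means restricting weights to signed powers of two so that multiplies become bit shifts. A minimal power-of-two rounding sketch (ShiftQuant's concrete scheme may differ):

```python
import torch

def quantize_pow2(w: torch.Tensor, min_exp: int = -8, max_exp: int = 0) -> torch.Tensor:
    """Round magnitudes to the nearest power of two, keeping the sign.

    Power-of-two levels let a matmul replace multiplies with bit shifts,
    which is the usual motivation for shift-based quantization.
    """
    exp = torch.log2(w.abs().clamp(min=2.0 ** min_exp)).round().clamp(min_exp, max_exp)
    return w.sign() * torch.pow(2.0, exp)  # sign(0) == 0, so zeros stay zero

torch.manual_seed(0)
w = torch.randn(6)
print(w)
print(quantize_pow2(w))
```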
A tool for making GGUF files quickly
LLM quantization project built around `llama.cpp` + `Ollama` + `GGUF`
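The usual `llama.cpp` route to a quantized GGUF file is to convert a Hugging Face checkpoint and then quantize it. A sketch assuming a local `llama.cpp` checkout with the quantize tool built (script and binary names follow recent `llama.cpp` releases and may vary by version; all paths here are hypothetical):

```python
import subprocess

# Hypothetical paths; adjust to your llama.cpp checkout and model directory.
HF_MODEL_DIR = "models/my-llm"
F16_GGUF = "models/my-llm-f16.gguf"
Q4_GGUF = "models/my-llm-q4_k_m.gguf"

# 1. Convert a Hugging Face checkpoint to a full-precision GGUF file
#    (run from the llama.cpp repo root).
subprocess.run(
    ["python", "convert_hf_to_gguf.py", HF_MODEL_DIR, "--outfile", F16_GGUF],
    check=True,
)

# 2. Quantize the GGUF file; Q4_K_M is a common 4-bit preset.
subprocess.run(["./llama-quantize", F16_GGUF, Q4_GGUF, "Q4_K_M"], check=True)
```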
PentaNet extends BitNet's ternary quantization to pentanary {-2,-1,0,+1,+2}, improving perplexity by 6.4% at 124M params while preserving zero-multiplier arithmetic.
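A minimal sketch of pentanary rounding under an assumed mean-absolute scale (PentaNet's actual scale and rounding rule may differ). Multiplying by these levels still needs no multiplier: ±1 is a copy, ±2 is a one-bit shift, and 0 is skipped:

```python
import torch

def quantize_pentanary(w: torch.Tensor) -> tuple[torch.Tensor, torch.Tensor]:
    """Round weights to the pentanary grid {-2,-1,0,+1,+2} with a per-tensor scale.

    The scale choice (mean absolute weight) is a hypothetical stand-in,
    not PentaNet's published recipe.
    """
    scale = w.abs().mean().clamp(min=1e-8)
    q = (w / scale).round().clamp(-2, 2)
    return q, scale

torch.manual_seed(0)
w = torch.randn(4, 4)
q, s = quantize_pentanary(w)
print(q)                          # integer levels in {-2,-1,0,+1,+2}
print((q * s - w).abs().mean())   # mean quantization error
```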
Develop an on-device AI system that processes and analyzes complaints using lightweight, fine-tuned LLMs optimized for industrial field use.
Implementation of advanced Natural Language Processing architectures and optimization techniques, built from scratch. The projects focus on understanding the internal mechanics of Transformers, LLM efficiency through quantization, and scaling via Mixture-of-Experts (MoE).