This repository was archived by the owner on Jan 24, 2024. It is now read-only.

Conversation

@peakji (Member) commented Jun 9, 2023

No description provided.

@peakji requested a review from fardeon on June 9, 2023 at 02:58
@codecov-commenter

Codecov Report

Patch coverage: 33.33%; project coverage change: -0.27% ⚠️

Comparison: base (0c4b8a5) at 94.54% vs. head (2338e60) at 94.27%.


Additional details and impacted files
@@            Coverage Diff             @@
##           master     #209      +/-   ##
==========================================
- Coverage   94.54%   94.27%   -0.27%     
==========================================
  Files           7        7              
  Lines         330      332       +2     
==========================================
+ Hits          312      313       +1     
- Misses         18       19       +1     
Impacted Files        Coverage Δ
basaran/model.py      88.46% <0.00%> (-0.58%) ⬇️
basaran/__init__.py   96.55% <100.00%> (+0.12%) ⬆️

☔ View full report in Codecov by Sentry.

@LoopControl commented Jun 9, 2023

@peakji Thanks for working on this.

Just wanted to mention that while QLoRA 4-bit is a good option to have, it is around 8x slower at inference than GPTQ/AutoGPTQ.

I hope AutoGPTQ support is added one day in addition to this. (Also, GPTQ model downloads are roughly a quarter of the size of full-precision models, so it can save a lot of disk space as well.)
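
For reference, 4-bit loading of this kind is typically wired through the Hugging Face transformers / bitsandbytes APIs. The sketch below is illustrative only: it assumes the `load_in_4bit` / `BitsAndBytesConfig` path from transformers and uses a placeholder model name; it is not taken from this PR's diff, and the actual wiring inside basaran/model.py may differ.

```python
# Minimal, illustrative sketch of 4-bit loading with transformers + bitsandbytes
# (transformers >= 4.30). Names follow the Hugging Face API, not this PR's code.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

name = "bigscience/bloomz-560m"  # placeholder model; substitute your own

quant_config = BitsAndBytesConfig(
    load_in_4bit=True,                     # store weights in 4-bit (NF4 by default)
    bnb_4bit_compute_dtype=torch.float16,  # do the compute in fp16 for speed
)

tokenizer = AutoTokenizer.from_pretrained(name)
model = AutoModelForCausalLM.from_pretrained(
    name,
    quantization_config=quant_config,
    device_map="auto",                     # spread layers across available devices
)

inputs = tokenizer("Hello, world!", return_tensors="pt").to(model.device)
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=20)[0]))
```

The trade-off LoopControl describes is between this bitsandbytes path (simple to enable on full-precision checkpoints, but slower generation) and GPTQ-quantized checkpoints (faster inference and much smaller downloads, but requiring pre-quantized weights).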

@peakji merged commit 5e34a84 into master on Jun 10, 2023
@peakji deleted the load-in-4bit branch on June 10, 2023 at 03:09
