Update Model Card for Mamba #37863
Conversation
Hi 👋, thank you for opening this pull request! The pull request is converted to draft by default. The CI will be paused while the PR is in draft mode. When it is ready for review, please click the Ready for review button.
stevhliu
left a comment
Thanks!
Maybe let's also add a quantization example with https://huggingface.co/state-spaces/mamba-2.8b-hf.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, TorchAoConfig

# int4 weight-only quantization via torchao
quantization_config = TorchAoConfig("int4_weight_only", group_size=128)

tokenizer = AutoTokenizer.from_pretrained("state-spaces/mamba-2.8b-hf")
model = AutoModelForCausalLM.from_pretrained(
    "state-spaces/mamba-2.8b-hf",
    torch_dtype=torch.bfloat16,
    quantization_config=quantization_config,
    device_map="auto",
)

input_ids = tokenizer("Plants create energy through a process known as", return_tensors="pt").to("cuda")
output = model.generate(**input_ids)
print(tokenizer.decode(output[0], skip_special_tokens=True))

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
I've added this example and updated it to use the new AOBaseConfig-based approach, since the string-based approach above is deprecated, as mentioned in the docs. Could you please review the changes? cc: @stevhliu
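For reference, a minimal sketch of what the updated example could look like under the AOBaseConfig-based approach. It assumes the Int4WeightOnlyConfig class from torchao.quantization and that TorchAoConfig accepts such a config object via its quant_type argument, per the torchao quantization docs; check the exact names against your installed torchao and transformers versions.

import torch
from torchao.quantization import Int4WeightOnlyConfig  # assumed import path, per torchao docs
from transformers import AutoModelForCausalLM, AutoTokenizer, TorchAoConfig

# Build an AOBaseConfig object instead of passing a deprecated string shortcut
quant_config = Int4WeightOnlyConfig(group_size=128)
quantization_config = TorchAoConfig(quant_type=quant_config)

tokenizer = AutoTokenizer.from_pretrained("state-spaces/mamba-2.8b-hf")
model = AutoModelForCausalLM.from_pretrained(
    "state-spaces/mamba-2.8b-hf",
    torch_dtype=torch.bfloat16,
    quantization_config=quantization_config,
    device_map="auto",
)

input_ids = tokenizer("Plants create energy through a process known as", return_tensors="pt").to("cuda")
output = model.generate(**input_ids)
print(tokenizer.decode(output[0], skip_special_tokens=True))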
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
What does this PR do?
As described in the issue, this PR updates the model card for Mamba. Please let me know if any modifications are required and I will make the necessary changes.
Refs #36979
Before submitting
Who can review?
@stevhliu