Update modernbertdecoder docs by orionw · Pull Request #39453 · huggingface/transformers

orionw · 2025-07-16T14:24:35Z

What does this PR do?

Updates the docs for the ModernBERTDecoder class to link to the paper and to the correct model name.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@stevhliu for docs, if you have a sec!

stevhliu

Thanks for updating, I think you can remove the ## Usage tips section at the bottom since most of the information is already incorporated in the card.

It may also be nice to include a quantization example with the ettin-decoder-1b checkpoint!

Thanks to @stevhlui! Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

orionw · 2025-07-18T14:16:13Z

Thanks so much for the suggestions @stevhliu, added them!

It may also be nice to include a quantization example with the ettin-decoder-1b checkpoint!

I actually am not super good at quantization (others were super kind and contributed an onyx version to some models that IDK how to do), so I might just leave it for now - although I love the idea!

stevhliu · 2025-07-18T21:41:35Z

The snippet below should work for quantization! :)

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

quantization_config = BitsAndBytesConfig(
    load_in_8bit=True,
)

tokenizer = AutoTokenizer.from_pretrained("jhu-clsp/ettin-decoder-1b")
model = AutoModelForCausalLM.from_pretrained(
    "jhu-clsp/ettin-decoder-1b",
    torch_dtype=torch.float16,
    device_map="auto",
    quantization_config=quantization_config
)

prompt = "The future of artificial intelligence is"
inputs = tokenizer(prompt, return_tensors="pt").to("cuda")

with torch.no_grad():
    outputs = model.generate(
        **inputs,
        max_length=50,
        num_return_sequences=1,
        temperature=0.7,
        do_sample=True,
        pad_token_id=tokenizer.eos_token_id
    )

generated_text = tokenizer.decode(outputs[0], skip_special_tokens=True)
print(f"Generated text: {generated_text}")

orionw · 2025-07-21T00:18:07Z

Awesome, thank you so much @stevhliu, added that to the model card!

Is there anything else to add for this PR or is it good to go?

HuggingFaceDocBuilderDev · 2025-07-21T18:51:06Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

stevhliu · 2025-07-21T18:54:02Z

I think for this PR, we should still:

remove the ## Usage tips section at the bottom since most of the information is already incorporated in the card
add the quantization example to this doc as well

orionw · 2025-07-21T21:05:59Z

Perfect, sorry I missed that. Added now!

stevhliu

ty ❤️

* update docs with paper and real model * nit * Apply suggestions from code review Thanks to @stevhlui! Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Remove usage examples, add quantization --------- Co-authored-by: oweller2 <oweller2@dsailogin.mgmt.ai.cluster> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

oweller2 added 2 commits July 16, 2025 10:22

update docs with paper and real model

d49438b

nit

8e032f5

stevhliu reviewed Jul 16, 2025

View reviewed changes

Comment thread docs/source/en/model_doc/modernbert-decoder.md Outdated

Comment thread docs/source/en/model_doc/modernbert-decoder.md

Comment thread docs/source/en/model_doc/modernbert-decoder.md

Apply suggestions from code review

eefaaf6

Thanks to @stevhlui! Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

stevhliu mentioned this pull request Jul 19, 2025

[Community contributions] Model cards #36979

Closed

Remove usage examples, add quantization

cb112b3

stevhliu approved these changes Jul 21, 2025

View reviewed changes

stevhliu merged commit 641aaed into huggingface:main Jul 21, 2025
15 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update modernbertdecoder docs#39453

Update modernbertdecoder docs#39453
stevhliu merged 4 commits intohuggingface:mainfrom
JHU-CLSP:update-modernbertdecoder-docs

orionw commented Jul 16, 2025

Uh oh!

stevhliu left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

orionw commented Jul 18, 2025 •

edited

Loading

Uh oh!

stevhliu commented Jul 18, 2025

Uh oh!

orionw commented Jul 21, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Jul 21, 2025

Uh oh!

stevhliu commented Jul 21, 2025

Uh oh!

orionw commented Jul 21, 2025 •

edited

Loading

Uh oh!

stevhliu left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

orionw commented Jul 16, 2025

What does this PR do?

Before submitting

Who can review?

Uh oh!

stevhliu left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

orionw commented Jul 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

stevhliu commented Jul 18, 2025

Uh oh!

orionw commented Jul 21, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Jul 21, 2025

Uh oh!

stevhliu commented Jul 21, 2025

Uh oh!

orionw commented Jul 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

stevhliu left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

orionw commented Jul 18, 2025 •

edited

Loading

orionw commented Jul 21, 2025 •

edited

Loading