Update Model Card for Encoder Decoder Model#39272
Update Model Card for Encoder Decoder Model#39272stevhliu merged 15 commits intohuggingface:mainfrom
Conversation
stevhliu
left a comment
There was a problem hiding this comment.
Thanks for working on this one!
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
…gEkbote/transformers into Update-Encoder-Decoder-Card
stevhliu
left a comment
There was a problem hiding this comment.
Nice job, just a few more things!
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
|
|
||
| [`EncoderDecoderModel`] can be randomly initialized from an encoder and a decoder config. In the following example, we show how to do this using the default [`BertModel`] configuration for the encoder and the default [`BertForCausalLM`] configuration for the decoder. | ||
| </hfoption> | ||
| <hfoption id="AutoModel"> |
There was a problem hiding this comment.
The AutoModel example should be unchanged from the last review except for adding attn_implementation=sdpa so that it's aligned with the Pipeline example to show how users can achieve the same thing using the two separate APIs.
This translation example here in the AutoModel section should update the translation example in the ## Notes (which doesn't work by the way).
There was a problem hiding this comment.
I have updated the example. Could you please review?
|
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update. |
* update model card. * add back the model contributors for mamba and mamba2. * update the model card. * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * update batches with correct alignment. * update examples and remove quantization example. * update the examples. * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * update example. * correct the example. --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* update model card. * add back the model contributors for mamba and mamba2. * update the model card. * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * update batches with correct alignment. * update examples and remove quantization example. * update the examples. * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * update example. * correct the example. --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* update model card. * add back the model contributors for mamba and mamba2. * update the model card. * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * update batches with correct alignment. * update examples and remove quantization example. * update the examples. * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * update example. * correct the example. --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* update model card. * add back the model contributors for mamba and mamba2. * update the model card. * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * update batches with correct alignment. * update examples and remove quantization example. * update the examples. * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * update example. * correct the example. --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* update model card. * add back the model contributors for mamba and mamba2. * update the model card. * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * update batches with correct alignment. * update examples and remove quantization example. * update the examples. * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * update example. * correct the example. --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* update model card. * add back the model contributors for mamba and mamba2. * update the model card. * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * update batches with correct alignment. * update examples and remove quantization example. * update the examples. * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * update example. * correct the example. --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* update model card. * add back the model contributors for mamba and mamba2. * update the model card. * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * update batches with correct alignment. * update examples and remove quantization example. * update the examples. * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * update example. * correct the example. --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* update model card. * add back the model contributors for mamba and mamba2. * update the model card. * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * update batches with correct alignment. * update examples and remove quantization example. * update the examples. * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * update example. * correct the example. --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
What does this PR do?
As described in the issue, this PR updates the model card for the encoder decoder model with an additional translation example. I have also re-added the contributor names for the Mamba and Mamba-2 models which were previously removed by me. Please let me know if any modifications are required and I will make the necessary changes.
Fixes #8944
Refs #36979
Before submitting
Who can review?
@stevhliu