Uniform kwargs for processors of audio-text models by leloykun · Pull Request #32906 · huggingface/transformers

leloykun · 2024-08-21T00:10:16Z

What does this PR do?

Uniformizes kwargs for processors of audio-text models.
An extension of Uniform kwargs for processors #31911
NOTE: don't review or merge this until the following PR is complete: Uniformize model processors (models *with* special arg names) #32841
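For readers new to the effort tracked in #31911, here is a rough sketch of the idea behind "uniform kwargs": flat keyword arguments passed to a processor are sorted into per-modality groups, with call-time values overriding defaults. Everything in it is an assumption made for illustration (the function name `merge_modality_kwargs`, the key sets, the group names); it is not the code in this PR or in `transformers`.

```python
from typing import Any, Dict

# Hypothetical key sets, chosen only for illustration; real processors
# declare their own accepted kwargs.
TEXT_KEYS = {"padding", "truncation", "max_length"}
AUDIO_KEYS = {"sampling_rate"}
COMMON_KEYS = {"return_tensors"}


def merge_modality_kwargs(
    defaults: Dict[str, Dict[str, Any]], **kwargs: Any
) -> Dict[str, Dict[str, Any]]:
    """Route flat kwargs into per-modality groups, overriding defaults."""
    # Start from a copy of the defaults so the caller's dict is not mutated.
    merged = {
        "text_kwargs": dict(defaults.get("text_kwargs", {})),
        "audio_kwargs": dict(defaults.get("audio_kwargs", {})),
        "common_kwargs": dict(defaults.get("common_kwargs", {})),
    }
    for key, value in kwargs.items():
        if key in TEXT_KEYS:
            merged["text_kwargs"][key] = value
        elif key in AUDIO_KEYS:
            merged["audio_kwargs"][key] = value
        elif key in COMMON_KEYS:
            merged["common_kwargs"][key] = value
        else:
            # Unknown names fail loudly instead of being silently dropped.
            raise TypeError(f"Unexpected keyword argument: {key!r}")
    return merged


result = merge_modality_kwargs(
    {"text_kwargs": {"padding": False}},
    padding="max_length",
    sampling_rate=16000,
    return_tensors="pt",
)
```

With this kind of routing, every processor can accept the same call shape while the tokenizer and the feature extractor each receive only the arguments meant for them.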

TODO Models:

TODO tests

Add audio-text-specific processor tests
Remove unnecessary/duplicated tests

Models with special args (will not be done in this PR):

Pop2Piano

Models with weird _in_target_context_manager logic (will not be done in this PR):

MusicGen
SpeechToText
Wav2Vec2
Wav2Vec2 w/ LM
Whisper
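As background on why these models are set aside, the pattern in question can be sketched roughly as follows. This is a simplified toy with stand-in callables, assuming the flag switches which sub-processor `__call__` forwards to; the class name `ToyAudioProcessor` is hypothetical and the real processors carry more logic.

```python
from contextlib import contextmanager


class ToyAudioProcessor:
    """Toy stand-in showing a stateful `__call__` driven by a context flag."""

    def __init__(self, feature_extractor, tokenizer):
        self.feature_extractor = feature_extractor
        self.tokenizer = tokenizer
        self.current_processor = feature_extractor
        self._in_target_context_manager = False

    def __call__(self, *args, **kwargs):
        # Inside the context manager, every call is forwarded to the
        # currently selected sub-processor (the tokenizer).
        if self._in_target_context_manager:
            return self.current_processor(*args, **kwargs)
        return self.feature_extractor(*args, **kwargs)

    @contextmanager
    def as_target_processor(self):
        # Temporarily switch the processor to "target" (text) mode.
        self._in_target_context_manager = True
        self.current_processor = self.tokenizer
        try:
            yield
        finally:
            self.current_processor = self.feature_extractor
            self._in_target_context_manager = False


# Stand-in callables instead of a real feature extractor and tokenizer.
processor = ToyAudioProcessor(
    feature_extractor=lambda x, **kw: ("audio", x),
    tokenizer=lambda x, **kw: ("text", x),
)

before = processor("x")
with processor.as_target_processor():
    inside = processor("x")
after = processor("x")
```

Because the same call means different things depending on hidden state, routing kwargs to the right modality is less direct for these processors than for stateless ones.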

Fixes # (issue)

Uniform kwargs for processors #31911

Who can review?

@zucchini-nlp @molbap @yonigozlan


leloykun · 2024-08-21T14:40:56Z

@zucchini-nlp the tests are failing because of this: #32921

leloykun added 26 commits August 16, 2024 15:38

uniformize kwargs of Chameleon

2f4163a

fix linter nit

2588144

rm stride default

6454130

add tests for chameleon processor

9949e72

fix tests

58c6b53

fix chameleon tests

6592ce3

don't hardcode arg names

c4f5474

uniformize processor kwargs of altclip, bridgetower, flava, instructblipvideo, llava_next, llava_next_video, siglip, video_llava, vilt

ce9cc73

fix linter issue

d325914

address @zucchini-nlp's comments

935d6e5

improve docs

39650f6

don't dw from hub for video tests

539da9d

add video processing tests for instructblipvideo & video_llava

c8b2384

add git, mgp, tvp, & x-clip

423d864

fix docs

5fd2c32

address @zucchini-nlp's comments

9e00f68

simplify implementations

a2672a6

uniformize implementations of make_batched_videos and make_batched_images

721d1c8

fix instructblipvideo tests

c0f3abb

fix copies

bb5debd

fix make_batched_videos

d9bc2e9

fix MGP-str

f6e7914

fix make_batched_videos

acd2c56

fix make_batched_videos

5c39f4f

fix make_batched_videos

ea06e45

uniformize kwargs for audio-text processors

44023bc

leloykun mentioned this pull request Aug 21, 2024

Uniform kwargs for processors #31911

Closed

40 tasks

leloykun added 2 commits August 21, 2024 14:45

add clap, clvp, musicgen melody, qwen2, & seamless m4t

ea3d36e

fix wav2vec2 bert & speecht5

3e46327

leloykun mentioned this pull request Aug 21, 2024

Fix regression on Processor.save_pretrained caused by #31691 #32921

Merged

leloykun wants to merge 28 commits into huggingface:main from leloykun:fc--uniformize-processor-kwargs-audio-text
