Skip to content

Update tiny model creation script#45241

Merged
ydshieh merged 10 commits intomainfrom
fix_tiny_model_script
Apr 4, 2026
Merged

Update tiny model creation script#45241
ydshieh merged 10 commits intomainfrom
fix_tiny_model_script

Conversation

@ydshieh
Copy link
Copy Markdown
Collaborator

@ydshieh ydshieh commented Apr 4, 2026

What does this PR do?

After the series of fixes in other previous PRs, we can now update the tiny model creation script. This update makes the script running without any failure, just 10 warnings.

There are many # TODO, some of them may just be quick remarks only. I decide to push and merge without removing them, so we still have the context to further improve the script to be more robust and clean.

The workflow file is also changed, so it could run on a daily basis, for us to check if there is any issue with the more future PRs merged into main. It doesn't upload the tiny models to the hub at this moment, which is a task for me to work on in a separate PR.

ydshieh and others added 9 commits April 4, 2026 14:25
- Remove leftover debug artifacts: commented-out `from transformers import *`,
  all `# breakpoint()` calls
- Move inline imports to top-level: `importlib`, `re`, `TokenizersBackend`,
  `VibeVoiceAcousticTokenizerFeatureExtractor`, `FastSpeech2ConformerTokenizer`
- Fix 3-space indent in `_to_tuple`
- Remove no-op in `fill_result_with_error`
- Add `CHECKPOINT_REVISIONS` and `CHECKPOINT_SUBFOLDERS` module-level dicts to
  replace repetitive if/elif chains in `build_processor`
- Extract `_build_model_tester_and_get_config` helper to eliminate duplicated
  tester instantiation + config-retrieval logic in `get_tiny_config`

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
@HuggingFaceDocBuilderDev
Copy link
Copy Markdown

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@ydshieh ydshieh merged commit eb981ae into main Apr 4, 2026
20 checks passed
@ydshieh ydshieh deleted the fix_tiny_model_script branch April 4, 2026 17:19
louzongzhi pushed a commit to louzongzhi/transformers that referenced this pull request Apr 6, 2026
* clean

* cleanup 1

* Minor cleanups to create_dummy_models.py for readability

- Remove leftover debug artifacts: commented-out `from transformers import *`,
  all `# breakpoint()` calls
- Move inline imports to top-level: `importlib`, `re`, `TokenizersBackend`,
  `VibeVoiceAcousticTokenizerFeatureExtractor`, `FastSpeech2ConformerTokenizer`
- Fix 3-space indent in `_to_tuple`
- Remove no-op in `fill_result_with_error`
- Add `CHECKPOINT_REVISIONS` and `CHECKPOINT_SUBFOLDERS` module-level dicts to
  replace repetitive if/elif chains in `build_processor`
- Extract `_build_model_tester_and_get_config` helper to eliminate duplicated
  tester instantiation + config-retrieval logic in `get_tiny_config`

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* remove other bad blocks

* remove comment

* fix wrong _models

* revert

* fix

* fix

* update workflow (to be continued)

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
louzongzhi pushed a commit to louzongzhi/transformers that referenced this pull request Apr 6, 2026
* clean

* cleanup 1

* Minor cleanups to create_dummy_models.py for readability

- Remove leftover debug artifacts: commented-out `from transformers import *`,
  all `# breakpoint()` calls
- Move inline imports to top-level: `importlib`, `re`, `TokenizersBackend`,
  `VibeVoiceAcousticTokenizerFeatureExtractor`, `FastSpeech2ConformerTokenizer`
- Fix 3-space indent in `_to_tuple`
- Remove no-op in `fill_result_with_error`
- Add `CHECKPOINT_REVISIONS` and `CHECKPOINT_SUBFOLDERS` module-level dicts to
  replace repetitive if/elif chains in `build_processor`
- Extract `_build_model_tester_and_get_config` helper to eliminate duplicated
  tester instantiation + config-retrieval logic in `get_tiny_config`

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* remove other bad blocks

* remove comment

* fix wrong _models

* revert

* fix

* fix

* update workflow (to be continued)

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
sirzechs66 pushed a commit to sirzechs66/transformers that referenced this pull request Apr 18, 2026
* clean

* cleanup 1

* Minor cleanups to create_dummy_models.py for readability

- Remove leftover debug artifacts: commented-out `from transformers import *`,
  all `# breakpoint()` calls
- Move inline imports to top-level: `importlib`, `re`, `TokenizersBackend`,
  `VibeVoiceAcousticTokenizerFeatureExtractor`, `FastSpeech2ConformerTokenizer`
- Fix 3-space indent in `_to_tuple`
- Remove no-op in `fill_result_with_error`
- Add `CHECKPOINT_REVISIONS` and `CHECKPOINT_SUBFOLDERS` module-level dicts to
  replace repetitive if/elif chains in `build_processor`
- Extract `_build_model_tester_and_get_config` helper to eliminate duplicated
  tester instantiation + config-retrieval logic in `get_tiny_config`

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

* remove other bad blocks

* remove comment

* fix wrong _models

* revert

* fix

* fix

* update workflow (to be continued)

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants