Rename Speech Dataset Processor to Speech Data Processor#5378
Merged
erastorgueva-nv merged 2 commits intor1.13.0from Nov 9, 2022
Merged
Rename Speech Dataset Processor to Speech Data Processor#5378erastorgueva-nv merged 2 commits intor1.13.0from
erastorgueva-nv merged 2 commits intor1.13.0from
Conversation
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Kipok
approved these changes
Nov 9, 2022
github-actions bot
pushed a commit
that referenced
this pull request
Nov 9, 2022
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
erastorgueva-nv
added a commit
that referenced
this pull request
Nov 10, 2022
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com>
1-800-BAD-CODE
pushed a commit
to 1-800-BAD-CODE/NeMo
that referenced
this pull request
Nov 13, 2022
…#5378) (NVIDIA-NeMo#5381) Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Signed-off-by: 1-800-bad-code <shane.carroll@utsa.edu>
Davood-M
added a commit
that referenced
this pull request
Nov 15, 2022
* remove stage wrapper from parallel
Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
* fixed the onnx bug in conformer for non-streaming models. (#5242)
Signed-off-by: Vahid <vnoroozi@nvidia.com>
Signed-off-by: Vahid <vnoroozi@nvidia.com>
* [Tools][ASR] Tool for generating data using simulated RIRs (#5158)
[Tools][ASR] Tool for generating data using simulated RIRs
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Add fully torch.jit.script-able speaker clustering module (#5191)
* Add files for commit
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Added parallelism on p-value search
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Changed speaker clustering to accept torch.tensor
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Cleaned up the code and tested to have identical output
Signed-off-by: Taejin Park <tango4j@gmail.com>
* update on Notebook demo
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Added eigvalsh for faster eig val calculation:
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Remove NMESC_JitScriptedModule.ipynb
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Cleaned code and style fix
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Modified MSDD framework to fit torch-scripted clustering
Signed-off-by: Taejin Park <tango4j@gmail.com>
* LGTM fix
Signed-off-by: Taejin Park <tango4j@gmail.com>
* removed all string based timestamps
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Removed unnecessary lines
Signed-off-by: Taejin Park <tango4j@gmail.com>
* removed redundant lines
Signed-off-by: Taejin Park <tango4j@gmail.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Update perturb.py (#5231)
* Update perturb.py
Add checking for channels mismatch for audio and noise data, throw an exception if they have different number of channels. Also fixed `perturb_with_foreground_noise` as done in `perturb_with_input_noise`
Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* update check and teest
Signed-off-by: stevehuang52 <heh@nvidia.com>
* fix test
Signed-off-by: stevehuang52 <heh@nvidia.com>
Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Signed-off-by: stevehuang52 <heh@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* remove CV requirements. (#5233)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* Fix link to inference notebook (#5247) (#5251)
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Co-authored-by: Jocelyn <jocelynh@nvidia.com>
* checks for accepted adapter type at module level (#5194)
* add accepted adapter functionality into transformer, mlp and attention
Signed-off-by: arendu <adithya.r@gmail.com>
* fix to t5 adapter and ia3 evals due to predict_step dictionary key changes
Signed-off-by: arendu <adithya.r@gmail.com>
* use mixin logic for adapters in ParallelAttention and ParallelMLP classes
Signed-off-by: arendu <adithya.r@gmail.com>
* typo fix
Signed-off-by: arendu <adithya.r@gmail.com>
* updates
Signed-off-by: arendu <adithya.r@gmail.com>
* moved adapter tools
Signed-off-by: arendu <adithya.r@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* fix error with t5 adapter
Signed-off-by: arendu <adithya.r@gmail.com>
* updates'
Signed-off-by: arendu <adithya.r@gmail.com>
* replace ColumnParallelLinear with nn.Linear in export_utils
Signed-off-by: arendu <adithya.r@gmail.com>
* remove ColumnLinear
Signed-off-by: arendu <adithya.r@gmail.com>
* typo fix
Signed-off-by: arendu <adithya.r@gmail.com>
* update to check config targets
Signed-off-by: arendu <adithya.r@gmail.com>
* updates
Signed-off-by: arendu <adithya.r@gmail.com>
* refactor so that mixin is adapter name agnostic
Signed-off-by: arendu <adithya.r@gmail.com>
* fix merge conflict
Signed-off-by: arendu <adithya.r@gmail.com>
* minor
Signed-off-by: arendu <adithya.r@gmail.com>
* minor
Signed-off-by: arendu <adithya.r@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* using class comparison instead of string match
Signed-off-by: arendu <adithya.r@gmail.com>
* fix test fail
Signed-off-by: arendu <adithya.r@gmail.com>
* fixed checks for add_adapter
Signed-off-by: arendu <adithya.r@gmail.com>
* fixed checks for add_adapter
Signed-off-by: arendu <adithya.r@gmail.com>
Signed-off-by: arendu <adithya.r@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
* fix groovy syntax
Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
* fix hypotheses return (#5253)
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
* Update ASR scores table (#5254) (#5255)
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
* Support for inserting additional subsampling in conformer encoder (#5224)
* Change the default position of the reduction position to null and rename subsampling reduction to striding
Signed-off-by: Shantanu Acharya <shantanua@nvidia.com>
* Put the caching logic outside the conformer encoder
Signed-off-by: Shantanu Acharya <shantanua@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Add description of the reduction parameters in the configs
Signed-off-by: Shantanu Acharya <shantanua@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Update test_asr_exportables with correct reduction position value
Signed-off-by: Shantanu Acharya <shantanua@nvidia.com>
Signed-off-by: Shantanu Acharya <shantanua@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* asr and nmt tests in parallel
Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
* add more users who can trigger blossom-ci
Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
* path fix
Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
* fix paths, remove redundant test
Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
* add symlink
Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
* Modernize RNNT ONNX export and add TS export (#5248)
* Upgrade rnnt export for CUDA/CPU/TRT
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update runtime script for onnx exported model to modern API
Signed-off-by: smajumdar <titu1994@gmail.com>
* Finalize code
Signed-off-by: smajumdar <titu1994@gmail.com>
* Remove comments
Signed-off-by: smajumdar <titu1994@gmail.com>
* Remove redundant stuff from tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update test
Signed-off-by: smajumdar <titu1994@gmail.com>
* Remove onnx rnnt export test due to lack of onnxruntime install
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
* update tutorials to use meeting config as default and VAD (#5237)
* update tutorials to use meeting config as default and VAD
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
* update model path
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
* Fix links to speaker identification notebook (#5260) (#5261)
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Co-authored-by: Sean Naren <snarenthiran@nvidia.com>
* add shm-size
Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
* [TTS] Fastpitch energy condition and refactoring (#5218)
* Incorporating Energy conditioning in FastPitch
Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>
* Minor fixes in Energy conditioning in FastPitch
Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>
* Add Energy conditioning in FastPitch to infer method
Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>
* adding fn to function names
Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>
* Incorporating Energy conditioning in FastPitch
Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>
* Minor fixes in Energy conditioning in FastPitch
Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>
* Add Energy conditioning in FastPitch to infer method
Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>
* adding fn to function names
Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* remove ifelse from batching, minor refactoring changes in energy code
Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Refactor based on PR comments.
Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>
* Added support for not learning alignment in energy
Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>
* Fix typo in assert statemetn
Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>
* Renaming average_pitch to average_features
Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>
* Renaming len variable name as it is a keyword
Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>
* Renaming len variable name as it is a keyword
Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>
Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* [TTS] HiFi-TTS Download Script (#5241)
* Hifi tts download script
Signed-off-by: Oleksii Volkovskyi <volkovskyi@berkeley.edu>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Oleksii Volkovskyi <volkovskyi@berkeley.edu>
* comment and remove imports
Signed-off-by: Oleksii Volkovskyi <volkovskyi@berkeley.edu>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Oleksii Volkovskyi <volkovskyi@berkeley.edu>
Signed-off-by: Oleksii Volkovskyi <volkovskyi@berkeley.edu>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* Specifying audio signal dropout separately for the Conformer Encoder (#5263)
* Fixed bug in transcribe_speech.py where decoding strategy was not being updated.
Signed-off-by: Shantanu Acharya <shantanua@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Add option to specify audio dropout separately for conformer encoders
Signed-off-by: Shantanu Acharya <shantanua@nvidia.com>
* Add audio dropout option to test_asr_exportables
Signed-off-by: Shantanu Acharya <shantanua@nvidia.com>
* Rename dropout_audio to dropout_pre_encode
Signed-off-by: Shantanu Acharya <shantanua@nvidia.com>
* Update the comments in squeezeformer configs referring to conformer modules
Signed-off-by: Shantanu Acharya <shantanua@nvidia.com>
Signed-off-by: Shantanu Acharya <shantanua@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* created (#5268)
* created
* bug
Signed-off-by: Dima Rekesh <drekesh@nvidia.com>
Signed-off-by: Dima Rekesh <drekesh@nvidia.com>
Co-authored-by: Dima Rekesh <drekesh@nvidia.com>
* [TTS] Add Mandarin/English Bilingual Recipe for Training Fastpitch Models (#5208)
* Add Chinese TTS tokenizer and G2P.
* Add data process script.
* Add tutorial.
Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>
* Minor typo fixes in TTS tutorial (#5266) (#5272)
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Co-authored-by: Jocelyn <jocelynh@nvidia.com>
* Fix failing speaker counting for short audio samples (#5267)
* Add files for commit
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Added parallelism on p-value search
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Changed speaker clustering to accept torch.tensor
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Cleaned up the code and tested to have identical output
Signed-off-by: Taejin Park <tango4j@gmail.com>
* update on Notebook demo
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Added eigvalsh for faster eig val calculation:
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Remove NMESC_JitScriptedModule.ipynb
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Cleaned code and style fix
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Modified MSDD framework to fit torch-scripted clustering
Signed-off-by: Taejin Park <tango4j@gmail.com>
* LGTM fix
Signed-off-by: Taejin Park <tango4j@gmail.com>
* removed all string based timestamps
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Removed unnecessary lines
Signed-off-by: Taejin Park <tango4j@gmail.com>
* removed redundant lines
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Add enhanced speaker count back
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* fixed minor docstrings
Signed-off-by: Taejin Park <tango4j@gmail.com>
* removed import Counter
Signed-off-by: Taejin Park <tango4j@gmail.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* Pcla tutorial fixes (#5271) (#5273)
* Fixed typos
Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>
* Fixed cell type and tatoeba reference
Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>
* Fixed typo
Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>
* Fixed branch variable
Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>
Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>
Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>
Co-authored-by: Matvei Novikov <mattyson.so@gmail.com>
* Add Gradio App to ASR Docs (#5270)
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
* Fix bug into Dialogue tutorial (#5277) (#5280)
Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com>
* [TTS] fixed type of filepath and rename openslr. (#5276)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* O2bert + apex pipeline functions (#5221)
* Global batch size support for validation
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Global batch size support for bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* bert batch support
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* bert batch size support
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* O2 support for bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Update megatron_bert_pretraining.py
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
* Update megatron_bert_model.py
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Update megatron_bert_config.yaml
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Update megatron_bert_model.py
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
* Bug fix
* Bug fix
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Bug fix
* Bug fix
* Bug fix
* Update megatron_bert_config.yaml
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
* Addressed Sandeeps comments
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
* Update Jenkinsfile
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
* Update Jenkinsfile
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
* Update megatron_bert_model.py
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
* Update Jenkinsfile
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* Typo fix (#5288) (#5291)
Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>
Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>
Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>
Co-authored-by: Matvei Novikov <mattyson.so@gmail.com>
* Upperbound PTL (#5302)
* Upperbound PTL
Signed-off-by: smajumdar <titu1994@gmail.com>
* Upperbound PTL
Signed-off-by: smajumdar <titu1994@gmail.com>
* Upperbound PTL
Signed-off-by: smajumdar <titu1994@gmail.com>
* Upperbound PTL
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add support for Sampled Softmax for RNNT Joint (#5216)
* Initial prototype of SampldRNNTJoint
Signed-off-by: smajumdar <titu1994@gmail.com>
* Implement randperm based noise selection algo
Signed-off-by: smajumdar <titu1994@gmail.com>
* First working prototype of sampled rnnt !
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add note for why we need this remap before i forget
Signed-off-by: smajumdar <titu1994@gmail.com>
* Finalize version that works with sampling
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update docs for rnnt decoder and joint
Signed-off-by: smajumdar <titu1994@gmail.com>
* Remove the adjustment_val for softmax
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update config and docs for Sampled Softmax
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update Interface(s) phonetic entry (#5212)
* change interface(s) phone
Signed-off-by: Jason <jasoli@nvidia.com>
* push version
Signed-off-by: Jason <jasoli@nvidia.com>
* update dict path
Signed-off-by: Jason <jasoli@nvidia.com>
Signed-off-by: Jason <jasoli@nvidia.com>
* [TTS] remove obsolete torch_tts unit test marker and replace with run_only_on('CPU') (#5307)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* Fixes for Conformer-xl export (#5309)
* Fixing runtime check for ONNX > 2G
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
* Fixing ONNX export
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
* add label inference support to EncDecSpeakerLabel class (#5278)
* add label inference support to EncDecSpeakerLabel class
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
* add necessary tests
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
* reflect on comments
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
* grammatical correction
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
* minor doc string changes
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
* [TTS] bugfix IPAG2P and refactor to remove duplicate process. (#5304)
* [TTS] bugfix IPAG2P and refactor to remove duplicate process.
* added type hints and rename func.
* unify str and list(str) as list(str).
* revise logging message when phoneme_dict_obj is empty
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* Update path to get_data.py in TTS tutorial (#5311)
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
* Add italian model checkpoints (#5315)
Signed-off-by: Igor Gitman <igitman@nvidia.com>
Signed-off-by: Igor Gitman <igitman@nvidia.com>
* Text Memmap Parsing Improvements (#5265)
* 1. Fixed text-memmap issue when boundary (new-line) is missing from end of file).
Signed-off-by: Micha Livne <mlivne@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* 1. Fixed style.
Signed-off-by: Micha Livne <mlivne@nvidia.com>
* 1. Added support in paratial sample loading and alternative decoding.
Signed-off-by: Micha Livne <mlivne@nvidia.com>
* 1. Fixed syntax issues.
Signed-off-by: Micha Livne <mlivne@nvidia.com>
* 1. Minor change.
Signed-off-by: Micha Livne <mlivne@nvidia.com>
* 1. Extended flexibility of mapping indices.
Signed-off-by: Micha Livne <mlivne@nvidia.com>
* 1. Added validation ofdtype of indexing function.
Signed-off-by: Micha Livne <mlivne@nvidia.com>
Signed-off-by: Micha Livne <mlivne@nvidia.com>
Co-authored-by: Micha Livne <mlivne@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca>
* [TTS] Replace IPA lambda arguments with locale string (#5298)
* [TTS] Replace IPA lambda arguments with locale string
* [TTS] Add locale validation
* Fixed typos
* Return punctuation as sorted list
Signed-off-by: Ryan <rlangman@nvidia.com>
* Remove onnx graphsurgery from Dockerfile (#5320)
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update refspec (#5321)
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
* Force wav file format for audio_filepath (#5323)
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
* Updates to T0 Dataset and Model (#5201)
* Fix
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fix
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fixes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Update config
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Restore function needed for NMT
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fix config
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Change output file format from JSON to JSONL
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Add T0 data preproc scripts
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Merge and multiprocessing
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix for is_correct
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Refactor T0 dataset
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Add script to merge train folder into individual training files to minimize number of blends
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Minor changes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Remove bin compat
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fix header lines
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* [DOC] add sphinx-copybutton requirement to copy button on code snippets. (#5326)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* [TTS] expand to support flexible dictionary entry formats in IPAG2P. (#5318)
* expand to support flexible dictionary entry formats in IPAG2P.
* removed unused imports in test.collections.tts
* removed unused imports in nemo.collections.tts.modules
* removed unused imports in nemo_text_processing.text_normalization.zh
* updated unit tests with new cases
* renamed test function names because we only test IPAG2P rather than all classes in the modules.py.
* revise current test dict with a single space between word and pronunications.
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* small bugfix for r1.13.0 (#5310) (#5325)
* typo fix
* udpate transcribe
Signed-off-by: fayejf <fayejf07@gmail.com>
Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com>
* Option to pad the last validation input sequence if its smaller than the encoder sequence length for MegatronGPT (#5243)
* Option to pad the last input sequence of validation dataset if its smaller than the encoder sequence length for MegatronGPT
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* added default value for drop last argument
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Anmol Gupta <anmolg@nvidia.com>
Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca>
* Add support for Hydra multirun to NeMo (#5159)
* Update execution doc and remove old snippet
Signed-off-by: smajumdar <titu1994@gmail.com>
* Fix types
Signed-off-by: smajumdar <titu1994@gmail.com>
* Fix defaults
Signed-off-by: smajumdar <titu1994@gmail.com>
* Fix types for ParallelAdapterConfig
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add hash for config cache
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Add support to delete redundant ckpt files for HP search
Signed-off-by: smajumdar <titu1994@gmail.com>
* Correct config for IA3
Signed-off-by: smajumdar <titu1994@gmail.com>
* Fix check to <= 0
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* typo fix (#5328)
* Speed up HF data processing script for ASR (#5330)
* Correct hydra issue with relative filepaths
Signed-off-by: smajumdar <titu1994@gmail.com>
* Improve speed of dataset processing
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
* add precommit hood to automatic sort entries in requirements. (#5333)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* [TTS] update organization of model checkpoints and their pointers. (#5327)
* [TTS] update orgnization of model checkpoints and their pointers.
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* move model name column to the 2nd col and correct model names as
predefined_model_name.
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* Add speaker clustering arguments to forward function (#5306)
* Move arguments to forward function
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Resolved type issue
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Taejin Park <tango4j@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* [STT] Add Ru ASR Conformer-CTC and Conformer-Transducer (#5340) (#5341)
* [STT] Add stt_ru_conformer_ctc_large
Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com>
* [STT] Add stt_ru_conformer_transducer_large
Add stt_ru_conformer_transducer_large
Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com>
Co-authored-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* [TTS] bugfix for the script of generating mels. (#5344)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* Fixing de-autocast (#5319)
* Fixing de-autocast
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
* Cleanup
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
* Refining export with max_dim/batch
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
* Moving cast utils to its own module
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
* Pcla tutorial fixes (#5313) (#5347)
* fixes
Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>
* fixes
Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>
* moved `create_text_and_labels` to token_classification_utils.py
Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>
Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>
Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>
Co-authored-by: Matvei Novikov <mattyson.so@gmail.com>
* bug (#5348)
Co-authored-by: Dima Rekesh <drekesh@nvidia.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
* [Bugfix] Added rm -f / wget- nc command to avoid bash error in multispeaker sim notebook (#5292)
* Added rm -f command to avoid error message
Signed-off-by: Taejin Park <tango4j@gmail.com>
* removed unnecessary changes
Signed-off-by: Taejin Park <tango4j@gmail.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [DOC] added ipython dependency to support IPython.sphinxext extension (#5345)
* [DOC] added ipython dependency to support IPython.sphinxext extension
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* revert ipython extension in the doc and replace ipython block with
shell-session.
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* Fix dialogue tutorial bug (#5297) (#5303)
* set add_pooling_layer=False for huggingface bert model
* remove add_pooling_layer=False and set find_unused_parameters=True
* set num_prompt_tokens to 0 for huggingface
Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
* Fix issue with HF Model upload tutorial (#5359) (#5360)
* Add Gradio App to ASR Docs (#5270)
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
(cherry picked from commit e4b6a387e3b3d9cdf511f7b9bbb5e94925e48cc2)
* Fix issue with normalized config for dataset name
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
* Bug fix (removing old compute consumed samples) (#5355)
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
* removed uninstall nemo_cv and nemo_simple_gan and relax numba version… (#5332)
* Update reinstall.sh and requirements.
* removed nemo_cv and nemo_simple_gan in reinstall.sh.
* relaxed numba version limits.
* added tensorboard requirement to avoid any incpmpatible issue.
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* revert changes for numba
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* Pipeline paralleism in Bert (#5293)
* Global batch size support for validation
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Global batch size support for bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* bert batch support
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* bert batch size support
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* O2 support for bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Update megatron_bert_pretraining.py
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
* Update megatron_bert_model.py
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Update megatron_bert_config.yaml
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Update megatron_bert_model.py
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
* Bug fix
* Bug fix
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Bug fix
* Bug fix
* Bug fix
* Update megatron_bert_config.yaml
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
* PPBert
* PPBert
* PPBert
* PPBert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Update megatron_bert_config.yaml
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
* bug fix
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* bug fix
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* bug fix
* bug fix
* bug fix
* bug fix
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
* tutorial fixes (#5354) (#5361)
Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>
Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>
Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>
Co-authored-by: Matvei Novikov <mattyson.so@gmail.com>
* Enable mlflow logger (#4893)
* Enable mlflow logger
Signed-off-by: whrichd <trabeitwrq@gmail.com>
* fix style
Signed-off-by: whrichd <trabeitwrq@gmail.com>
* Add doc lines.
Signed-off-by: whrichd <trabeitwrq@gmail.com>
* change default value
Signed-off-by: whrichd <trabeitwrq@gmail.com>
* fix doc
Signed-off-by: whrichd <trabeitwrq@gmail.com>
* addressed comments, added dataclass
Signed-off-by: whrichd <trabeitwrq@gmail.com>
* fix style
Signed-off-by: whrichd <trabeitwrq@gmail.com>
* fix doc
Signed-off-by: whrichd <trabeitwrq@gmail.com>
Signed-off-by: whrichd <trabeitwrq@gmail.com>
* Add SDP documentation (#5274) (#5376)
* Add details to SDP README.md
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add docstring to WriteManifest processor
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add docstring to CreateInitialManifestMLS
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add ModifyManifestTextProcessor docstring
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add ASRInference docstring
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add base_processor docstrings
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add minimal SDP docs page
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Update tools/speech_dataset_processor/README.md
Co-authored-by: Igor Gitman <igitman@nvidia.com>
Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com>
* Write simple README for SDP and move complex explanations to docs
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Remove incorrect type hints
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make config example less confusing
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fix typo
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Clarify that YAML file is config file in README
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Remove unused imports
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Remove SDP docs for now
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Remove links to docs in SDP README
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com>
Co-authored-by: Igor Gitman <igitman@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com>
Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com>
Co-authored-by: Igor Gitman <igitman@nvidia.com>
* Rename Speech Dataset Processor to Speech Data Processor (#5378) (#5381)
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com>
* fix for num worker 0 causing issues in losses after 1 epoch (#5379) (#5384)
Co-authored-by: Adi Renduchintala <108822655+arendu@users.noreply.github.com>
* [TTS] Add Spanish model documentation (#5390)
Signed-off-by: Ryan <rlangman@nvidia.com>
* [TTS] Add Spanish FastPitch training configs (#5383)
* [TTS] Add Spanish FastPitch training configs
* [TTS] Add single speaker Spanish configs
Signed-off-by: Ryan <rlangman@nvidia.com>
* Fix Python type hints according to Python Docs (#5370)
* Remove duplicated type annotations
Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>
* Fix tuple annotations in function return types
Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>
* Add necessary imports
Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>
* Add necessary imports
Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>
* Fix types in obvious places
Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>
* Fix types in obvious places
Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>
* Fix unused import (avoid quotes in type annotations)
Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>
* Revert "Fix unused import (avoid quotes in type annotations)"
This reverts commit ea433efcd9916abf8944879e791484a0a1437f83.
Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>
* Remove problematic import
Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>
* Fix list_available_models method type
Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>
* Revert some changes
Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>
* Revert quotes in list_available_models
Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>
Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>
* Force MHA QKV onto fp32 (#5391) (#5395)
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
* Add cpWER for evaluation of ASR with diarization (#5279)
* Add cpWER calculation feature
Signed-off-by: Taejin Park <tango4j@gmail.com>
* added notebook
Signed-off-by: Taejin Park <tango4j@gmail.com>
* updated notebook and diarization_utils
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Minor update on tutorial notebook
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Style fix
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Update on missing docstrings
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Fixed an unfinished docstring
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Removed unused variables
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Fixed dict input to list input
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Style fix
Signed-off-by: Taejin Park <tango4j@gmail.com>
* fixed LGTM issues
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Fixed error in cpWER cal
Signed-off-by: Taejin Park <tango4j@gmail.com>
* fixed docstrings
Signed-off-by: Taejin Park <tango4j@gmail.com>
* fixed docstrings
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Fix some of the typing issues, lower case names
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Replaced bruteforce with LSA alg for cpWER
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Reflected PR comments
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Cleaned notebook
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* updated notebook
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Fixed LGTM warnings
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* added test_diar_metrics.py
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* fixed typos
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Fixed wrong type annotations
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Added bruteforce mode and its unit-test
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* LGTM issues fixed
Signed-off-by: Taejin Park <tango4j@gmail.com>
* reolve LGTM issues
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* unified speaker key in trans_dict
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Removed unused variable and imports
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Update nemo/collections/asr/parts/utils/diarization_utils.py
Co-authored-by: Sean Naren <snarenthiran@nvidia.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Update nemo/collections/asr/parts/utils/diarization_utils.py
Co-authored-by: Sean Naren <snarenthiran@nvidia.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>
* moved all the diarization eval to der.py
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Update tests/collections/asr/test_diar_metrics.py
Co-authored-by: Sean Naren <snarenthiran@nvidia.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>
* der.py update on tests
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* unused imports and style fix
Signed-off-by: Taejin Park <tango4j@gmail.com>
* style fix
Signed-off-by: Taejin Park <tango4j@gmail.com>
* unused import
Signed-off-by: Taejin Park <tango4j@gmail.com>
* reflected review comments
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Fixed an import bug in tutorial notebook
Signed-off-by: Taejin Park <tango4j@gmail.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: SeanNaren <snarenthiran@nvidia.com>
Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com>
* Added cast
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
Signed-off-by: Vahid <vnoroozi@nvidia.com>
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>
Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Signed-off-by: stevehuang52 <heh@nvidia.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Signed-off-by: arendu <adithya.r@gmail.com>
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: Shantanu Acharya <shantanua@nvidia.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: subhankar-ghosh <subhankar2321@gmail.com>
Signed-off-by: Oleksii Volkovskyi <volkovskyi@berkeley.edu>
Signed-off-by: Dima Rekesh <drekesh@nvidia.com>
Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>
Signed-off-by: Matvei Novikov <mattyson.so@gmail.com>
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
Signed-off-by: Jason <jasoli@nvidia.com>
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Igor Gitman <igitman@nvidia.com>
Signed-off-by: Micha Livne <mlivne@nvidia.com>
Signed-off-by: Ryan <rlangman@nvidia.com>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: fayejf <fayejf07@gmail.com>
Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com>
Signed-off-by: whrichd <trabeitwrq@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com>
Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>
Signed-off-by: David <amosalla@asu.edu>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com>
Co-authored-by: anteju <108555623+anteju@users.noreply.github.com>
Co-authored-by: Taejin Park <tango4j@gmail.com>
Co-authored-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: Jocelyn <jocelynh@nvidia.com>
Co-authored-by: Adi Renduchintala <108822655+arendu@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
Co-authored-by: Shantanu Acharya <shantanua@nvidia.com>
Co-authored-by: Sean Naren <snarenthiran@nvidia.com>
Co-authored-by: Subhankar Ghosh <subhankar2321@gmail.com>
Co-authored-by: Oleksii Volkovskyi <volkovskyi@berkeley.edu>
Co-authored-by: Dima Rekesh <bmwshop@gmail.com>
Co-authored-by: Dima Rekesh <drekesh@nvidia.com>
Co-authored-by: Yuekai Zhang <zhangyuekai@foxmail.com>
Co-authored-by: Matvei Novikov <mattyson.so@gmail.com>
Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com>
Co-authored-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
Co-authored-by: Jason <jasoli@nvidia.com>
Co-authored-by: Igor Gitman <igitman@nvidia.com>
Co-authored-by: Micha Livne <michalivne@users.noreply.github.com>
Co-authored-by: Micha Livne <mlivne@nvidia.com>
Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: Ryan Langman <rlangman@nvidia.com>
Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com>
Co-authored-by: anmolgupt <14880251+anmolgupt@users.noreply.github.com>
Co-authored-by: Anmol Gupta <anmolg@nvidia.com>
Co-authored-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Co-authored-by: Riqiang Wang <43883260+whrichd@users.noreply.github.com>
Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com>
Co-authored-by: Vladimir Bataev <vbataev@nvidia.com>
Co-authored-by: David <amosalla@asu.edu>
hainan-xv
pushed a commit
to hainan-xv/NeMo
that referenced
this pull request
Nov 29, 2022
…#5378) (NVIDIA-NeMo#5381) Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com>
hainan-xv
pushed a commit
to hainan-xv/NeMo
that referenced
this pull request
Nov 29, 2022
…#5378) (NVIDIA-NeMo#5381) Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com>
ericharper
pushed a commit
that referenced
this pull request
Dec 7, 2022
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
ericharper
added a commit
that referenced
this pull request
Dec 8, 2022
* update branch Signed-off-by: ericharper <complex451@gmail.com> * Rename Speech Dataset Processor to Speech Data Processor (#5378) Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Megatron Export Update (#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added comment Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * updated del onnx_model place Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> * Disable sync_batch_comm in validation_step for GPT (#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <complex451@gmail.com> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Empty Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Comment out test Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> * Radtts 1.13 (#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) (#5478) * Initial refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for eval Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove comments Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Minor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove old comment Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (#5480) * updated export_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * added set_start_method + function param bugfix (#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <complex451@gmail.com> * remove notebook (#5548) Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: ericharper <complex451@gmail.com> * update readme Signed-off-by: ericharper <complex451@gmail.com> * update branch Signed-off-by: ericharper <complex451@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * revert Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: David <amosalla@asu.edu> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
erastorgueva-nv
added a commit
that referenced
this pull request
Dec 9, 2022
* update branch Signed-off-by: ericharper <complex451@gmail.com> * Rename Speech Dataset Processor to Speech Data Processor (#5378) Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Megatron Export Update (#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added comment Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * updated del onnx_model place Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> * Disable sync_batch_comm in validation_step for GPT (#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <complex451@gmail.com> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Empty Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Comment out test Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> * Radtts 1.13 (#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) (#5478) * Initial refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for eval Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove comments Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Minor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove old comment Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (#5480) * updated export_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * added set_start_method + function param bugfix (#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <complex451@gmail.com> * remove notebook (#5548) Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: ericharper <complex451@gmail.com> * update readme Signed-off-by: ericharper <complex451@gmail.com> * update branch Signed-off-by: ericharper <complex451@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * revert Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: David <amosalla@asu.edu> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
JimmyZhang12
pushed a commit
to JimmyZhang12/NeMo
that referenced
this pull request
Dec 14, 2022
…#5378) (NVIDIA-NeMo#5381) Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com>
andrusenkoau
pushed a commit
to andrusenkoau/NeMo
that referenced
this pull request
Jan 5, 2023
…#5378) (NVIDIA-NeMo#5381) Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Signed-off-by: andrusenkoau <andrusenkoau@gmail.com>
andrusenkoau
pushed a commit
to andrusenkoau/NeMo
that referenced
this pull request
Jan 5, 2023
* update branch Signed-off-by: ericharper <complex451@gmail.com> * Rename Speech Dataset Processor to Speech Data Processor (NVIDIA-NeMo#5378) Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Megatron Export Update (NVIDIA-NeMo#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added comment Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * updated del onnx_model place Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> * Disable sync_batch_comm in validation_step for GPT (NVIDIA-NeMo#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <complex451@gmail.com> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Empty Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Comment out test Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> * Radtts 1.13 (NVIDIA-NeMo#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (NVIDIA-NeMo#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (NVIDIA-NeMo#5339) (NVIDIA-NeMo#5478) * Initial refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for eval Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove comments Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Minor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove old comment Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (NVIDIA-NeMo#5480) * updated export_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (NVIDIA-NeMo#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * added set_start_method + function param bugfix (NVIDIA-NeMo#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <complex451@gmail.com> * remove notebook (NVIDIA-NeMo#5548) Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: ericharper <complex451@gmail.com> * update readme Signed-off-by: ericharper <complex451@gmail.com> * update branch Signed-off-by: ericharper <complex451@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * revert Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: David <amosalla@asu.edu> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: andrusenkoau <andrusenkoau@gmail.com>
erastorgueva-nv
added a commit
that referenced
this pull request
Jan 12, 2023
* update branch Signed-off-by: ericharper <complex451@gmail.com> * Rename Speech Dataset Processor to Speech Data Processor (#5378) Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Megatron Export Update (#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added comment Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * updated del onnx_model place Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> * Disable sync_batch_comm in validation_step for GPT (#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <complex451@gmail.com> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Empty Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Comment out test Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> * Radtts 1.13 (#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) (#5478) * Initial refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for eval Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove comments Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Minor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove old comment Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (#5480) * updated export_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * added set_start_method + function param bugfix (#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <complex451@gmail.com> * remove notebook (#5548) Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: ericharper <complex451@gmail.com> * update readme Signed-off-by: ericharper <complex451@gmail.com> * update branch Signed-off-by: ericharper <complex451@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * revert Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: David <amosalla@asu.edu> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
titu1994
added a commit
that referenced
this pull request
Jan 17, 2023
* Merge r1.13.0 main (#5570)
* update branch
Signed-off-by: ericharper <complex451@gmail.com>
* Rename Speech Dataset Processor to Speech Data Processor (#5378)
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Megatron Export Update (#5343)
* export update for Megatron + change ORT optimization
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* updated export_utils to use autocast instead of manually casting >:/
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* removed dtype from LayerNorm
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* added comment
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* reverting changes on FloatCast
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* Cherry-picked changes from megatron-norm
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
* updated asr_model import to cast_utils
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* updated del onnx_model place
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* changed ort optimization to basic -> temp fix
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com>
* Disable sync_batch_comm in validation_step for GPT (#5397)
* disable sync_batch_comm in validation_step
Signed-off-by: ericharper <complex451@gmail.com>
* Read sync_batch_comm from config or default to False
Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com>
* Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error
Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com>
* Empty
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Comment out test
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com>
Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
* Radtts 1.13 (#5451)
* [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358)
* [TTS] add CI test for RADTTS training recipe.
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
* Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) (#5478)
* Initial refactor
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Resolve config before passing to load_from_checkpoint
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fixes for model parallel and nemo restore
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fixes for eval
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Revert config changes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Refactor
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix typo
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Remove comments
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Minor
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix validation reconfiguration
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Remove old comment
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fixes for test_ds
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* export_utils bugfix (#5480)
* updated export_utils
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* Export fixes for Riva (#5496)
* Export fixes for Riva
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
* Cleaning up training_utils
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
* added set_start_method + function param bugfix (#5539)
* added set_start_method + function param bugfix
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* upper bound torchmetrics
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Signed-off-by: ericharper <complex451@gmail.com>
Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: ericharper <complex451@gmail.com>
* remove notebook (#5548)
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: ericharper <complex451@gmail.com>
* update readme
Signed-off-by: ericharper <complex451@gmail.com>
* update branch
Signed-off-by: ericharper <complex451@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com>
Co-authored-by: David <amosalla@asu.edu>
Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com>
Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com>
Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Optimized loop and bugfix in SDE (#5573)
- Fixed bug with loading custom data attributes from JSON in Speech Data Explorer
Signed-off-by: George Zelenfroynd <gzelenfroind@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Update torchmetrics (#5566)
* add task arg
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
* update state
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
Co-authored-by: Taejin Park <tango4j@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* remove useless files. (#5580)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* add initial NFA code
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make use of the specified device during viterbi decoding
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fix CodeQL notes
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fix CodeQL warning
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add an option to defer data setup from ``__init__`` to ``setup`` (#5569)
* Add an option to defer dataloader setup from __init__ to setup
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Updated doc
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make utt_id specified by number of parts of audio_filepath user wishes to use
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* remove audio_sr TODO - reduce risk of silent bugs
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add check that model is CTC
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Remove unused import
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Text generation improvement (UI client, data parallel support) (#5437)
* Squashed commit of the following:
commit a5e124f34be31bd6eafe5e5fdf5bedcd0d50915c
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Thu Oct 13 15:07:42 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit 35b424044fe80c3081e7756ab21244f701716f7e
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 08:04:49 2022 -0700
get rid of base
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 2955210e2311791543538cfbb5ad26b79414c954
Merge: d52edef8c eaf6757ca
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 13:17:02 2022 +0000
Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt
commit d52edef8cd7b36593838fb270047e80f8ccb652e
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 13:16:24 2022 +0000
align with main
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit eaf6757ca5be8e099492f57c81d984429b0ad49c
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Thu Oct 13 13:12:11 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit c4b86d97626ea0721bf8fb4c0a45dec5becc94c9
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 13:10:58 2022 +0000
same as main
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit e335de51bcc0d681c58b568c3d8c238bc5687c3b
Merge: c231086e0 4463a9fe9
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 13:08:09 2022 +0000
Merge branch 'main' into universal_prompt
commit c231086e057f1efaa915f691d84664cb3d5aad85
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Wed Oct 12 19:59:12 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit 6a821a4b49a23dd3408a706a2a3dd393149b0bb1
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 19:56:17 2022 +0000
default to pad
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 9d908e39fef1beed9ba2da4d1a6806161eb7ef25
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 19:55:44 2022 +0000
add the option to pad the tokens
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 876dc395b43fdeeaa2bcbbe13c76523633764c33
Merge: fbb0f4035 fe3c77ee9
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 19:20:47 2022 +0000
Merge branch 'fix_global_init' into universal_prompt
commit fe3c77ee93ab6cf3ea152db68cb6beefcac2a392
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 18:59:49 2022 +0000
fix import again
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit fbb0f4035c6cd6bfefed50a20605503de8c1dccb
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Wed Oct 12 16:00:24 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit 372ca8c0d7988f2339b15888dc72aa21f4fb6937
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 15:58:32 2022 +0000
enable server
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit cbe05d9fbc978f812cfbb671f45f147f300713c4
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 13:07:28 2022 +0000
fix comment error
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 1948048922e726ec6131e44b1a745389f18d4ef2
Merge: 232c2cce3 984f5c09a
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 13:05:30 2022 +0000
Merge branch 'fix_global_init' into universal_prompt
commit 232c2cce34d7a8b902da406706f3dd9b39475091
Merge: 34c8a68df 658243fb6
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 12:50:00 2022 +0000
Merge branch 'fix_global_init' into universal_prompt
commit 984f5c09a6dbf1d1fb5aa30ed9b0df188e66a50f
Merge: 658243fb6 3fda5de46
Author: Yi Dong <43824965+yidong72@users.noreply.github.com>
Date: Wed Oct 12 08:42:11 2022 -0400
Merge branch 'main' into fix_global_init
commit 658243fb6580191b5d60edd30cde16dcc23cbb85
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 12 12:40:57 2022 +0000
fix import error
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 8e0fe1cad05ec288ec122b3cd0e139a96872e08c
Author: Yi Dong <doyend@gmail.com>
Date: Tue Oct 11 22:44:12 2022 +0000
update the fused kernel
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 536cf6bef9447b75843fad630729c47a2fba35f3
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 11 14:44:52 2022 -0700
add the missing file
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 1b437ec41dc5e354453ce0a089bca0171cbcb6c2
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 11 14:43:14 2022 -0700
fix fused softmax
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 7813f60e05f9783af61f8c14ec1cb0c6c4f1f263
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 11 14:16:48 2022 -0700
move global step to base
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 34c8a68df084b18d377e84415d9f07b2cd6673dd
Author: Yi Dong <doyend@gmail.com>
Date: Thu Oct 6 13:50:11 2022 +0000
fix pipeline for eval
Signed-off-by: Yi Dong <doyend@gmail.com>
commit eee5d38218f26660c3ffebe9f615c850c80a1f0d
Author: Yi Dong <doyend@gmail.com>
Date: Thu Oct 6 13:48:22 2022 +0000
fix for pipleline parallel
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 323bca73e7ef6099ee79c0a2fffac7b709ed6c5d
Merge: 125e49947 e3b4c4d1f
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 19:29:13 2022 +0000
Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt
commit 125e4994760448ff75dd9328395813eda1c87547
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 19:29:04 2022 +0000
add share option
Signed-off-by: Yi Dong <doyend@gmail.com>
commit e3b4c4d1f7346c9fa596f3cca6d4df0a9e05c368
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 5 11:43:48 2022 -0700
make sure consolidation works
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit a5c833964ecf05dc460ca1da69275c4019742150
Merge: 2a07ab52d abcb74be2
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 18:40:29 2022 +0000
Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt
commit 2a07ab52d95f15ba666823028c69e23825666c05
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 18:40:23 2022 +0000
added requirement
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 3abecd9dd1611993a87c537636abe7f7e6a9b04c
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 18:39:42 2022 +0000
added a simple web server
Signed-off-by: Yi Dong <doyend@gmail.com>
commit abcb74be2caf1cdec40eb9ba2be4dde4d45a3b4b
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 5 06:54:12 2022 -0700
fix empty val loss
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit b8eb92ac4a0d665570af75e34c9ba3c2e2420c26
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 19:25:30 2022 -0700
text gen working
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit d59f3e3f3a6fd19736d1c5706fed65a3dd4049ba
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 16:08:40 2022 -0700
first change
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 59d077585e6962a669b824af58f64e8a0bea6547
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 15:00:40 2022 -0700
revert
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 12a0f3902d99e9179403644bd951c045df716ca7
Author: Yi Dong <doyend@gmail.com>
Date: Tue Oct 4 21:26:23 2022 +0000
init imp
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 62a15dfd943cc48be495ac61b9f2f00995775c5f
Merge: 82c90d2cd e0cc6b767
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 11:58:26 2022 -0700
Merge branch 'main' into universal_prompt
commit 82c90d2cd0fd156f16a4b899f8c741d598f33990
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 11:17:13 2022 -0700
add sync
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 9819b703eef877d90cd1257bf3610c69de9b4d7e
Author: Yi Dong <yidong@nvidia.com>
Date: Sun Oct 2 17:52:34 2022 -0700
fix save model
Signed-off-by: root <root@luna-0197.selene.nvidia.com>
commit e4937e2fc5fb7d70754c97668416e4a69c3079fe
Author: Yi Dong <yidong@nvidia.com>
Date: Sat Oct 1 18:56:09 2022 +0000
working
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit b73b06d1c7cf5417a6d87cb33d8ed83a57e38b7b
Author: Yi Dong <yidong@nvidia.com>
Date: Sat Oct 1 17:34:03 2022 +0000
calcuate the mask
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 9db3bc13eb65a94a475b837603351da68e3745bc
Author: Yi Dong <yidong@nvidia.com>
Date: Fri Sep 30 23:26:32 2022 +0000
fix bug in datasets
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit f289900375d4412f53f8110be00fec6587627550
Author: Yi Dong <yidong@nvidia.com>
Date: Fri Sep 30 22:29:40 2022 +0000
update the code
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 8e28a1f208aabaab72dbe769e72756baada04d99
Author: Yi Dong <yidong@nvidia.com>
Date: Fri Sep 30 21:52:52 2022 +0000
added new ds
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 8d41315bab7ce90e200a8a7d1023c34f8e046897
Author: Yi Dong <doyend@gmail.com>
Date: Fri Sep 30 18:57:09 2022 +0000
added new files
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 984e0e94e15e16323c1ba1ca2efeabd84f69463f
Merge: cbe8b7ab1 fa6cd8588
Author: Yi Dong <doyend@gmail.com>
Date: Thu Sep 29 21:43:29 2022 +0000
Merge branch 'llm-prompt-learning-improvements' into universal_prompt
commit fa6cd858839277939446afe7275976078d54c512
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Thu Sep 29 16:47:30 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit 78ba46e5d6fde1be53c08e1e30a54cce59824be0
Merge: 7d6d46742 8d670bc77
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Thu Sep 29 09:43:27 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit 7d6d46742170a66758287a207d67e1b1bfd15613
Author: Virginia Adams <vadams@nvidia.com>
Date: Thu Sep 29 16:42:43 2022 +0000
Removed inference step and added sentence peice check to predict step
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 20fd265acd6f7f9912cf52155fe66ccfa6b201a2
Author: Virginia Adams <vadams@nvidia.com>
Date: Thu Sep 29 15:26:32 2022 +0000
fixed first stage check for pipeline parallel T5 pt
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 3637be2b258c8d9028856f9971edb7da4a8121f0
Merge: a3ea722fd 986a76612
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Wed Sep 28 10:23:30 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit a3ea722fdc12fbcc5989b76ef5643a574b763bc4
Merge: 770967a52 971485ce7
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Mon Sep 26 13:35:52 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit 770967a5251a474b6dcc2d44bf9a2076adbcb604
Merge: d23bf6c30 e3ac280a8
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Mon Sep 26 10:17:03 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit d23bf6c30acc0e3f6af9b4e24547669866a34d62
Merge: de6a31651 333d2b749
Author: Virginia Adams <vadams@nvidia.com>
Date: Mon Sep 26 10:05:16 2022 -0700
Merge branch 'llm-prompt-learning-improvements' of https://github.com/NVIDIA/NeMo into llm-prompt-learning-improvements
commit de6a31651e63d88a42b971794d93f18ff5a3cdff
Author: Virginia Adams <vadams@nvidia.com>
Date: Mon Sep 26 17:00:53 2022 +0000
Updated PP check to be on first stage pipeline only
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 333d2b7498e6742ce66436f733c980a74616900c
Merge: 592c0986a a39fc925a
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Fri Sep 23 16:11:21 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit 592c0986a476a91b57b8605d7b70830d7acfa021
Author: Virginia Adams <vadams@nvidia.com>
Date: Fri Sep 23 23:08:41 2022 +0000
Fixed unused import and CI test bug
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit ea9cd82d85638bc60ae4ad7ef105db931c8e3455
Merge: ce4b72c8c b566c2d0e
Author: Virginia Adams <vadams@nvidia.com>
Date: Fri Sep 23 18:57:25 2022 +0000
Merge branch 'llm-prompt-learning-improvements' of https://github.com/NVIDIA/NeMo into llm-prompt-learning-improvements
commit ce4b72c8c52f32be336e323dd78a38089edc3e7c
Author: Virginia Adams <vadams@nvidia.com>
Date: Fri Sep 23 18:57:16 2022 +0000
Switch to import from base class
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit b566c2d0e35a068f758fd1310bc620a47be4590b
Merge: 6621f2854 e872061ac
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Fri Sep 23 10:09:03 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit 6621f28543828a48484a5637f6c9f3ccb23a5b02
Author: Virginia Adams <vadams@nvidia.com>
Date: Wed Sep 14 20:47:35 2022 +0000
python format fix
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 8deafc8987b6af5f7b99a250310f57a40198c37f
Author: Virginia Adams <vadams@nvidia.com>
Date: Wed Sep 14 20:28:02 2022 +0000
Save .nemo on new best val score
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 761bd36969cb465d6a129e9eee6ce1f883d3cf41
Author: Virginia Adams <vadams@nvidia.com>
Date: Wed Sep 14 18:03:19 2022 +0000
Added automatic checkpoint to nemo file method
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 3be4ed57b6cd3ddfe4876d78650dfe8fe794598b
Author: Virginia Adams <vadams@nvidia.com>
Date: Wed Sep 14 02:11:56 2022 +0000
Make GPT use base prompt learning model class:
Signed-off-by: Virginia Adams <vadams@nvidia.com>
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix LGTM
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix validation
Signed-off-by: Yi Dong <yidong@nvidia.com>
* change for the lm eval
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* make text generation work in data parallel environment
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* implement the service with rest service
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* surpress log
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fix
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fixes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Update config
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Restore function needed for NMT
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* handles no answer only
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Fix config
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* added knn to web
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix lgtm.com comments
Signed-off-by: Yi Dong <yidong@nvidia.com>
* output the retrieved context
Signed-off-by: Yi Dong <yidong@nvidia.com>
* allow no neighbor query
Signed-off-by: Yi Dong <yidong@nvidia.com>
* remove the imports
Signed-off-by: Yi Dong <yidong@nvidia.com>
* warn only once
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Change output file format from JSON to JSONL
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* new t0 dataset
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Add T0 data preproc scripts
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Merge and multiprocessing
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix for is_correct
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* fix epoch > 2
Signed-off-by: Yi Dong <yidong@nvidia.com>
* handles multiple dataloader
Signed-off-by: Yi Dong <yidong@nvidia.com>
* remove template
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Refactor T0 dataset
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Add script to merge train folder into individual training files to minimize number of blends
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* added on the fly service
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add combo instance
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* added combo service
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* send weights back to server
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* fix index store
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Minor changes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add reset button
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add add eos
Signed-off-by: Yi Dong <yidong@nvidia.com>
* use a seperate bert service
Signed-off-by: Yi Dong <yidong@nvidia.com>
* no loss of accuracy
Signed-off-by: Yi Dong <yidong@nvidia.com>
* pin the gradio version
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Remove bin compat
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fix header lines
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* evaluate based on text generation
Signed-off-by: Yi Dong <yidong@nvidia.com>
* exact match result aggregation
Signed-off-by: Yi Dong <yidong@nvidia.com>
* working SP and SA
Signed-off-by: Yi Dong <yidong@nvidia.com>
* sync
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix checkpoint
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix eval
Signed-off-by: Yi Dong <yidong@nvidia.com>
* backup states
Signed-off-by: Yi Dong <yidong@nvidia.com>
* backup states reset
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix the bug
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix evaluation for sentence piece
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix a bug
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* potential fix in the future
Signed-off-by: Yi Dong <yidong@nvidia.com>
* remove the universal codes
Signed-off-by: Yi Dong <yidong@nvidia.com>
* remove universal strategy
Signed-off-by: Yi Dong <yidong@nvidia.com>
* address reviewer comment
Signed-off-by: Yi Dong <yidong@nvidia.com>
Signed-off-by: Yi Dong <yidong@nvidia.com>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add align function docstrings and make most args optional
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Remove redundant returns of viterbi and log probs matrices
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Rename h# to <initial_silence>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Update manifest format description in README
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* always remove any spaces from utt_id
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Patch the hanging of threads on very large stderr (#5589) (#5590)
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* O2 style amp for gpt3 ptuning (#5246)
* enable amp o2 plugin
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* only create master param if param requires gradient
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* remove pytorch autocast
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* Update optimizer_with_main_params.py
Signed-off-by: JimmyZhang12 <67203904+JimmyZhang12@users.noreply.github.com>
* create master grad only if param group requires grad
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* fix grad scaler for pp > 1
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
Signed-off-by: JimmyZhang12 <67203904+JimmyZhang12@users.noreply.github.com>
Co-authored-by: Jimmy Zhang <jiemingz@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Better patch hydra (#5591) (#5592)
* Readd buffereing and thread drain to Hydra Launcher
Signed-off-by: smajumdar <titu1994@gmail.com>
* Readd buffereing and thread drain to Hydra Launcher
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Yet another fix with hydra multirun (#5594) (#5595)
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add RETRO model documentation (#5578)
* added retro doc
Signed-off-by: Yi Dong <yidong@nvidia.com>
* finish data part
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added the data format
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added training script
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added training and evaluation steps
Signed-off-by: Yi Dong <yidong@nvidia.com>
* edit the text
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added the images
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix beginning
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix the grammar
Signed-off-by: Yi Dong <yidong@nvidia.com>
* trim it down
Signed-off-by: Yi Dong <yidong@nvidia.com>
* add wandb option
Signed-off-by: Yi Dong <yidong@nvidia.com>
* add reference
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix path
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added the parameters table
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix section
Signed-off-by: Yi Dong <yidong@nvidia.com>
Signed-off-by: Yi Dong <yidong@nvidia.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fix: setup_multiple validation/test data (#5585)
Fix: setup_multiple validation/test data (#5585)
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Move to optimizer based EMA implementation (#5169)
* Move to optimizer
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Fix replacing weights
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Allow swapping of weights be optional
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Save 2 models
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Use different hook
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Expose cpu device
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add clause to see if this fixes issue with O2 optimizer
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Try to get O2 working
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* WIP
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Fixes
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Fixes to tests
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add guard
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Remove import
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add guard
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add comment
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Remove overwrite
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add BatchNorm, currently tests fail
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Fix tests/functionality for batch norm
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Get rid of NLP changes
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* AIStore for ASR datasets (#5462)
AIStore for ASR datasets
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add support for MHA adapters to ASR (#5396)
* Convert AbstractAdapterModule to AbstractAdapterMixin
Signed-off-by: smajumdar <titu1994@gmail.com>
* Temporary fixes to new signature of mixin
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add adapter util for constants, add all mha adapters.
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update name of function
Signed-off-by: smajumdar <titu1994@gmail.com>
* Roll back changes to convASR
Signed-off-by: smajumdar <titu1994@gmail.com>
* Convert AbstractAdapterModule to AbstractAdapterMixin
Signed-off-by: smajumdar <titu1994@gmail.com>
* First draft of Conformer support for MHA attention
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add some preliminary tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add support for projection of the hidden dimension for attention
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add support for squeezeformer
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update train adapter config
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add tests for squeezeformer and unit tests for new modules
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update config for hp search,set limits on modules for conformer and squeezeformer, update adapter mixin, add cache to import_from_class_path
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update location of adapters
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add pre_norm for proper attention learning, Fix the issue with nan/inf in pos_bias_u and pos_bias_v
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update expmanager to clean up checkpoints
Signed-off-by: smajumdar <titu1994@gmail.com>
* Fix style
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add docstrings and update tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add docstrings and update tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add docstrings and update tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Update training scripts
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update config and docs
Signed-off-by: smajumdar <titu1994@gmail.com>
* Expose nemo delete function
Signed-off-by: smajumdar <titu1994@gmail.com>
* Correct adapter partial state saving
Signed-off-by: smajumdar <titu1994@gmail.com>
* Correct a bug with state management of adapter tokens
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Pull down EMA test
Signed-off-by: smajumdar <titu1994@gmail.com>
* Correct name of adapter module utility class
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Remove unused TTS eval functions w/ pesq and pystoi dependencies (#5605) (#5606)
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Co-authored-by: Jocelyn <jocelynh@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Create separator parameter
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Call align function with hydra config
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* update usage example
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Update Dockerfile (#5614) (#5616)
Pinned to use `numba==0.53.1` to avoid crashing in training with `num_workers > 0`. This is just a temporary workaround, still need to fix it in the future.
Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Co-authored-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make separate pretrained_name and model_path parameters
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* make "optional" tags bold in markdown
Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Move non-main functions to utils dir
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Temp workaround: Disable test with cache_audio=True since it is failing in CI (#5607) (#5615)
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Co-authored-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS] fix ranges of char set for accented letters. (#5607)
* [TTS] fix ranges of char set for accented letters.
* remove digits pattern and added unit tests for math operators.
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Change success message to reduce confusion (#5621)
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Update documentation and tutorials for Adapters (#5610)
* Improve docs for adapter and tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* Improve docs for adapter and tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Update test
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Rename test file
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS] add type hints and change varialbe names for tokenizers and g2p (#5602)
* [TTS] add type hints and change variable names for tokenizers and g2p
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* 1. Added missing import for gather_objects. (#5627)
Signed-off-by: Micha Livne <mlivne@nvidia.com>
Signed-off-by: Micha Livne <mlivne@nvidia.com>
Co-authored-by: Micha Livne <mlivne@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][ZH] add fastpitch and hifigan model NGC urls and update NeMo docs. (#5596) (#5625)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fixed RadTTS unit test (#5572)
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* remove tests (#5633)
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][DOC] add notes about automatic conversion to target sampling rates. (#5624) (#5634)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Conformer local attention (#5525)
* local attn and merge
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* optional
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* override
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* incorporate comments
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* update
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* fix
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* comment
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* changes, test
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* changes
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* check att context
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* readme link
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* utils
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* update
Signed-off-by: sam1373 <samuelkriman@gmail.com>
Signed-off-by: sam1373 <samuelkriman@gmail.com>
Signed-off-by: Samuel Kriman <samuelkriman@gmail.com>
Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add core classes and functions for online clustering diarizer part 1 (#5526)
* Add core classes and functions for online clustering diarizer
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add audio to labels code
Signed-off-by: Taejin Park <tango4j@gmail.com>
* resolve type errors
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* added unit=tests for very short audio
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Filled all missing docstrings
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* resolved conflict and added missing docstrings
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fixed unit-test errors
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* fix the wrongly added file - megatron_gpt_model.py
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Fix wrongly included file - megatron_gpt_model.py
Signed-off-by: Taejin Park <tango4j@gmail.com>
* resolve code quality issue
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Fixed unit-test errors and bugs
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* changed total_sec for offline_clustering toy_data in unit-tests
Signed-off-by: Taejin Park <tango4j@gmail.com>
* fixed merging index offset bug
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* only including part 1 files
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* removed unused function
Signed-off-by: Taejin Park <tango4j@gmail.com>
* fixed unused imports
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* divided nmesc_clustering.py into two and reflected first-pass comments
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* adding offline/online_clustering.py
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* fix code QL autocomment
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Removed unused imports
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Update nemo/collections/asr/parts/utils/online_clustering.py
Co-authored-by: Sean Naren <snarenthiran@nvidia.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Reflected comments
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* resolved code scanning issue
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Update nemo/collections/asr/parts/utils/offline_clustering.py
Co-authored-by: Sean Naren <snarenthiran@nvidia.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com>
Co-authored-by: Sean Naren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [STT] Add Esperanto (Eo) ASR Conformer-CTC and Conformer-Transducer models (#5639) (#5641)
* add stt_eo_conformer_ctc_large model
* stt_eo_conformer_transducer_large
Co-authored-by: Andrei Andrusenko <52885736+andrusenkoau@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Removed unused import
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Specify that filepaths need to be absolute
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* replaces any spaces in utt_id with dashes
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make hydra script callable by another script
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* do not specify default model or model_downsample_factor
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [Dockerfile] Remove AIS archive from docker image (#5629)
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Measure audio_sr from audio instead of needing to specify
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][ZH] Disambiguate polyphones with augmented dict and Jieba segmenter for Chinese FastPitch (#5541)
* Chinese TTS replaces default pypinyin dict
* Add jieba word segmenter as an option
Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>
Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make separate parameters for device of transcription and viterbi steps
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add mention of gecko
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [workflow] add exclude labels option to ignore cherry-picks in release changelog. (#5645)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][ZH] bugfix for the tutorial and add NGC CLI installation guide. (#5643) (#5647)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [Add] ASR+VAD Inference Pipeline (#5575)
Added offline ASR+VAD inference pipeline that matches with what's in RIVA, along with some feature-based ASR and classification datasets.
Signed-off-by: stevehuang52 <heh@nvidia.com>
Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* rename separator to ctm_grouping_separator and refactor
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Bert interleaved (#5556)
* Adding SP and SAR support Bert
* Adding Sequence parallel support to Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding Sequence parallel support to Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding SP and SAR support Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding SP and SAR support Bert
* Adding SP and SAR support Bert
* Adding Sequence parallel support to Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding Sequence parallel support to Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding Sequence parallel support to Bert
* Update bert_model.py
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
* Adding tests
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Addressing Eric's comments
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Addressing Eric's comments
* Fix bug fix sequence parallel and Interleaved
* Fix bug fix sequence parallel and Interleaved
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add duration padding support for RADTTS inference (#5650)
* Added duration padding support for RADTTS inference
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Co-authored-by: Kevin Shih <kshih@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add remove_blank_tokens_from_ctm parameter
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Dont save initial_silence line in CTM
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add DLLogger support to exp_manager (#5658)
* Add DLLogger support to exp_manager
Signed-off-by: Alexandre Milesi <milesial@users.noreply.github.com>
* Move dllogger to separate file and check import
Signed-off-by: Alexandre Milesi <milesial@users.noreply.github.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Remove unused import
Signed-off-by: Alexandre Milesi <milesial@users.noreply.github.com>
Signed-off-by: Alexandre Milesi <milesial@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* add minimum_timestamp_duration parameter
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* add suggestion about removing blanks to README
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* reorder args
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* clarify description of ctm_grouping_separator in README
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* update docstring
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][ZH] bugfix for ngc cli installation. (#5652) (#5664)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Port stateless timer to exp manager (#5584)
* Port stateless timer to exp manager
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fixes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fixes and remove from all megatron code
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fixes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Change message
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fix EMA restart by allowing device to be set by the class init (#5668)
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Remove SDP (moved to separate repo) - merge to main (#5630)
* Remove sdp files from tools folder
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add page to docs with new SDP location
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add interface for making amax reduction optional for FP8 (#5447)
* add TE interface for making amax reduction optional
Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS] add tts dict cust notebook (#5662)
* add tts dict cust notebook
Signed-off-by: ekmb <ebakhturina@nvidia.com>
* review
Signed-off-by: ekmb <ebakhturina@nvidia.com>
* fixed audio links
Signed-off-by: ekmb <ebakhturina@nvidia.com>
* remove old notebook
Signed-off-by: ekmb <ebakhturina@nvidia.com>
* fix typo
Signed-off-by: ekmb <ebakhturina@nvidia.com>
Signed-off-by: ekmb <ebakhturina@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [ASR] Audio processing base, multi-channel enhancement models (#5356)
* Audio processing base model, enc-mask-dec enhancement, tests and modules
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Addressed review comments
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Fixed CodeQL warnings
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Addressed PR comments
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Addressed PR comments:
- renamed AudioProcessingModel to AudioToAudioModel
- various small modifications
- updated unit tests
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Addressed comments
- Moved spectrogram to audio_preprocessing
- Renamed MultichannelFeatures
- Updated config and unit tests
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Expose ClusteringDiarizer device (#5681)
* Expose device for users to set
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Expose device for users to set
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add Beam Search support to ASR transcribe() (#5443)
* Add support for beam decoding via high level API.
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add ctc decoding section
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update ctc transcribe API to return results from beam search
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add argument to preserve arpa file
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update script to use hydra config, add some support for future compute timesteps, add doc for ctc decoding
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update eval script and doc to use new API
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add tests for ctc greedy decoding
Signed-off-by: smajumdar <titu1994@gmail.com>
* Address reviewer comments and add docstrings
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix changes and address comments
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Samuel Kriman <samuelkriman@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Propagate attention_dropout flag for GPT-3 (#5669)
* Propagate attention_dropout flag for GPT-3
Signed-off-by: Mikołaj Błaż <mblaz@nvidia.com>
* Add default to megatron_gpt_config
Signed-off-by: Mikołaj Błaż <mblaz@nvidia.com>
Signed-off-by: Mikołaj Błaż <mblaz@nvidia.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Eric H…
treacker
added a commit
that referenced
this pull request
Jan 25, 2023
* Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount * new structure for tts datasets in script folder Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * remove cmudict downloading Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * rename mixertts dataset, add vocoder dataset Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add libritts processing Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * update tts dataset and libritts get data Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix bugs in vocoder ds Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add ds * changed vits yaml * rm yaml * fix yaml and model * Added scaler * refactored yaml * managed to run in fp16 * refactoring Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix small bugs and add new todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix optimizers Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * Port Variational Inference with Adversarial Learning (VITS) to NeMo TTS (#6) * Add vits files Add vits_losses.py, vits_modules.py and vits.py. * Move non-vits models to modules * Add vits.yaml * Add _loader to vits.py * Add basic template for vits * Update vits.yaml with vits parameters * Remove extra space * Add top level training script * Add some variables to vits yaml * Add forward and training methods * Fix imports * Added validation step * Log training losses * Update loss calls to use class attributes * Add VITS to models list * Fix all imports * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Fix imports for VITS * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Add parameters from original VITS config * Fix config file * Fix imports and generate spec from audio * Fix incorrect dimensions * Progress update * Fix loss * Fix cuda thing * Fix monotonic align import * Fix typos in vits.py * Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Jason <jasoli@nvidia.com> Signed-off-by: Jason <jasoli@nvidia.com> * make new commit Signed-off-by: Jason <jasoli@nvidia.com> * add copyright headers Signed-off-by: Jason <jasoli@nvidia.com> * style Signed-off-by: Jason <jasoli@nvidia.com> * rename README Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix style without vits_modules Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add numba code, fix style and add todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * small fix * fix some todos * added numba mas * added DDP sampler * specified versions * fixed for new librosa version * added feature loss * added IPA phonemizer * refactored IPA g2p * added vits losses * some ref * fix * added checkpointing * cp * cfg * merged some 1.8.0 fixes * plt fix * fix logging * fix checkpoint loading * refactored inference * fp32 run * update branch Signed-off-by: ericharper <complex451@gmail.com> * update package info Signed-off-by: ericharper <complex451@gmail.com> * new exp * update branch Signed-off-by: ericharper <complex451@gmail.com> * Restored tests previously disabled for 22.03 base (#4109) Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * add augmentation to label models (#4113) * add augmentation to label models Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * duration fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Call register_bert_model after assigning self.bert_model variable (#4116) Signed-off-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> * Tutorial on ITN with Thutmose tagger and small fixes (#4117) * 1. Add tutorial. 2. Move a function to fix import in tutorial. 3. Merge multiple spaces into one space in the final output Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fixes for code review Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * Add tutorial to tutorials.rst Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update the default (#4135) Signed-off-by: ekmb <ebakhturina@nvidia.com> * Draft: Fix restoring from checkpoint for case when `model.common_dataset_parameters.label_vocab_dir` is provided (#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Do not create tmp directory Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix parameter name Signed-off-by: PeganovAnton <peganoff2@mail.ru> * finish cherry-pick op Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix labels errors Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove duplicate stage Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Change target branch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * fix typo (#4140) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Fix/punctuation avoid overwritting tmp files (#4144) * Add draft of fixing tmp files overwritting Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Use built-in tempfile library Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix code style Signed-off-by: PeganovAnton <peganoff2@mail.ru> * bug_fix_diarization_manifest_creation (#4125) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> * fix doc (#4146) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Tacotron2 retrain (#4103) * fix yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Fix for new TTSDataset class Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * inference fix Signed-off-by: treacker <emshabalin@yandex.ru> * removed old code Signed-off-by: treacker <emshabalin@yandex.ru> * updated parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * reverted version update Signed-off-by: treacker <emshabalin@yandex.ru> * refactored parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * Updated Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Update Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Update tacotron.yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: treacker <emshabalin@yandex.ru> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Fixed jenkins Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * Multiprocess improvements (#4127) * initial commit Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * start fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated scp to filelist Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * WaveGlow input type fixes (#4151) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * notebooks' link, typo and import fix (#4158) * redo missing pr 4007 Signed-off-by: fayejf <fayejf07@gmail.com> * remove extremely unreliable links Signed-off-by: fayejf <fayejf07@gmail.com> * Thutmose tagger bug fixes (#4162) * add pretrained ngc model, small fixes Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * 1. fix typos. 2. write magic functions without space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add example of inference with pretrained model Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * changed model location to nemo Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * style fix Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * update speaker docs (#4164) * update speaker docs Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * chunks -> segments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Khz -> kHz Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * changed to vits g2p * refactoring * added cosineLR * Updated whitelist path * added vanilla torch grad scaler * Fixed lightning version * added warmup and wd * switched to cosineLR * refactored data classes for vits * some fixes * fixed import * changeg train loop * fixed scheduler bug * refactoring for exps * Refactored loss logic * Ref for exps * added coqui stuff * exps * bugfix * added side file * bugfix * reverted * fixed sampler behaviour * updated for ptl 1.7.2 * refactored dataloader func * some cleaning * reverted to vanilla loss * modified for pickling * added dataset class * fixed torch version * added autocast for fp training * removed coqui files * 'Fixed tokenizer' * Fix tokenizer * update branch Signed-off-by: ericharper <complex451@gmail.com> * Fix link to inference notebook (#5247) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Update ASR scores table (#5254) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Fix links to speaker identification notebook (#5260) Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> * Minor typo fixes in TTS tutorial (#5266) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Pcla tutorial fixes (#5271) * Fixed typos Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed cell type and tatoeba reference Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed typo Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed branch variable Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix bug into Dialogue tutorial (#5277) * Typo fix (#5288) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix dialogue tutorial bug (#5297) * set add_pooling_layer=False for huggingface bert model * remove add_pooling_layer=False and set find_unused_parameters=True * set num_prompt_tokens to 0 for huggingface * small bugfix for r1.13.0 (#5310) * typo fix Signed-off-by: fayejf <fayejf07@gmail.com> * udpate transcribe Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: fayejf <fayejf07@gmail.com> * Add italian model checkpoints (#5316) Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> * [STT] Add Ru ASR Conformer-CTC and Conformer-Transducer (#5340) * [STT] Add stt_ru_conformer_ctc_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [STT] Add stt_ru_conformer_transducer_large Add stt_ru_conformer_transducer_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Pcla tutorial fixes (#5313) * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * moved `create_text_and_labels` to token_classification_utils.py Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * a lot of refactoring * strict ptl version * strict ptl version * reverted plt version * Added base text2audio class * Fix issue with HF Model upload tutorial (#5359) * Add Gradio App to ASR Docs (#5270) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> (cherry picked from commit e4b6a38) * Fix issue with normalized config for dataset name Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * tutorial fixes (#5354) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Add SDP documentation (#5274) * Add details to SDP README.md Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to WriteManifest processor Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to CreateInitialManifestMLS Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ModifyManifestTextProcessor docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ASRInference docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add base_processor docstrings Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add minimal SDP docs page Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Update tools/speech_dataset_processor/README.md Co-authored-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> * Write simple README for SDP and move complex explanations to docs Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove incorrect type hints Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Make config example less confusing Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Fix typo Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Clarify that YAML file is config file in README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove unused imports Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove SDP docs for now Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove links to docs in SDP README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> * [Bugfix] Added rm -f / wget- nc command in multispeaker sim notebook to r1.13.0 (#5375) * Fix minor error in notebook Signed-off-by: Taejin Park <tango4j@gmail.com> * changed branch name in tutorial notebook Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Taejin Park <tango4j@gmail.com> * Rename Speech Dataset Processor to Speech Data Processor (#5378) Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * fix for num worker 0 causing issues in losses after 1 epoch (#5379) * Fixed bug in notebook (#5382) Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> * Force MHA QKV onto fp32 (#5391) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Added scheduling variety * ref * Fix for prompt table restore error (#5393) * Fix for prompt table restore error Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added more saftey checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Added more condition checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Virginia Adams <vadams@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Fix args (#5410) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * bugfix * import tests * Add temporary fix for CUDA issue in Dockerfile (#5421) Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> * Megatron Export Update (#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added comment Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * updated del onnx_model place Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> * disable pc test (#5426) Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: ekmb <ebakhturina@nvidia.com> * Fix GPT generation when using sentencepiece tokenizer (#5413) * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Disable sync_batch_comm in validation_step for GPT (#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <complex451@gmail.com> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Empty Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Comment out test Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> * Revert "Add temporary fix for CUDA issue in Dockerfile (#5421)" (#5431) This reverts commit 0718b17. * Revert workaround for T5 that sets number of workers to 0 & sync_batch_comm=False (#5420) * Revert workers workaround Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix in config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Fixed discrepancies * updated Jenkisfile * updated Jenkisfile * Cleaning * fixed the onnx bug in conformer for non-streaming models. (#5242) (#5446) Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> * Set sync_batch_comm in other places (#5448) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Radtts 1.13 (#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Radtts 1.13 plus (#5457) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * Fixing RADTTS training - removing view buffer and fixing accuracy issue * Fixes for Torchscript/Triton * Added autocast to radtts UT * using cuda() for training example Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Add num layers check (#5470) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Change to kwargs (#5475) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) (#5478) * Initial refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for eval Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove comments Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Minor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove old comment Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (#5480) * updated export_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * minor bug fix (#5521) Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added set_start_method + function param bugfix (#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <complex451@gmail.com> * remove notebook (#5548) Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: ericharper <complex451@gmail.com> * Remove broadcast (#5558) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * cleaning * Fix all gather while writing to a file during T5 finetuning (#5561) * Gather from data parallel only instead of all ranks Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update readme Signed-off-by: ericharper <complex451@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added copyright * fixed imports * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * cleaning * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fixed filesize check * last cleaning Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated cmudict path * fixed merge bug Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * warnings fix * fix warnings Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * storing * updated version Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * update Jenkinsfile versions Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed issues Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed more issues * more fixes Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * added experimental tag * Clarification updates Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * docstring fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * imports fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix typo Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * excessive comtutations fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * typecheck fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Small refactoring * Small refactoring Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * reversed exp_manager params Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Fixed call for new function signature Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> Signed-off-by: Jason <jasoli@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: PeganovAnton <peganoff2@mail.ru> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> Co-authored-by: jasonjjl1999 <jasonjjl1999@gmail.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Oktai Tatanov <oktai.tatanov@gmail.com> Co-authored-by: jasonjjl1999 <43978361+jasonjjl1999@users.noreply.github.com> Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: Jason <jasoli@nvidia.com> Co-authored-by: ericharper <complex451@gmail.com> Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: Ramanathan Arunachalam <ramanathan.arun@rutgers.edu> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Evelina <10428420+ekmb@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: Jocelyn <jocelynh@nvidia.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Sean Naren <snarenthiran@nvidia.com> Co-authored-by: Matvei Novikov <mattyson.so@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> Co-authored-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Taejin Park <tango4j@gmail.com> Co-authored-by: Adi Renduchintala <108822655+arendu@users.noreply.github.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: yaoyu-33 <54727607+yaoyu-33@users.noreply.github.com> Co-authored-by: David <amosalla@asu.edu> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Kipok
added a commit
to Kipok/NeMo
that referenced
this pull request
Jan 31, 2023
* Merge r1.13.0 main (#5570)
* update branch
Signed-off-by: ericharper <complex451@gmail.com>
* Rename Speech Dataset Processor to Speech Data Processor (#5378)
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Megatron Export Update (#5343)
* export update for Megatron + change ORT optimization
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* updated export_utils to use autocast instead of manually casting >:/
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* removed dtype from LayerNorm
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* added comment
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* reverting changes on FloatCast
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* Cherry-picked changes from megatron-norm
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
* updated asr_model import to cast_utils
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* updated del onnx_model place
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* changed ort optimization to basic -> temp fix
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com>
* Disable sync_batch_comm in validation_step for GPT (#5397)
* disable sync_batch_comm in validation_step
Signed-off-by: ericharper <complex451@gmail.com>
* Read sync_batch_comm from config or default to False
Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com>
* Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error
Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com>
* Empty
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Comment out test
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com>
Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
* Radtts 1.13 (#5451)
* [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358)
* [TTS] add CI test for RADTTS training recipe.
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
* Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) (#5478)
* Initial refactor
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Resolve config before passing to load_from_checkpoint
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fixes for model parallel and nemo restore
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fixes for eval
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Revert config changes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Refactor
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix typo
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Remove comments
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Minor
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix validation reconfiguration
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Remove old comment
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fixes for test_ds
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* export_utils bugfix (#5480)
* updated export_utils
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* Export fixes for Riva (#5496)
* Export fixes for Riva
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
* Cleaning up training_utils
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
* added set_start_method + function param bugfix (#5539)
* added set_start_method + function param bugfix
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* upper bound torchmetrics
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Signed-off-by: ericharper <complex451@gmail.com>
Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: ericharper <complex451@gmail.com>
* remove notebook (#5548)
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: ericharper <complex451@gmail.com>
* update readme
Signed-off-by: ericharper <complex451@gmail.com>
* update branch
Signed-off-by: ericharper <complex451@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com>
Co-authored-by: David <amosalla@asu.edu>
Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com>
Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com>
Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Optimized loop and bugfix in SDE (#5573)
- Fixed bug with loading custom data attributes from JSON in Speech Data Explorer
Signed-off-by: George Zelenfroynd <gzelenfroind@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Update torchmetrics (#5566)
* add task arg
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
* update state
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
Co-authored-by: Taejin Park <tango4j@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* remove useless files. (#5580)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* add initial NFA code
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make use of the specified device during viterbi decoding
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fix CodeQL notes
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fix CodeQL warning
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add an option to defer data setup from ``__init__`` to ``setup`` (#5569)
* Add an option to defer dataloader setup from __init__ to setup
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Updated doc
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make utt_id specified by number of parts of audio_filepath user wishes to use
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* remove audio_sr TODO - reduce risk of silent bugs
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add check that model is CTC
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Remove unused import
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Text generation improvement (UI client, data parallel support) (#5437)
* Squashed commit of the following:
commit a5e124f34be31bd6eafe5e5fdf5bedcd0d50915c
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Thu Oct 13 15:07:42 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit 35b424044fe80c3081e7756ab21244f701716f7e
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 08:04:49 2022 -0700
get rid of base
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 2955210e2311791543538cfbb5ad26b79414c954
Merge: d52edef8c eaf6757ca
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 13:17:02 2022 +0000
Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt
commit d52edef8cd7b36593838fb270047e80f8ccb652e
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 13:16:24 2022 +0000
align with main
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit eaf6757ca5be8e099492f57c81d984429b0ad49c
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Thu Oct 13 13:12:11 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit c4b86d97626ea0721bf8fb4c0a45dec5becc94c9
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 13:10:58 2022 +0000
same as main
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit e335de51bcc0d681c58b568c3d8c238bc5687c3b
Merge: c231086e0 4463a9fe9
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 13:08:09 2022 +0000
Merge branch 'main' into universal_prompt
commit c231086e057f1efaa915f691d84664cb3d5aad85
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Wed Oct 12 19:59:12 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit 6a821a4b49a23dd3408a706a2a3dd393149b0bb1
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 19:56:17 2022 +0000
default to pad
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 9d908e39fef1beed9ba2da4d1a6806161eb7ef25
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 19:55:44 2022 +0000
add the option to pad the tokens
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 876dc395b43fdeeaa2bcbbe13c76523633764c33
Merge: fbb0f4035 fe3c77ee9
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 19:20:47 2022 +0000
Merge branch 'fix_global_init' into universal_prompt
commit fe3c77ee93ab6cf3ea152db68cb6beefcac2a392
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 18:59:49 2022 +0000
fix import again
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit fbb0f4035c6cd6bfefed50a20605503de8c1dccb
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Wed Oct 12 16:00:24 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit 372ca8c0d7988f2339b15888dc72aa21f4fb6937
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 15:58:32 2022 +0000
enable server
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit cbe05d9fbc978f812cfbb671f45f147f300713c4
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 13:07:28 2022 +0000
fix comment error
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 1948048922e726ec6131e44b1a745389f18d4ef2
Merge: 232c2cce3 984f5c09a
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 13:05:30 2022 +0000
Merge branch 'fix_global_init' into universal_prompt
commit 232c2cce34d7a8b902da406706f3dd9b39475091
Merge: 34c8a68df 658243fb6
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 12:50:00 2022 +0000
Merge branch 'fix_global_init' into universal_prompt
commit 984f5c09a6dbf1d1fb5aa30ed9b0df188e66a50f
Merge: 658243fb6 3fda5de46
Author: Yi Dong <43824965+yidong72@users.noreply.github.com>
Date: Wed Oct 12 08:42:11 2022 -0400
Merge branch 'main' into fix_global_init
commit 658243fb6580191b5d60edd30cde16dcc23cbb85
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 12 12:40:57 2022 +0000
fix import error
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 8e0fe1cad05ec288ec122b3cd0e139a96872e08c
Author: Yi Dong <doyend@gmail.com>
Date: Tue Oct 11 22:44:12 2022 +0000
update the fused kernel
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 536cf6bef9447b75843fad630729c47a2fba35f3
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 11 14:44:52 2022 -0700
add the missing file
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 1b437ec41dc5e354453ce0a089bca0171cbcb6c2
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 11 14:43:14 2022 -0700
fix fused softmax
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 7813f60e05f9783af61f8c14ec1cb0c6c4f1f263
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 11 14:16:48 2022 -0700
move global step to base
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 34c8a68df084b18d377e84415d9f07b2cd6673dd
Author: Yi Dong <doyend@gmail.com>
Date: Thu Oct 6 13:50:11 2022 +0000
fix pipeline for eval
Signed-off-by: Yi Dong <doyend@gmail.com>
commit eee5d38218f26660c3ffebe9f615c850c80a1f0d
Author: Yi Dong <doyend@gmail.com>
Date: Thu Oct 6 13:48:22 2022 +0000
fix for pipleline parallel
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 323bca73e7ef6099ee79c0a2fffac7b709ed6c5d
Merge: 125e49947 e3b4c4d1f
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 19:29:13 2022 +0000
Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt
commit 125e4994760448ff75dd9328395813eda1c87547
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 19:29:04 2022 +0000
add share option
Signed-off-by: Yi Dong <doyend@gmail.com>
commit e3b4c4d1f7346c9fa596f3cca6d4df0a9e05c368
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 5 11:43:48 2022 -0700
make sure consolidation works
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit a5c833964ecf05dc460ca1da69275c4019742150
Merge: 2a07ab52d abcb74be2
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 18:40:29 2022 +0000
Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt
commit 2a07ab52d95f15ba666823028c69e23825666c05
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 18:40:23 2022 +0000
added requirement
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 3abecd9dd1611993a87c537636abe7f7e6a9b04c
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 18:39:42 2022 +0000
added a simple web server
Signed-off-by: Yi Dong <doyend@gmail.com>
commit abcb74be2caf1cdec40eb9ba2be4dde4d45a3b4b
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 5 06:54:12 2022 -0700
fix empty val loss
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit b8eb92ac4a0d665570af75e34c9ba3c2e2420c26
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 19:25:30 2022 -0700
text gen working
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit d59f3e3f3a6fd19736d1c5706fed65a3dd4049ba
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 16:08:40 2022 -0700
first change
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 59d077585e6962a669b824af58f64e8a0bea6547
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 15:00:40 2022 -0700
revert
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 12a0f3902d99e9179403644bd951c045df716ca7
Author: Yi Dong <doyend@gmail.com>
Date: Tue Oct 4 21:26:23 2022 +0000
init imp
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 62a15dfd943cc48be495ac61b9f2f00995775c5f
Merge: 82c90d2cd e0cc6b767
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 11:58:26 2022 -0700
Merge branch 'main' into universal_prompt
commit 82c90d2cd0fd156f16a4b899f8c741d598f33990
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 11:17:13 2022 -0700
add sync
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 9819b703eef877d90cd1257bf3610c69de9b4d7e
Author: Yi Dong <yidong@nvidia.com>
Date: Sun Oct 2 17:52:34 2022 -0700
fix save model
Signed-off-by: root <root@luna-0197.selene.nvidia.com>
commit e4937e2fc5fb7d70754c97668416e4a69c3079fe
Author: Yi Dong <yidong@nvidia.com>
Date: Sat Oct 1 18:56:09 2022 +0000
working
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit b73b06d1c7cf5417a6d87cb33d8ed83a57e38b7b
Author: Yi Dong <yidong@nvidia.com>
Date: Sat Oct 1 17:34:03 2022 +0000
calcuate the mask
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 9db3bc13eb65a94a475b837603351da68e3745bc
Author: Yi Dong <yidong@nvidia.com>
Date: Fri Sep 30 23:26:32 2022 +0000
fix bug in datasets
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit f289900375d4412f53f8110be00fec6587627550
Author: Yi Dong <yidong@nvidia.com>
Date: Fri Sep 30 22:29:40 2022 +0000
update the code
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 8e28a1f208aabaab72dbe769e72756baada04d99
Author: Yi Dong <yidong@nvidia.com>
Date: Fri Sep 30 21:52:52 2022 +0000
added new ds
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 8d41315bab7ce90e200a8a7d1023c34f8e046897
Author: Yi Dong <doyend@gmail.com>
Date: Fri Sep 30 18:57:09 2022 +0000
added new files
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 984e0e94e15e16323c1ba1ca2efeabd84f69463f
Merge: cbe8b7ab1 fa6cd8588
Author: Yi Dong <doyend@gmail.com>
Date: Thu Sep 29 21:43:29 2022 +0000
Merge branch 'llm-prompt-learning-improvements' into universal_prompt
commit fa6cd858839277939446afe7275976078d54c512
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Thu Sep 29 16:47:30 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit 78ba46e5d6fde1be53c08e1e30a54cce59824be0
Merge: 7d6d46742 8d670bc77
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Thu Sep 29 09:43:27 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit 7d6d46742170a66758287a207d67e1b1bfd15613
Author: Virginia Adams <vadams@nvidia.com>
Date: Thu Sep 29 16:42:43 2022 +0000
Removed inference step and added sentence peice check to predict step
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 20fd265acd6f7f9912cf52155fe66ccfa6b201a2
Author: Virginia Adams <vadams@nvidia.com>
Date: Thu Sep 29 15:26:32 2022 +0000
fixed first stage check for pipeline parallel T5 pt
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 3637be2b258c8d9028856f9971edb7da4a8121f0
Merge: a3ea722fd 986a76612
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Wed Sep 28 10:23:30 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit a3ea722fdc12fbcc5989b76ef5643a574b763bc4
Merge: 770967a52 971485ce7
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Mon Sep 26 13:35:52 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit 770967a5251a474b6dcc2d44bf9a2076adbcb604
Merge: d23bf6c30 e3ac280a8
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Mon Sep 26 10:17:03 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit d23bf6c30acc0e3f6af9b4e24547669866a34d62
Merge: de6a31651 333d2b749
Author: Virginia Adams <vadams@nvidia.com>
Date: Mon Sep 26 10:05:16 2022 -0700
Merge branch 'llm-prompt-learning-improvements' of https://github.com/NVIDIA/NeMo into llm-prompt-learning-improvements
commit de6a31651e63d88a42b971794d93f18ff5a3cdff
Author: Virginia Adams <vadams@nvidia.com>
Date: Mon Sep 26 17:00:53 2022 +0000
Updated PP check to be on first stage pipeline only
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 333d2b7498e6742ce66436f733c980a74616900c
Merge: 592c0986a a39fc925a
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Fri Sep 23 16:11:21 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit 592c0986a476a91b57b8605d7b70830d7acfa021
Author: Virginia Adams <vadams@nvidia.com>
Date: Fri Sep 23 23:08:41 2022 +0000
Fixed unused import and CI test bug
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit ea9cd82d85638bc60ae4ad7ef105db931c8e3455
Merge: ce4b72c8c b566c2d0e
Author: Virginia Adams <vadams@nvidia.com>
Date: Fri Sep 23 18:57:25 2022 +0000
Merge branch 'llm-prompt-learning-improvements' of https://github.com/NVIDIA/NeMo into llm-prompt-learning-improvements
commit ce4b72c8c52f32be336e323dd78a38089edc3e7c
Author: Virginia Adams <vadams@nvidia.com>
Date: Fri Sep 23 18:57:16 2022 +0000
Switch to import from base class
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit b566c2d0e35a068f758fd1310bc620a47be4590b
Merge: 6621f2854 e872061ac
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Fri Sep 23 10:09:03 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit 6621f28543828a48484a5637f6c9f3ccb23a5b02
Author: Virginia Adams <vadams@nvidia.com>
Date: Wed Sep 14 20:47:35 2022 +0000
python format fix
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 8deafc8987b6af5f7b99a250310f57a40198c37f
Author: Virginia Adams <vadams@nvidia.com>
Date: Wed Sep 14 20:28:02 2022 +0000
Save .nemo on new best val score
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 761bd36969cb465d6a129e9eee6ce1f883d3cf41
Author: Virginia Adams <vadams@nvidia.com>
Date: Wed Sep 14 18:03:19 2022 +0000
Added automatic checkpoint to nemo file method
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 3be4ed57b6cd3ddfe4876d78650dfe8fe794598b
Author: Virginia Adams <vadams@nvidia.com>
Date: Wed Sep 14 02:11:56 2022 +0000
Make GPT use base prompt learning model class:
Signed-off-by: Virginia Adams <vadams@nvidia.com>
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix LGTM
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix validation
Signed-off-by: Yi Dong <yidong@nvidia.com>
* change for the lm eval
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* make text generation work in data parallel environment
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* implement the service with rest service
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* surpress log
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fix
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fixes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Update config
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Restore function needed for NMT
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* handles no answer only
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Fix config
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* added knn to web
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix lgtm.com comments
Signed-off-by: Yi Dong <yidong@nvidia.com>
* output the retrieved context
Signed-off-by: Yi Dong <yidong@nvidia.com>
* allow no neighbor query
Signed-off-by: Yi Dong <yidong@nvidia.com>
* remove the imports
Signed-off-by: Yi Dong <yidong@nvidia.com>
* warn only once
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Change output file format from JSON to JSONL
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* new t0 dataset
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Add T0 data preproc scripts
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Merge and multiprocessing
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix for is_correct
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* fix epoch > 2
Signed-off-by: Yi Dong <yidong@nvidia.com>
* handles multiple dataloader
Signed-off-by: Yi Dong <yidong@nvidia.com>
* remove template
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Refactor T0 dataset
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Add script to merge train folder into individual training files to minimize number of blends
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* added on the fly service
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add combo instance
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* added combo service
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* send weights back to server
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* fix index store
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Minor changes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add reset button
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add add eos
Signed-off-by: Yi Dong <yidong@nvidia.com>
* use a seperate bert service
Signed-off-by: Yi Dong <yidong@nvidia.com>
* no loss of accuracy
Signed-off-by: Yi Dong <yidong@nvidia.com>
* pin the gradio version
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Remove bin compat
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fix header lines
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* evaluate based on text generation
Signed-off-by: Yi Dong <yidong@nvidia.com>
* exact match result aggregation
Signed-off-by: Yi Dong <yidong@nvidia.com>
* working SP and SA
Signed-off-by: Yi Dong <yidong@nvidia.com>
* sync
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix checkpoint
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix eval
Signed-off-by: Yi Dong <yidong@nvidia.com>
* backup states
Signed-off-by: Yi Dong <yidong@nvidia.com>
* backup states reset
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix the bug
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix evaluation for sentence piece
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix a bug
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* potential fix in the future
Signed-off-by: Yi Dong <yidong@nvidia.com>
* remove the universal codes
Signed-off-by: Yi Dong <yidong@nvidia.com>
* remove universal strategy
Signed-off-by: Yi Dong <yidong@nvidia.com>
* address reviewer comment
Signed-off-by: Yi Dong <yidong@nvidia.com>
Signed-off-by: Yi Dong <yidong@nvidia.com>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add align function docstrings and make most args optional
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Remove redundant returns of viterbi and log probs matrices
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Rename h# to <initial_silence>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Update manifest format description in README
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* always remove any spaces from utt_id
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Patch the hanging of threads on very large stderr (#5589) (#5590)
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* O2 style amp for gpt3 ptuning (#5246)
* enable amp o2 plugin
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* only create master param if param requires gradient
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* remove pytorch autocast
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* Update optimizer_with_main_params.py
Signed-off-by: JimmyZhang12 <67203904+JimmyZhang12@users.noreply.github.com>
* create master grad only if param group requires grad
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* fix grad scaler for pp > 1
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
Signed-off-by: JimmyZhang12 <67203904+JimmyZhang12@users.noreply.github.com>
Co-authored-by: Jimmy Zhang <jiemingz@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Better patch hydra (#5591) (#5592)
* Readd buffereing and thread drain to Hydra Launcher
Signed-off-by: smajumdar <titu1994@gmail.com>
* Readd buffereing and thread drain to Hydra Launcher
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Yet another fix with hydra multirun (#5594) (#5595)
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add RETRO model documentation (#5578)
* added retro doc
Signed-off-by: Yi Dong <yidong@nvidia.com>
* finish data part
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added the data format
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added training script
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added training and evaluation steps
Signed-off-by: Yi Dong <yidong@nvidia.com>
* edit the text
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added the images
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix beginning
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix the grammar
Signed-off-by: Yi Dong <yidong@nvidia.com>
* trim it down
Signed-off-by: Yi Dong <yidong@nvidia.com>
* add wandb option
Signed-off-by: Yi Dong <yidong@nvidia.com>
* add reference
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix path
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added the parameters table
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix section
Signed-off-by: Yi Dong <yidong@nvidia.com>
Signed-off-by: Yi Dong <yidong@nvidia.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fix: setup_multiple validation/test data (#5585)
Fix: setup_multiple validation/test data (#5585)
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Move to optimizer based EMA implementation (#5169)
* Move to optimizer
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Fix replacing weights
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Allow swapping of weights be optional
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Save 2 models
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Use different hook
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Expose cpu device
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add clause to see if this fixes issue with O2 optimizer
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Try to get O2 working
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* WIP
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Fixes
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Fixes to tests
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add guard
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Remove import
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add guard
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add comment
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Remove overwrite
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add BatchNorm, currently tests fail
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Fix tests/functionality for batch norm
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Get rid of NLP changes
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* AIStore for ASR datasets (#5462)
AIStore for ASR datasets
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add support for MHA adapters to ASR (#5396)
* Convert AbstractAdapterModule to AbstractAdapterMixin
Signed-off-by: smajumdar <titu1994@gmail.com>
* Temporary fixes to new signature of mixin
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add adapter util for constants, add all mha adapters.
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update name of function
Signed-off-by: smajumdar <titu1994@gmail.com>
* Roll back changes to convASR
Signed-off-by: smajumdar <titu1994@gmail.com>
* Convert AbstractAdapterModule to AbstractAdapterMixin
Signed-off-by: smajumdar <titu1994@gmail.com>
* First draft of Conformer support for MHA attention
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add some preliminary tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add support for projection of the hidden dimension for attention
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add support for squeezeformer
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update train adapter config
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add tests for squeezeformer and unit tests for new modules
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update config for hp search,set limits on modules for conformer and squeezeformer, update adapter mixin, add cache to import_from_class_path
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update location of adapters
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add pre_norm for proper attention learning, Fix the issue with nan/inf in pos_bias_u and pos_bias_v
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update expmanager to clean up checkpoints
Signed-off-by: smajumdar <titu1994@gmail.com>
* Fix style
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add docstrings and update tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add docstrings and update tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add docstrings and update tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Update training scripts
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update config and docs
Signed-off-by: smajumdar <titu1994@gmail.com>
* Expose nemo delete function
Signed-off-by: smajumdar <titu1994@gmail.com>
* Correct adapter partial state saving
Signed-off-by: smajumdar <titu1994@gmail.com>
* Correct a bug with state management of adapter tokens
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Pull down EMA test
Signed-off-by: smajumdar <titu1994@gmail.com>
* Correct name of adapter module utility class
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Remove unused TTS eval functions w/ pesq and pystoi dependencies (#5605) (#5606)
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Co-authored-by: Jocelyn <jocelynh@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Create separator parameter
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Call align function with hydra config
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* update usage example
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Update Dockerfile (#5614) (#5616)
Pinned to use `numba==0.53.1` to avoid crashing in training with `num_workers > 0`. This is just a temporary workaround, still need to fix it in the future.
Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Co-authored-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make separate pretrained_name and model_path parameters
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* make "optional" tags bold in markdown
Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Move non-main functions to utils dir
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Temp workaround: Disable test with cache_audio=True since it is failing in CI (#5607) (#5615)
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Co-authored-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS] fix ranges of char set for accented letters. (#5607)
* [TTS] fix ranges of char set for accented letters.
* remove digits pattern and added unit tests for math operators.
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Change success message to reduce confusion (#5621)
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Update documentation and tutorials for Adapters (#5610)
* Improve docs for adapter and tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* Improve docs for adapter and tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Update test
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Rename test file
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS] add type hints and change varialbe names for tokenizers and g2p (#5602)
* [TTS] add type hints and change variable names for tokenizers and g2p
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* 1. Added missing import for gather_objects. (#5627)
Signed-off-by: Micha Livne <mlivne@nvidia.com>
Signed-off-by: Micha Livne <mlivne@nvidia.com>
Co-authored-by: Micha Livne <mlivne@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][ZH] add fastpitch and hifigan model NGC urls and update NeMo docs. (#5596) (#5625)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fixed RadTTS unit test (#5572)
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* remove tests (#5633)
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][DOC] add notes about automatic conversion to target sampling rates. (#5624) (#5634)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Conformer local attention (#5525)
* local attn and merge
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* optional
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* override
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* incorporate comments
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* update
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* fix
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* comment
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* changes, test
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* changes
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* check att context
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* readme link
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* utils
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* update
Signed-off-by: sam1373 <samuelkriman@gmail.com>
Signed-off-by: sam1373 <samuelkriman@gmail.com>
Signed-off-by: Samuel Kriman <samuelkriman@gmail.com>
Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add core classes and functions for online clustering diarizer part 1 (#5526)
* Add core classes and functions for online clustering diarizer
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add audio to labels code
Signed-off-by: Taejin Park <tango4j@gmail.com>
* resolve type errors
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* added unit=tests for very short audio
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Filled all missing docstrings
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* resolved conflict and added missing docstrings
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fixed unit-test errors
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* fix the wrongly added file - megatron_gpt_model.py
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Fix wrongly included file - megatron_gpt_model.py
Signed-off-by: Taejin Park <tango4j@gmail.com>
* resolve code quality issue
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Fixed unit-test errors and bugs
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* changed total_sec for offline_clustering toy_data in unit-tests
Signed-off-by: Taejin Park <tango4j@gmail.com>
* fixed merging index offset bug
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* only including part 1 files
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* removed unused function
Signed-off-by: Taejin Park <tango4j@gmail.com>
* fixed unused imports
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* divided nmesc_clustering.py into two and reflected first-pass comments
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* adding offline/online_clustering.py
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* fix code QL autocomment
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Removed unused imports
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Update nemo/collections/asr/parts/utils/online_clustering.py
Co-authored-by: Sean Naren <snarenthiran@nvidia.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Reflected comments
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* resolved code scanning issue
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Update nemo/collections/asr/parts/utils/offline_clustering.py
Co-authored-by: Sean Naren <snarenthiran@nvidia.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com>
Co-authored-by: Sean Naren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [STT] Add Esperanto (Eo) ASR Conformer-CTC and Conformer-Transducer models (#5639) (#5641)
* add stt_eo_conformer_ctc_large model
* stt_eo_conformer_transducer_large
Co-authored-by: Andrei Andrusenko <52885736+andrusenkoau@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Removed unused import
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Specify that filepaths need to be absolute
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* replaces any spaces in utt_id with dashes
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make hydra script callable by another script
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* do not specify default model or model_downsample_factor
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [Dockerfile] Remove AIS archive from docker image (#5629)
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Measure audio_sr from audio instead of needing to specify
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][ZH] Disambiguate polyphones with augmented dict and Jieba segmenter for Chinese FastPitch (#5541)
* Chinese TTS replaces default pypinyin dict
* Add jieba word segmenter as an option
Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>
Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make separate parameters for device of transcription and viterbi steps
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add mention of gecko
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [workflow] add exclude labels option to ignore cherry-picks in release changelog. (#5645)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][ZH] bugfix for the tutorial and add NGC CLI installation guide. (#5643) (#5647)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [Add] ASR+VAD Inference Pipeline (#5575)
Added offline ASR+VAD inference pipeline that matches with what's in RIVA, along with some feature-based ASR and classification datasets.
Signed-off-by: stevehuang52 <heh@nvidia.com>
Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* rename separator to ctm_grouping_separator and refactor
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Bert interleaved (#5556)
* Adding SP and SAR support Bert
* Adding Sequence parallel support to Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding Sequence parallel support to Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding SP and SAR support Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding SP and SAR support Bert
* Adding SP and SAR support Bert
* Adding Sequence parallel support to Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding Sequence parallel support to Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding Sequence parallel support to Bert
* Update bert_model.py
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
* Adding tests
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Addressing Eric's comments
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Addressing Eric's comments
* Fix bug fix sequence parallel and Interleaved
* Fix bug fix sequence parallel and Interleaved
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add duration padding support for RADTTS inference (#5650)
* Added duration padding support for RADTTS inference
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Co-authored-by: Kevin Shih <kshih@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add remove_blank_tokens_from_ctm parameter
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Dont save initial_silence line in CTM
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add DLLogger support to exp_manager (#5658)
* Add DLLogger support to exp_manager
Signed-off-by: Alexandre Milesi <milesial@users.noreply.github.com>
* Move dllogger to separate file and check import
Signed-off-by: Alexandre Milesi <milesial@users.noreply.github.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Remove unused import
Signed-off-by: Alexandre Milesi <milesial@users.noreply.github.com>
Signed-off-by: Alexandre Milesi <milesial@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* add minimum_timestamp_duration parameter
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* add suggestion about removing blanks to README
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* reorder args
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* clarify description of ctm_grouping_separator in README
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* update docstring
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][ZH] bugfix for ngc cli installation. (#5652) (#5664)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Port stateless timer to exp manager (#5584)
* Port stateless timer to exp manager
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fixes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fixes and remove from all megatron code
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fixes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Change message
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fix EMA restart by allowing device to be set by the class init (#5668)
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Remove SDP (moved to separate repo) - merge to main (#5630)
* Remove sdp files from tools folder
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add page to docs with new SDP location
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add interface for making amax reduction optional for FP8 (#5447)
* add TE interface for making amax reduction optional
Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS] add tts dict cust notebook (#5662)
* add tts dict cust notebook
Signed-off-by: ekmb <ebakhturina@nvidia.com>
* review
Signed-off-by: ekmb <ebakhturina@nvidia.com>
* fixed audio links
Signed-off-by: ekmb <ebakhturina@nvidia.com>
* remove old notebook
Signed-off-by: ekmb <ebakhturina@nvidia.com>
* fix typo
Signed-off-by: ekmb <ebakhturina@nvidia.com>
Signed-off-by: ekmb <ebakhturina@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [ASR] Audio processing base, multi-channel enhancement models (#5356)
* Audio processing base model, enc-mask-dec enhancement, tests and modules
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Addressed review comments
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Fixed CodeQL warnings
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Addressed PR comments
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Addressed PR comments:
- renamed AudioProcessingModel to AudioToAudioModel
- various small modifications
- updated unit tests
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Addressed comments
- Moved spectrogram to audio_preprocessing
- Renamed MultichannelFeatures
- Updated config and unit tests
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Expose ClusteringDiarizer device (#5681)
* Expose device for users to set
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Expose device for users to set
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add Beam Search support to ASR transcribe() (#5443)
* Add support for beam decoding via high level API.
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add ctc decoding section
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update ctc transcribe API to return results from beam search
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add argument to preserve arpa file
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update script to use hydra config, add some support for future compute timesteps, add doc for ctc decoding
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update eval script and doc to use new API
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add tests for ctc greedy decoding
Signed-off-by: smajumdar <titu1994@gmail.com>
* Address reviewer comments and add docstrings
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix changes and address comments
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Samuel Kriman <samuelkriman@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Propagate attention_dropout flag for GPT-3 (#5669)
* Propagate attention_dropout flag for GPT-3
Signed-off-by: Mikołaj Błaż <mblaz@nvidia.com>
* Add default to megatron_gpt_config
Signed-off-by: Mikołaj Błaż <mblaz@nvidia.com>
Signed-off-by: Mikołaj Błaż <mblaz@nvidia.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Eric H…
Kipok
added a commit
to Kipok/NeMo
that referenced
this pull request
Jan 31, 2023
* Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount * new structure for tts datasets in script folder Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * remove cmudict downloading Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * rename mixertts dataset, add vocoder dataset Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add libritts processing Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * update tts dataset and libritts get data Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix bugs in vocoder ds Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add ds * changed vits yaml * rm yaml * fix yaml and model * Added scaler * refactored yaml * managed to run in fp16 * refactoring Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix small bugs and add new todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix optimizers Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * Port Variational Inference with Adversarial Learning (VITS) to NeMo TTS (NVIDIA-NeMo#6) * Add vits files Add vits_losses.py, vits_modules.py and vits.py. * Move non-vits models to modules * Add vits.yaml * Add _loader to vits.py * Add basic template for vits * Update vits.yaml with vits parameters * Remove extra space * Add top level training script * Add some variables to vits yaml * Add forward and training methods * Fix imports * Added validation step * Log training losses * Update loss calls to use class attributes * Add VITS to models list * Fix all imports * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Fix imports for VITS * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Add parameters from original VITS config * Fix config file * Fix imports and generate spec from audio * Fix incorrect dimensions * Progress update * Fix loss * Fix cuda thing * Fix monotonic align import * Fix typos in vits.py * Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Jason <jasoli@nvidia.com> Signed-off-by: Jason <jasoli@nvidia.com> * make new commit Signed-off-by: Jason <jasoli@nvidia.com> * add copyright headers Signed-off-by: Jason <jasoli@nvidia.com> * style Signed-off-by: Jason <jasoli@nvidia.com> * rename README Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix style without vits_modules Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add numba code, fix style and add todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * small fix * fix some todos * added numba mas * added DDP sampler * specified versions * fixed for new librosa version * added feature loss * added IPA phonemizer * refactored IPA g2p * added vits losses * some ref * fix * added checkpointing * cp * cfg * merged some 1.8.0 fixes * plt fix * fix logging * fix checkpoint loading * refactored inference * fp32 run * update branch Signed-off-by: ericharper <complex451@gmail.com> * update package info Signed-off-by: ericharper <complex451@gmail.com> * new exp * update branch Signed-off-by: ericharper <complex451@gmail.com> * Restored tests previously disabled for 22.03 base (NVIDIA-NeMo#4109) Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * add augmentation to label models (NVIDIA-NeMo#4113) * add augmentation to label models Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * duration fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Call register_bert_model after assigning self.bert_model variable (NVIDIA-NeMo#4116) Signed-off-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> * Tutorial on ITN with Thutmose tagger and small fixes (NVIDIA-NeMo#4117) * 1. Add tutorial. 2. Move a function to fix import in tutorial. 3. Merge multiple spaces into one space in the final output Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fixes for code review Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * Add tutorial to tutorials.rst Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * cleaned up TN/ ITN doc (NVIDIA-NeMo#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Check implicit grad acc in GLUE dataset building (NVIDIA-NeMo#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update the default (NVIDIA-NeMo#4135) Signed-off-by: ekmb <ebakhturina@nvidia.com> * Draft: Fix restoring from checkpoint for case when `model.common_dataset_parameters.label_vocab_dir` is provided (NVIDIA-NeMo#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Do not create tmp directory Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix parameter name Signed-off-by: PeganovAnton <peganoff2@mail.ru> * finish cherry-pick op Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix labels errors Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove duplicate stage Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Change target branch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * fix typo (NVIDIA-NeMo#4140) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Fix/punctuation avoid overwritting tmp files (NVIDIA-NeMo#4144) * Add draft of fixing tmp files overwritting Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Use built-in tempfile library Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix code style Signed-off-by: PeganovAnton <peganoff2@mail.ru> * bug_fix_diarization_manifest_creation (NVIDIA-NeMo#4125) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> * fix doc (NVIDIA-NeMo#4146) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Tacotron2 retrain (NVIDIA-NeMo#4103) * fix yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Fix for new TTSDataset class Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * inference fix Signed-off-by: treacker <emshabalin@yandex.ru> * removed old code Signed-off-by: treacker <emshabalin@yandex.ru> * updated parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * reverted version update Signed-off-by: treacker <emshabalin@yandex.ru> * refactored parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * Updated Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Update Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Update tacotron.yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * cleaned up TN/ ITN doc (NVIDIA-NeMo#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: treacker <emshabalin@yandex.ru> * Check implicit grad acc in GLUE dataset building (NVIDIA-NeMo#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Fixed jenkins Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * Multiprocess improvements (NVIDIA-NeMo#4127) * initial commit Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * start fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated scp to filelist Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * WaveGlow input type fixes (NVIDIA-NeMo#4151) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * notebooks' link, typo and import fix (NVIDIA-NeMo#4158) * redo missing pr 4007 Signed-off-by: fayejf <fayejf07@gmail.com> * remove extremely unreliable links Signed-off-by: fayejf <fayejf07@gmail.com> * Thutmose tagger bug fixes (NVIDIA-NeMo#4162) * add pretrained ngc model, small fixes Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * 1. fix typos. 2. write magic functions without space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add example of inference with pretrained model Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * changed model location to nemo Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * style fix Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * update speaker docs (NVIDIA-NeMo#4164) * update speaker docs Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * chunks -> segments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Khz -> kHz Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * changed to vits g2p * refactoring * added cosineLR * Updated whitelist path * added vanilla torch grad scaler * Fixed lightning version * added warmup and wd * switched to cosineLR * refactored data classes for vits * some fixes * fixed import * changeg train loop * fixed scheduler bug * refactoring for exps * Refactored loss logic * Ref for exps * added coqui stuff * exps * bugfix * added side file * bugfix * reverted * fixed sampler behaviour * updated for ptl 1.7.2 * refactored dataloader func * some cleaning * reverted to vanilla loss * modified for pickling * added dataset class * fixed torch version * added autocast for fp training * removed coqui files * 'Fixed tokenizer' * Fix tokenizer * update branch Signed-off-by: ericharper <complex451@gmail.com> * Fix link to inference notebook (NVIDIA-NeMo#5247) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Update ASR scores table (NVIDIA-NeMo#5254) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Fix links to speaker identification notebook (NVIDIA-NeMo#5260) Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> * Minor typo fixes in TTS tutorial (NVIDIA-NeMo#5266) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Pcla tutorial fixes (NVIDIA-NeMo#5271) * Fixed typos Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed cell type and tatoeba reference Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed typo Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed branch variable Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix bug into Dialogue tutorial (NVIDIA-NeMo#5277) * Typo fix (NVIDIA-NeMo#5288) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix dialogue tutorial bug (NVIDIA-NeMo#5297) * set add_pooling_layer=False for huggingface bert model * remove add_pooling_layer=False and set find_unused_parameters=True * set num_prompt_tokens to 0 for huggingface * small bugfix for r1.13.0 (NVIDIA-NeMo#5310) * typo fix Signed-off-by: fayejf <fayejf07@gmail.com> * udpate transcribe Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: fayejf <fayejf07@gmail.com> * Add italian model checkpoints (NVIDIA-NeMo#5316) Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> * [STT] Add Ru ASR Conformer-CTC and Conformer-Transducer (NVIDIA-NeMo#5340) * [STT] Add stt_ru_conformer_ctc_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [STT] Add stt_ru_conformer_transducer_large Add stt_ru_conformer_transducer_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Pcla tutorial fixes (NVIDIA-NeMo#5313) * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * moved `create_text_and_labels` to token_classification_utils.py Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * a lot of refactoring * strict ptl version * strict ptl version * reverted plt version * Added base text2audio class * Fix issue with HF Model upload tutorial (NVIDIA-NeMo#5359) * Add Gradio App to ASR Docs (NVIDIA-NeMo#5270) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> (cherry picked from commit e4b6a38) * Fix issue with normalized config for dataset name Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * tutorial fixes (NVIDIA-NeMo#5354) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Add SDP documentation (NVIDIA-NeMo#5274) * Add details to SDP README.md Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to WriteManifest processor Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to CreateInitialManifestMLS Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ModifyManifestTextProcessor docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ASRInference docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add base_processor docstrings Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add minimal SDP docs page Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Update tools/speech_dataset_processor/README.md Co-authored-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> * Write simple README for SDP and move complex explanations to docs Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove incorrect type hints Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Make config example less confusing Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Fix typo Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Clarify that YAML file is config file in README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove unused imports Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove SDP docs for now Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove links to docs in SDP README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> * [Bugfix] Added rm -f / wget- nc command in multispeaker sim notebook to r1.13.0 (NVIDIA-NeMo#5375) * Fix minor error in notebook Signed-off-by: Taejin Park <tango4j@gmail.com> * changed branch name in tutorial notebook Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Taejin Park <tango4j@gmail.com> * Rename Speech Dataset Processor to Speech Data Processor (NVIDIA-NeMo#5378) Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * fix for num worker 0 causing issues in losses after 1 epoch (NVIDIA-NeMo#5379) * Fixed bug in notebook (NVIDIA-NeMo#5382) Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> * Force MHA QKV onto fp32 (NVIDIA-NeMo#5391) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Added scheduling variety * ref * Fix for prompt table restore error (NVIDIA-NeMo#5393) * Fix for prompt table restore error Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added more saftey checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Added more condition checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Virginia Adams <vadams@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Fix args (NVIDIA-NeMo#5410) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * bugfix * import tests * Add temporary fix for CUDA issue in Dockerfile (NVIDIA-NeMo#5421) Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> * Megatron Export Update (NVIDIA-NeMo#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added comment Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * updated del onnx_model place Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> * disable pc test (NVIDIA-NeMo#5426) Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: ekmb <ebakhturina@nvidia.com> * Fix GPT generation when using sentencepiece tokenizer (NVIDIA-NeMo#5413) * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Disable sync_batch_comm in validation_step for GPT (NVIDIA-NeMo#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <complex451@gmail.com> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Empty Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Comment out test Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> * Revert "Add temporary fix for CUDA issue in Dockerfile (NVIDIA-NeMo#5421)" (NVIDIA-NeMo#5431) This reverts commit 0718b17. * Revert workaround for T5 that sets number of workers to 0 & sync_batch_comm=False (NVIDIA-NeMo#5420) * Revert workers workaround Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix in config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Fixed discrepancies * updated Jenkisfile * updated Jenkisfile * Cleaning * fixed the onnx bug in conformer for non-streaming models. (NVIDIA-NeMo#5242) (NVIDIA-NeMo#5446) Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> * Set sync_batch_comm in other places (NVIDIA-NeMo#5448) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Radtts 1.13 (NVIDIA-NeMo#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (NVIDIA-NeMo#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Radtts 1.13 plus (NVIDIA-NeMo#5457) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (NVIDIA-NeMo#5358) * Fixing RADTTS training - removing view buffer and fixing accuracy issue * Fixes for Torchscript/Triton * Added autocast to radtts UT * using cuda() for training example Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Add num layers check (NVIDIA-NeMo#5470) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Change to kwargs (NVIDIA-NeMo#5475) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (NVIDIA-NeMo#5339) (NVIDIA-NeMo#5478) * Initial refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for eval Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove comments Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Minor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove old comment Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (NVIDIA-NeMo#5480) * updated export_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (NVIDIA-NeMo#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * minor bug fix (NVIDIA-NeMo#5521) Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added set_start_method + function param bugfix (NVIDIA-NeMo#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <complex451@gmail.com> * remove notebook (NVIDIA-NeMo#5548) Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: ericharper <complex451@gmail.com> * Remove broadcast (NVIDIA-NeMo#5558) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * cleaning * Fix all gather while writing to a file during T5 finetuning (NVIDIA-NeMo#5561) * Gather from data parallel only instead of all ranks Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update readme Signed-off-by: ericharper <complex451@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added copyright * fixed imports * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * cleaning * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fixed filesize check * last cleaning Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated cmudict path * fixed merge bug Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * warnings fix * fix warnings Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * storing * updated version Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * update Jenkinsfile versions Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed issues Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed more issues * more fixes Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * added experimental tag * Clarification updates Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * docstring fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * imports fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix typo Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * excessive comtutations fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * typecheck fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Small refactoring * Small refactoring Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * reversed exp_manager params Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Fixed call for new function signature Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> Signed-off-by: Jason <jasoli@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: PeganovAnton <peganoff2@mail.ru> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> Co-authored-by: jasonjjl1999 <jasonjjl1999@gmail.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Oktai Tatanov <oktai.tatanov@gmail.com> Co-authored-by: jasonjjl1999 <43978361+jasonjjl1999@users.noreply.github.com> Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: Jason <jasoli@nvidia.com> Co-authored-by: ericharper <complex451@gmail.com> Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: Ramanathan Arunachalam <ramanathan.arun@rutgers.edu> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Evelina <10428420+ekmb@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: Jocelyn <jocelynh@nvidia.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Sean Naren <snarenthiran@nvidia.com> Co-authored-by: Matvei Novikov <mattyson.so@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> Co-authored-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Taejin Park <tango4j@gmail.com> Co-authored-by: Adi Renduchintala <108822655+arendu@users.noreply.github.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: yaoyu-33 <54727607+yaoyu-33@users.noreply.github.com> Co-authored-by: David <amosalla@asu.edu> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
ericharper
added a commit
that referenced
this pull request
Jan 31, 2023
* Merge r1.13.0 main (#5570)
* update branch
Signed-off-by: ericharper <complex451@gmail.com>
* Rename Speech Dataset Processor to Speech Data Processor (#5378)
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Megatron Export Update (#5343)
* export update for Megatron + change ORT optimization
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* updated export_utils to use autocast instead of manually casting >:/
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* removed dtype from LayerNorm
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* added comment
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* reverting changes on FloatCast
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* Cherry-picked changes from megatron-norm
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
* updated asr_model import to cast_utils
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* updated del onnx_model place
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* changed ort optimization to basic -> temp fix
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com>
* Disable sync_batch_comm in validation_step for GPT (#5397)
* disable sync_batch_comm in validation_step
Signed-off-by: ericharper <complex451@gmail.com>
* Read sync_batch_comm from config or default to False
Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com>
* Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error
Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com>
* Empty
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Comment out test
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com>
Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
* Radtts 1.13 (#5451)
* [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358)
* [TTS] add CI test for RADTTS training recipe.
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
* Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) (#5478)
* Initial refactor
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Resolve config before passing to load_from_checkpoint
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fixes for model parallel and nemo restore
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fixes for eval
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Revert config changes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Refactor
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix typo
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Remove comments
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Minor
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix validation reconfiguration
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Remove old comment
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fixes for test_ds
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* export_utils bugfix (#5480)
* updated export_utils
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* Export fixes for Riva (#5496)
* Export fixes for Riva
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
* Cleaning up training_utils
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
* added set_start_method + function param bugfix (#5539)
* added set_start_method + function param bugfix
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* upper bound torchmetrics
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Signed-off-by: ericharper <complex451@gmail.com>
Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: ericharper <complex451@gmail.com>
* remove notebook (#5548)
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: ericharper <complex451@gmail.com>
* update readme
Signed-off-by: ericharper <complex451@gmail.com>
* update branch
Signed-off-by: ericharper <complex451@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com>
Co-authored-by: David <amosalla@asu.edu>
Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com>
Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com>
Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Optimized loop and bugfix in SDE (#5573)
- Fixed bug with loading custom data attributes from JSON in Speech Data Explorer
Signed-off-by: George Zelenfroynd <gzelenfroind@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Update torchmetrics (#5566)
* add task arg
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
* update state
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
Co-authored-by: Taejin Park <tango4j@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* remove useless files. (#5580)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* add initial NFA code
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make use of the specified device during viterbi decoding
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fix CodeQL notes
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fix CodeQL warning
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add an option to defer data setup from ``__init__`` to ``setup`` (#5569)
* Add an option to defer dataloader setup from __init__ to setup
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Updated doc
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make utt_id specified by number of parts of audio_filepath user wishes to use
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* remove audio_sr TODO - reduce risk of silent bugs
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add check that model is CTC
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Remove unused import
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Text generation improvement (UI client, data parallel support) (#5437)
* Squashed commit of the following:
commit a5e124f34be31bd6eafe5e5fdf5bedcd0d50915c
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Thu Oct 13 15:07:42 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit 35b424044fe80c3081e7756ab21244f701716f7e
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 08:04:49 2022 -0700
get rid of base
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 2955210e2311791543538cfbb5ad26b79414c954
Merge: d52edef8c eaf6757ca
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 13:17:02 2022 +0000
Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt
commit d52edef8cd7b36593838fb270047e80f8ccb652e
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 13:16:24 2022 +0000
align with main
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit eaf6757ca5be8e099492f57c81d984429b0ad49c
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Thu Oct 13 13:12:11 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit c4b86d97626ea0721bf8fb4c0a45dec5becc94c9
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 13:10:58 2022 +0000
same as main
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit e335de51bcc0d681c58b568c3d8c238bc5687c3b
Merge: c231086e0 4463a9fe9
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 13:08:09 2022 +0000
Merge branch 'main' into universal_prompt
commit c231086e057f1efaa915f691d84664cb3d5aad85
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Wed Oct 12 19:59:12 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit 6a821a4b49a23dd3408a706a2a3dd393149b0bb1
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 19:56:17 2022 +0000
default to pad
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 9d908e39fef1beed9ba2da4d1a6806161eb7ef25
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 19:55:44 2022 +0000
add the option to pad the tokens
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 876dc395b43fdeeaa2bcbbe13c76523633764c33
Merge: fbb0f4035 fe3c77ee9
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 19:20:47 2022 +0000
Merge branch 'fix_global_init' into universal_prompt
commit fe3c77ee93ab6cf3ea152db68cb6beefcac2a392
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 18:59:49 2022 +0000
fix import again
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit fbb0f4035c6cd6bfefed50a20605503de8c1dccb
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Wed Oct 12 16:00:24 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit 372ca8c0d7988f2339b15888dc72aa21f4fb6937
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 15:58:32 2022 +0000
enable server
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit cbe05d9fbc978f812cfbb671f45f147f300713c4
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 13:07:28 2022 +0000
fix comment error
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 1948048922e726ec6131e44b1a745389f18d4ef2
Merge: 232c2cce3 984f5c09a
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 13:05:30 2022 +0000
Merge branch 'fix_global_init' into universal_prompt
commit 232c2cce34d7a8b902da406706f3dd9b39475091
Merge: 34c8a68df 658243fb6
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 12:50:00 2022 +0000
Merge branch 'fix_global_init' into universal_prompt
commit 984f5c09a6dbf1d1fb5aa30ed9b0df188e66a50f
Merge: 658243fb6 3fda5de46
Author: Yi Dong <43824965+yidong72@users.noreply.github.com>
Date: Wed Oct 12 08:42:11 2022 -0400
Merge branch 'main' into fix_global_init
commit 658243fb6580191b5d60edd30cde16dcc23cbb85
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 12 12:40:57 2022 +0000
fix import error
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 8e0fe1cad05ec288ec122b3cd0e139a96872e08c
Author: Yi Dong <doyend@gmail.com>
Date: Tue Oct 11 22:44:12 2022 +0000
update the fused kernel
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 536cf6bef9447b75843fad630729c47a2fba35f3
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 11 14:44:52 2022 -0700
add the missing file
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 1b437ec41dc5e354453ce0a089bca0171cbcb6c2
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 11 14:43:14 2022 -0700
fix fused softmax
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 7813f60e05f9783af61f8c14ec1cb0c6c4f1f263
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 11 14:16:48 2022 -0700
move global step to base
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 34c8a68df084b18d377e84415d9f07b2cd6673dd
Author: Yi Dong <doyend@gmail.com>
Date: Thu Oct 6 13:50:11 2022 +0000
fix pipeline for eval
Signed-off-by: Yi Dong <doyend@gmail.com>
commit eee5d38218f26660c3ffebe9f615c850c80a1f0d
Author: Yi Dong <doyend@gmail.com>
Date: Thu Oct 6 13:48:22 2022 +0000
fix for pipleline parallel
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 323bca73e7ef6099ee79c0a2fffac7b709ed6c5d
Merge: 125e49947 e3b4c4d1f
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 19:29:13 2022 +0000
Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt
commit 125e4994760448ff75dd9328395813eda1c87547
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 19:29:04 2022 +0000
add share option
Signed-off-by: Yi Dong <doyend@gmail.com>
commit e3b4c4d1f7346c9fa596f3cca6d4df0a9e05c368
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 5 11:43:48 2022 -0700
make sure consolidation works
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit a5c833964ecf05dc460ca1da69275c4019742150
Merge: 2a07ab52d abcb74be2
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 18:40:29 2022 +0000
Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt
commit 2a07ab52d95f15ba666823028c69e23825666c05
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 18:40:23 2022 +0000
added requirement
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 3abecd9dd1611993a87c537636abe7f7e6a9b04c
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 18:39:42 2022 +0000
added a simple web server
Signed-off-by: Yi Dong <doyend@gmail.com>
commit abcb74be2caf1cdec40eb9ba2be4dde4d45a3b4b
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 5 06:54:12 2022 -0700
fix empty val loss
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit b8eb92ac4a0d665570af75e34c9ba3c2e2420c26
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 19:25:30 2022 -0700
text gen working
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit d59f3e3f3a6fd19736d1c5706fed65a3dd4049ba
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 16:08:40 2022 -0700
first change
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 59d077585e6962a669b824af58f64e8a0bea6547
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 15:00:40 2022 -0700
revert
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 12a0f3902d99e9179403644bd951c045df716ca7
Author: Yi Dong <doyend@gmail.com>
Date: Tue Oct 4 21:26:23 2022 +0000
init imp
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 62a15dfd943cc48be495ac61b9f2f00995775c5f
Merge: 82c90d2cd e0cc6b767
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 11:58:26 2022 -0700
Merge branch 'main' into universal_prompt
commit 82c90d2cd0fd156f16a4b899f8c741d598f33990
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 11:17:13 2022 -0700
add sync
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 9819b703eef877d90cd1257bf3610c69de9b4d7e
Author: Yi Dong <yidong@nvidia.com>
Date: Sun Oct 2 17:52:34 2022 -0700
fix save model
Signed-off-by: root <root@luna-0197.selene.nvidia.com>
commit e4937e2fc5fb7d70754c97668416e4a69c3079fe
Author: Yi Dong <yidong@nvidia.com>
Date: Sat Oct 1 18:56:09 2022 +0000
working
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit b73b06d1c7cf5417a6d87cb33d8ed83a57e38b7b
Author: Yi Dong <yidong@nvidia.com>
Date: Sat Oct 1 17:34:03 2022 +0000
calcuate the mask
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 9db3bc13eb65a94a475b837603351da68e3745bc
Author: Yi Dong <yidong@nvidia.com>
Date: Fri Sep 30 23:26:32 2022 +0000
fix bug in datasets
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit f289900375d4412f53f8110be00fec6587627550
Author: Yi Dong <yidong@nvidia.com>
Date: Fri Sep 30 22:29:40 2022 +0000
update the code
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 8e28a1f208aabaab72dbe769e72756baada04d99
Author: Yi Dong <yidong@nvidia.com>
Date: Fri Sep 30 21:52:52 2022 +0000
added new ds
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 8d41315bab7ce90e200a8a7d1023c34f8e046897
Author: Yi Dong <doyend@gmail.com>
Date: Fri Sep 30 18:57:09 2022 +0000
added new files
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 984e0e94e15e16323c1ba1ca2efeabd84f69463f
Merge: cbe8b7ab1 fa6cd8588
Author: Yi Dong <doyend@gmail.com>
Date: Thu Sep 29 21:43:29 2022 +0000
Merge branch 'llm-prompt-learning-improvements' into universal_prompt
commit fa6cd858839277939446afe7275976078d54c512
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Thu Sep 29 16:47:30 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit 78ba46e5d6fde1be53c08e1e30a54cce59824be0
Merge: 7d6d46742 8d670bc77
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Thu Sep 29 09:43:27 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit 7d6d46742170a66758287a207d67e1b1bfd15613
Author: Virginia Adams <vadams@nvidia.com>
Date: Thu Sep 29 16:42:43 2022 +0000
Removed inference step and added sentence peice check to predict step
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 20fd265acd6f7f9912cf52155fe66ccfa6b201a2
Author: Virginia Adams <vadams@nvidia.com>
Date: Thu Sep 29 15:26:32 2022 +0000
fixed first stage check for pipeline parallel T5 pt
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 3637be2b258c8d9028856f9971edb7da4a8121f0
Merge: a3ea722fd 986a76612
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Wed Sep 28 10:23:30 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit a3ea722fdc12fbcc5989b76ef5643a574b763bc4
Merge: 770967a52 971485ce7
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Mon Sep 26 13:35:52 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit 770967a5251a474b6dcc2d44bf9a2076adbcb604
Merge: d23bf6c30 e3ac280a8
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Mon Sep 26 10:17:03 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit d23bf6c30acc0e3f6af9b4e24547669866a34d62
Merge: de6a31651 333d2b749
Author: Virginia Adams <vadams@nvidia.com>
Date: Mon Sep 26 10:05:16 2022 -0700
Merge branch 'llm-prompt-learning-improvements' of https://github.com/NVIDIA/NeMo into llm-prompt-learning-improvements
commit de6a31651e63d88a42b971794d93f18ff5a3cdff
Author: Virginia Adams <vadams@nvidia.com>
Date: Mon Sep 26 17:00:53 2022 +0000
Updated PP check to be on first stage pipeline only
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 333d2b7498e6742ce66436f733c980a74616900c
Merge: 592c0986a a39fc925a
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Fri Sep 23 16:11:21 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit 592c0986a476a91b57b8605d7b70830d7acfa021
Author: Virginia Adams <vadams@nvidia.com>
Date: Fri Sep 23 23:08:41 2022 +0000
Fixed unused import and CI test bug
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit ea9cd82d85638bc60ae4ad7ef105db931c8e3455
Merge: ce4b72c8c b566c2d0e
Author: Virginia Adams <vadams@nvidia.com>
Date: Fri Sep 23 18:57:25 2022 +0000
Merge branch 'llm-prompt-learning-improvements' of https://github.com/NVIDIA/NeMo into llm-prompt-learning-improvements
commit ce4b72c8c52f32be336e323dd78a38089edc3e7c
Author: Virginia Adams <vadams@nvidia.com>
Date: Fri Sep 23 18:57:16 2022 +0000
Switch to import from base class
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit b566c2d0e35a068f758fd1310bc620a47be4590b
Merge: 6621f2854 e872061ac
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Fri Sep 23 10:09:03 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit 6621f28543828a48484a5637f6c9f3ccb23a5b02
Author: Virginia Adams <vadams@nvidia.com>
Date: Wed Sep 14 20:47:35 2022 +0000
python format fix
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 8deafc8987b6af5f7b99a250310f57a40198c37f
Author: Virginia Adams <vadams@nvidia.com>
Date: Wed Sep 14 20:28:02 2022 +0000
Save .nemo on new best val score
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 761bd36969cb465d6a129e9eee6ce1f883d3cf41
Author: Virginia Adams <vadams@nvidia.com>
Date: Wed Sep 14 18:03:19 2022 +0000
Added automatic checkpoint to nemo file method
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 3be4ed57b6cd3ddfe4876d78650dfe8fe794598b
Author: Virginia Adams <vadams@nvidia.com>
Date: Wed Sep 14 02:11:56 2022 +0000
Make GPT use base prompt learning model class:
Signed-off-by: Virginia Adams <vadams@nvidia.com>
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix LGTM
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix validation
Signed-off-by: Yi Dong <yidong@nvidia.com>
* change for the lm eval
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* make text generation work in data parallel environment
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* implement the service with rest service
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* surpress log
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fix
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fixes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Update config
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Restore function needed for NMT
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* handles no answer only
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Fix config
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* added knn to web
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix lgtm.com comments
Signed-off-by: Yi Dong <yidong@nvidia.com>
* output the retrieved context
Signed-off-by: Yi Dong <yidong@nvidia.com>
* allow no neighbor query
Signed-off-by: Yi Dong <yidong@nvidia.com>
* remove the imports
Signed-off-by: Yi Dong <yidong@nvidia.com>
* warn only once
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Change output file format from JSON to JSONL
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* new t0 dataset
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Add T0 data preproc scripts
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Merge and multiprocessing
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix for is_correct
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* fix epoch > 2
Signed-off-by: Yi Dong <yidong@nvidia.com>
* handles multiple dataloader
Signed-off-by: Yi Dong <yidong@nvidia.com>
* remove template
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Refactor T0 dataset
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Add script to merge train folder into individual training files to minimize number of blends
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* added on the fly service
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add combo instance
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* added combo service
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* send weights back to server
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* fix index store
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Minor changes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add reset button
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add add eos
Signed-off-by: Yi Dong <yidong@nvidia.com>
* use a seperate bert service
Signed-off-by: Yi Dong <yidong@nvidia.com>
* no loss of accuracy
Signed-off-by: Yi Dong <yidong@nvidia.com>
* pin the gradio version
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Remove bin compat
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fix header lines
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* evaluate based on text generation
Signed-off-by: Yi Dong <yidong@nvidia.com>
* exact match result aggregation
Signed-off-by: Yi Dong <yidong@nvidia.com>
* working SP and SA
Signed-off-by: Yi Dong <yidong@nvidia.com>
* sync
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix checkpoint
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix eval
Signed-off-by: Yi Dong <yidong@nvidia.com>
* backup states
Signed-off-by: Yi Dong <yidong@nvidia.com>
* backup states reset
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix the bug
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix evaluation for sentence piece
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix a bug
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* potential fix in the future
Signed-off-by: Yi Dong <yidong@nvidia.com>
* remove the universal codes
Signed-off-by: Yi Dong <yidong@nvidia.com>
* remove universal strategy
Signed-off-by: Yi Dong <yidong@nvidia.com>
* address reviewer comment
Signed-off-by: Yi Dong <yidong@nvidia.com>
Signed-off-by: Yi Dong <yidong@nvidia.com>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add align function docstrings and make most args optional
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Remove redundant returns of viterbi and log probs matrices
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Rename h# to <initial_silence>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Update manifest format description in README
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* always remove any spaces from utt_id
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Patch the hanging of threads on very large stderr (#5589) (#5590)
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* O2 style amp for gpt3 ptuning (#5246)
* enable amp o2 plugin
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* only create master param if param requires gradient
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* remove pytorch autocast
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* Update optimizer_with_main_params.py
Signed-off-by: JimmyZhang12 <67203904+JimmyZhang12@users.noreply.github.com>
* create master grad only if param group requires grad
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* fix grad scaler for pp > 1
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
Signed-off-by: JimmyZhang12 <67203904+JimmyZhang12@users.noreply.github.com>
Co-authored-by: Jimmy Zhang <jiemingz@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Better patch hydra (#5591) (#5592)
* Readd buffereing and thread drain to Hydra Launcher
Signed-off-by: smajumdar <titu1994@gmail.com>
* Readd buffereing and thread drain to Hydra Launcher
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Yet another fix with hydra multirun (#5594) (#5595)
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add RETRO model documentation (#5578)
* added retro doc
Signed-off-by: Yi Dong <yidong@nvidia.com>
* finish data part
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added the data format
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added training script
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added training and evaluation steps
Signed-off-by: Yi Dong <yidong@nvidia.com>
* edit the text
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added the images
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix beginning
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix the grammar
Signed-off-by: Yi Dong <yidong@nvidia.com>
* trim it down
Signed-off-by: Yi Dong <yidong@nvidia.com>
* add wandb option
Signed-off-by: Yi Dong <yidong@nvidia.com>
* add reference
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix path
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added the parameters table
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix section
Signed-off-by: Yi Dong <yidong@nvidia.com>
Signed-off-by: Yi Dong <yidong@nvidia.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fix: setup_multiple validation/test data (#5585)
Fix: setup_multiple validation/test data (#5585)
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Move to optimizer based EMA implementation (#5169)
* Move to optimizer
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Fix replacing weights
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Allow swapping of weights be optional
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Save 2 models
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Use different hook
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Expose cpu device
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add clause to see if this fixes issue with O2 optimizer
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Try to get O2 working
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* WIP
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Fixes
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Fixes to tests
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add guard
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Remove import
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add guard
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add comment
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Remove overwrite
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add BatchNorm, currently tests fail
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Fix tests/functionality for batch norm
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Get rid of NLP changes
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* AIStore for ASR datasets (#5462)
AIStore for ASR datasets
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add support for MHA adapters to ASR (#5396)
* Convert AbstractAdapterModule to AbstractAdapterMixin
Signed-off-by: smajumdar <titu1994@gmail.com>
* Temporary fixes to new signature of mixin
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add adapter util for constants, add all mha adapters.
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update name of function
Signed-off-by: smajumdar <titu1994@gmail.com>
* Roll back changes to convASR
Signed-off-by: smajumdar <titu1994@gmail.com>
* Convert AbstractAdapterModule to AbstractAdapterMixin
Signed-off-by: smajumdar <titu1994@gmail.com>
* First draft of Conformer support for MHA attention
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add some preliminary tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add support for projection of the hidden dimension for attention
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add support for squeezeformer
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update train adapter config
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add tests for squeezeformer and unit tests for new modules
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update config for hp search,set limits on modules for conformer and squeezeformer, update adapter mixin, add cache to import_from_class_path
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update location of adapters
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add pre_norm for proper attention learning, Fix the issue with nan/inf in pos_bias_u and pos_bias_v
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update expmanager to clean up checkpoints
Signed-off-by: smajumdar <titu1994@gmail.com>
* Fix style
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add docstrings and update tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add docstrings and update tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add docstrings and update tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Update training scripts
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update config and docs
Signed-off-by: smajumdar <titu1994@gmail.com>
* Expose nemo delete function
Signed-off-by: smajumdar <titu1994@gmail.com>
* Correct adapter partial state saving
Signed-off-by: smajumdar <titu1994@gmail.com>
* Correct a bug with state management of adapter tokens
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Pull down EMA test
Signed-off-by: smajumdar <titu1994@gmail.com>
* Correct name of adapter module utility class
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Remove unused TTS eval functions w/ pesq and pystoi dependencies (#5605) (#5606)
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Co-authored-by: Jocelyn <jocelynh@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Create separator parameter
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Call align function with hydra config
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* update usage example
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Update Dockerfile (#5614) (#5616)
Pinned to use `numba==0.53.1` to avoid crashing in training with `num_workers > 0`. This is just a temporary workaround, still need to fix it in the future.
Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Co-authored-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make separate pretrained_name and model_path parameters
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* make "optional" tags bold in markdown
Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Move non-main functions to utils dir
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Temp workaround: Disable test with cache_audio=True since it is failing in CI (#5607) (#5615)
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Co-authored-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS] fix ranges of char set for accented letters. (#5607)
* [TTS] fix ranges of char set for accented letters.
* remove digits pattern and added unit tests for math operators.
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Change success message to reduce confusion (#5621)
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Update documentation and tutorials for Adapters (#5610)
* Improve docs for adapter and tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* Improve docs for adapter and tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Update test
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Rename test file
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS] add type hints and change varialbe names for tokenizers and g2p (#5602)
* [TTS] add type hints and change variable names for tokenizers and g2p
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* 1. Added missing import for gather_objects. (#5627)
Signed-off-by: Micha Livne <mlivne@nvidia.com>
Signed-off-by: Micha Livne <mlivne@nvidia.com>
Co-authored-by: Micha Livne <mlivne@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][ZH] add fastpitch and hifigan model NGC urls and update NeMo docs. (#5596) (#5625)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fixed RadTTS unit test (#5572)
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* remove tests (#5633)
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][DOC] add notes about automatic conversion to target sampling rates. (#5624) (#5634)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Conformer local attention (#5525)
* local attn and merge
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* optional
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* override
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* incorporate comments
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* update
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* fix
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* comment
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* changes, test
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* changes
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* check att context
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* readme link
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* utils
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* update
Signed-off-by: sam1373 <samuelkriman@gmail.com>
Signed-off-by: sam1373 <samuelkriman@gmail.com>
Signed-off-by: Samuel Kriman <samuelkriman@gmail.com>
Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add core classes and functions for online clustering diarizer part 1 (#5526)
* Add core classes and functions for online clustering diarizer
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add audio to labels code
Signed-off-by: Taejin Park <tango4j@gmail.com>
* resolve type errors
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* added unit=tests for very short audio
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Filled all missing docstrings
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* resolved conflict and added missing docstrings
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fixed unit-test errors
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* fix the wrongly added file - megatron_gpt_model.py
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Fix wrongly included file - megatron_gpt_model.py
Signed-off-by: Taejin Park <tango4j@gmail.com>
* resolve code quality issue
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Fixed unit-test errors and bugs
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* changed total_sec for offline_clustering toy_data in unit-tests
Signed-off-by: Taejin Park <tango4j@gmail.com>
* fixed merging index offset bug
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* only including part 1 files
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* removed unused function
Signed-off-by: Taejin Park <tango4j@gmail.com>
* fixed unused imports
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* divided nmesc_clustering.py into two and reflected first-pass comments
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* adding offline/online_clustering.py
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* fix code QL autocomment
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Removed unused imports
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Update nemo/collections/asr/parts/utils/online_clustering.py
Co-authored-by: Sean Naren <snarenthiran@nvidia.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Reflected comments
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* resolved code scanning issue
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Update nemo/collections/asr/parts/utils/offline_clustering.py
Co-authored-by: Sean Naren <snarenthiran@nvidia.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com>
Co-authored-by: Sean Naren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [STT] Add Esperanto (Eo) ASR Conformer-CTC and Conformer-Transducer models (#5639) (#5641)
* add stt_eo_conformer_ctc_large model
* stt_eo_conformer_transducer_large
Co-authored-by: Andrei Andrusenko <52885736+andrusenkoau@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Removed unused import
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Specify that filepaths need to be absolute
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* replaces any spaces in utt_id with dashes
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make hydra script callable by another script
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* do not specify default model or model_downsample_factor
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [Dockerfile] Remove AIS archive from docker image (#5629)
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Measure audio_sr from audio instead of needing to specify
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][ZH] Disambiguate polyphones with augmented dict and Jieba segmenter for Chinese FastPitch (#5541)
* Chinese TTS replaces default pypinyin dict
* Add jieba word segmenter as an option
Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>
Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make separate parameters for device of transcription and viterbi steps
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add mention of gecko
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [workflow] add exclude labels option to ignore cherry-picks in release changelog. (#5645)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][ZH] bugfix for the tutorial and add NGC CLI installation guide. (#5643) (#5647)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [Add] ASR+VAD Inference Pipeline (#5575)
Added offline ASR+VAD inference pipeline that matches with what's in RIVA, along with some feature-based ASR and classification datasets.
Signed-off-by: stevehuang52 <heh@nvidia.com>
Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* rename separator to ctm_grouping_separator and refactor
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Bert interleaved (#5556)
* Adding SP and SAR support Bert
* Adding Sequence parallel support to Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding Sequence parallel support to Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding SP and SAR support Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding SP and SAR support Bert
* Adding SP and SAR support Bert
* Adding Sequence parallel support to Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding Sequence parallel support to Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding Sequence parallel support to Bert
* Update bert_model.py
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
* Adding tests
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Addressing Eric's comments
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Addressing Eric's comments
* Fix bug fix sequence parallel and Interleaved
* Fix bug fix sequence parallel and Interleaved
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add duration padding support for RADTTS inference (#5650)
* Added duration padding support for RADTTS inference
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Co-authored-by: Kevin Shih <kshih@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add remove_blank_tokens_from_ctm parameter
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Dont save initial_silence line in CTM
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add DLLogger support to exp_manager (#5658)
* Add DLLogger support to exp_manager
Signed-off-by: Alexandre Milesi <milesial@users.noreply.github.com>
* Move dllogger to separate file and check import
Signed-off-by: Alexandre Milesi <milesial@users.noreply.github.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Remove unused import
Signed-off-by: Alexandre Milesi <milesial@users.noreply.github.com>
Signed-off-by: Alexandre Milesi <milesial@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* add minimum_timestamp_duration parameter
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* add suggestion about removing blanks to README
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* reorder args
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* clarify description of ctm_grouping_separator in README
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* update docstring
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][ZH] bugfix for ngc cli installation. (#5652) (#5664)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Port stateless timer to exp manager (#5584)
* Port stateless timer to exp manager
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fixes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fixes and remove from all megatron code
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fixes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Change message
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fix EMA restart by allowing device to be set by the class init (#5668)
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Remove SDP (moved to separate repo) - merge to main (#5630)
* Remove sdp files from tools folder
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add page to docs with new SDP location
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add interface for making amax reduction optional for FP8 (#5447)
* add TE interface for making amax reduction optional
Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS] add tts dict cust notebook (#5662)
* add tts dict cust notebook
Signed-off-by: ekmb <ebakhturina@nvidia.com>
* review
Signed-off-by: ekmb <ebakhturina@nvidia.com>
* fixed audio links
Signed-off-by: ekmb <ebakhturina@nvidia.com>
* remove old notebook
Signed-off-by: ekmb <ebakhturina@nvidia.com>
* fix typo
Signed-off-by: ekmb <ebakhturina@nvidia.com>
Signed-off-by: ekmb <ebakhturina@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [ASR] Audio processing base, multi-channel enhancement models (#5356)
* Audio processing base model, enc-mask-dec enhancement, tests and modules
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Addressed review comments
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Fixed CodeQL warnings
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Addressed PR comments
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Addressed PR comments:
- renamed AudioProcessingModel to AudioToAudioModel
- various small modifications
- updated unit tests
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Addressed comments
- Moved spectrogram to audio_preprocessing
- Renamed MultichannelFeatures
- Updated config and unit tests
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Expose ClusteringDiarizer device (#5681)
* Expose device for users to set
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Expose device for users to set
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add Beam Search support to ASR transcribe() (#5443)
* Add support for beam decoding via high level API.
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add ctc decoding section
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update ctc transcribe API to return results from beam search
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add argument to preserve arpa file
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update script to use hydra config, add some support for future compute timesteps, add doc for ctc decoding
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update eval script and doc to use new API
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add tests for ctc greedy decoding
Signed-off-by: smajumdar <titu1994@gmail.com>
* Address reviewer comments and add docstrings
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix changes and address comments
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Samuel Kriman <samuelkriman@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Propagate attention_dropout flag for GPT-3 (#5669)
* Propagate attention_dropout flag for GPT-3
Signed-off-by: Mikołaj Błaż <mblaz@nvidia.com>
* Add default to megatron_gpt_config
Signed-off-by: Mikołaj Błaż <mblaz@nvidia.com>
Signed-off-by: Mikołaj Błaż <mblaz@nvidia.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Eric H…
ericharper
added a commit
that referenced
this pull request
Jan 31, 2023
* Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount * new structure for tts datasets in script folder Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * remove cmudict downloading Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * rename mixertts dataset, add vocoder dataset Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add libritts processing Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * update tts dataset and libritts get data Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix bugs in vocoder ds Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add ds * changed vits yaml * rm yaml * fix yaml and model * Added scaler * refactored yaml * managed to run in fp16 * refactoring Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix small bugs and add new todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix optimizers Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * Port Variational Inference with Adversarial Learning (VITS) to NeMo TTS (#6) * Add vits files Add vits_losses.py, vits_modules.py and vits.py. * Move non-vits models to modules * Add vits.yaml * Add _loader to vits.py * Add basic template for vits * Update vits.yaml with vits parameters * Remove extra space * Add top level training script * Add some variables to vits yaml * Add forward and training methods * Fix imports * Added validation step * Log training losses * Update loss calls to use class attributes * Add VITS to models list * Fix all imports * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Fix imports for VITS * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Add parameters from original VITS config * Fix config file * Fix imports and generate spec from audio * Fix incorrect dimensions * Progress update * Fix loss * Fix cuda thing * Fix monotonic align import * Fix typos in vits.py * Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Jason <jasoli@nvidia.com> Signed-off-by: Jason <jasoli@nvidia.com> * make new commit Signed-off-by: Jason <jasoli@nvidia.com> * add copyright headers Signed-off-by: Jason <jasoli@nvidia.com> * style Signed-off-by: Jason <jasoli@nvidia.com> * rename README Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix style without vits_modules Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add numba code, fix style and add todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * small fix * fix some todos * added numba mas * added DDP sampler * specified versions * fixed for new librosa version * added feature loss * added IPA phonemizer * refactored IPA g2p * added vits losses * some ref * fix * added checkpointing * cp * cfg * merged some 1.8.0 fixes * plt fix * fix logging * fix checkpoint loading * refactored inference * fp32 run * update branch Signed-off-by: ericharper <complex451@gmail.com> * update package info Signed-off-by: ericharper <complex451@gmail.com> * new exp * update branch Signed-off-by: ericharper <complex451@gmail.com> * Restored tests previously disabled for 22.03 base (#4109) Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * add augmentation to label models (#4113) * add augmentation to label models Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * duration fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Call register_bert_model after assigning self.bert_model variable (#4116) Signed-off-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> * Tutorial on ITN with Thutmose tagger and small fixes (#4117) * 1. Add tutorial. 2. Move a function to fix import in tutorial. 3. Merge multiple spaces into one space in the final output Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fixes for code review Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * Add tutorial to tutorials.rst Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update the default (#4135) Signed-off-by: ekmb <ebakhturina@nvidia.com> * Draft: Fix restoring from checkpoint for case when `model.common_dataset_parameters.label_vocab_dir` is provided (#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Do not create tmp directory Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix parameter name Signed-off-by: PeganovAnton <peganoff2@mail.ru> * finish cherry-pick op Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix labels errors Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove duplicate stage Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Change target branch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * fix typo (#4140) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Fix/punctuation avoid overwritting tmp files (#4144) * Add draft of fixing tmp files overwritting Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Use built-in tempfile library Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix code style Signed-off-by: PeganovAnton <peganoff2@mail.ru> * bug_fix_diarization_manifest_creation (#4125) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> * fix doc (#4146) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Tacotron2 retrain (#4103) * fix yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Fix for new TTSDataset class Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * inference fix Signed-off-by: treacker <emshabalin@yandex.ru> * removed old code Signed-off-by: treacker <emshabalin@yandex.ru> * updated parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * reverted version update Signed-off-by: treacker <emshabalin@yandex.ru> * refactored parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * Updated Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Update Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Update tacotron.yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: treacker <emshabalin@yandex.ru> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Fixed jenkins Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * Multiprocess improvements (#4127) * initial commit Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * start fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated scp to filelist Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * WaveGlow input type fixes (#4151) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * notebooks' link, typo and import fix (#4158) * redo missing pr 4007 Signed-off-by: fayejf <fayejf07@gmail.com> * remove extremely unreliable links Signed-off-by: fayejf <fayejf07@gmail.com> * Thutmose tagger bug fixes (#4162) * add pretrained ngc model, small fixes Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * 1. fix typos. 2. write magic functions without space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add example of inference with pretrained model Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * changed model location to nemo Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * style fix Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * update speaker docs (#4164) * update speaker docs Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * chunks -> segments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Khz -> kHz Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * changed to vits g2p * refactoring * added cosineLR * Updated whitelist path * added vanilla torch grad scaler * Fixed lightning version * added warmup and wd * switched to cosineLR * refactored data classes for vits * some fixes * fixed import * changeg train loop * fixed scheduler bug * refactoring for exps * Refactored loss logic * Ref for exps * added coqui stuff * exps * bugfix * added side file * bugfix * reverted * fixed sampler behaviour * updated for ptl 1.7.2 * refactored dataloader func * some cleaning * reverted to vanilla loss * modified for pickling * added dataset class * fixed torch version * added autocast for fp training * removed coqui files * 'Fixed tokenizer' * Fix tokenizer * update branch Signed-off-by: ericharper <complex451@gmail.com> * Fix link to inference notebook (#5247) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Update ASR scores table (#5254) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Fix links to speaker identification notebook (#5260) Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> * Minor typo fixes in TTS tutorial (#5266) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Pcla tutorial fixes (#5271) * Fixed typos Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed cell type and tatoeba reference Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed typo Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed branch variable Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix bug into Dialogue tutorial (#5277) * Typo fix (#5288) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix dialogue tutorial bug (#5297) * set add_pooling_layer=False for huggingface bert model * remove add_pooling_layer=False and set find_unused_parameters=True * set num_prompt_tokens to 0 for huggingface * small bugfix for r1.13.0 (#5310) * typo fix Signed-off-by: fayejf <fayejf07@gmail.com> * udpate transcribe Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: fayejf <fayejf07@gmail.com> * Add italian model checkpoints (#5316) Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> * [STT] Add Ru ASR Conformer-CTC and Conformer-Transducer (#5340) * [STT] Add stt_ru_conformer_ctc_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [STT] Add stt_ru_conformer_transducer_large Add stt_ru_conformer_transducer_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Pcla tutorial fixes (#5313) * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * moved `create_text_and_labels` to token_classification_utils.py Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * a lot of refactoring * strict ptl version * strict ptl version * reverted plt version * Added base text2audio class * Fix issue with HF Model upload tutorial (#5359) * Add Gradio App to ASR Docs (#5270) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> (cherry picked from commit e4b6a38) * Fix issue with normalized config for dataset name Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * tutorial fixes (#5354) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Add SDP documentation (#5274) * Add details to SDP README.md Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to WriteManifest processor Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to CreateInitialManifestMLS Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ModifyManifestTextProcessor docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ASRInference docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add base_processor docstrings Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add minimal SDP docs page Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Update tools/speech_dataset_processor/README.md Co-authored-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> * Write simple README for SDP and move complex explanations to docs Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove incorrect type hints Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Make config example less confusing Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Fix typo Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Clarify that YAML file is config file in README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove unused imports Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove SDP docs for now Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove links to docs in SDP README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> * [Bugfix] Added rm -f / wget- nc command in multispeaker sim notebook to r1.13.0 (#5375) * Fix minor error in notebook Signed-off-by: Taejin Park <tango4j@gmail.com> * changed branch name in tutorial notebook Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Taejin Park <tango4j@gmail.com> * Rename Speech Dataset Processor to Speech Data Processor (#5378) Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * fix for num worker 0 causing issues in losses after 1 epoch (#5379) * Fixed bug in notebook (#5382) Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> * Force MHA QKV onto fp32 (#5391) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Added scheduling variety * ref * Fix for prompt table restore error (#5393) * Fix for prompt table restore error Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added more saftey checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Added more condition checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Virginia Adams <vadams@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Fix args (#5410) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * bugfix * import tests * Add temporary fix for CUDA issue in Dockerfile (#5421) Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> * Megatron Export Update (#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added comment Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * updated del onnx_model place Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> * disable pc test (#5426) Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: ekmb <ebakhturina@nvidia.com> * Fix GPT generation when using sentencepiece tokenizer (#5413) * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Disable sync_batch_comm in validation_step for GPT (#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <complex451@gmail.com> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Empty Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Comment out test Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> * Revert "Add temporary fix for CUDA issue in Dockerfile (#5421)" (#5431) This reverts commit 0718b17. * Revert workaround for T5 that sets number of workers to 0 & sync_batch_comm=False (#5420) * Revert workers workaround Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix in config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Fixed discrepancies * updated Jenkisfile * updated Jenkisfile * Cleaning * fixed the onnx bug in conformer for non-streaming models. (#5242) (#5446) Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> * Set sync_batch_comm in other places (#5448) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Radtts 1.13 (#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Radtts 1.13 plus (#5457) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * Fixing RADTTS training - removing view buffer and fixing accuracy issue * Fixes for Torchscript/Triton * Added autocast to radtts UT * using cuda() for training example Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Add num layers check (#5470) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Change to kwargs (#5475) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) (#5478) * Initial refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for eval Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove comments Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Minor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove old comment Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (#5480) * updated export_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * minor bug fix (#5521) Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added set_start_method + function param bugfix (#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <complex451@gmail.com> * remove notebook (#5548) Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: ericharper <complex451@gmail.com> * Remove broadcast (#5558) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * cleaning * Fix all gather while writing to a file during T5 finetuning (#5561) * Gather from data parallel only instead of all ranks Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update readme Signed-off-by: ericharper <complex451@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added copyright * fixed imports * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * cleaning * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fixed filesize check * last cleaning Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated cmudict path * fixed merge bug Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * warnings fix * fix warnings Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * storing * updated version Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * update Jenkinsfile versions Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed issues Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed more issues * more fixes Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * added experimental tag * Clarification updates Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * docstring fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * imports fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix typo Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * excessive comtutations fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * typecheck fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Small refactoring * Small refactoring Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * reversed exp_manager params Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Fixed call for new function signature Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> Signed-off-by: Jason <jasoli@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: PeganovAnton <peganoff2@mail.ru> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> Co-authored-by: jasonjjl1999 <jasonjjl1999@gmail.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Oktai Tatanov <oktai.tatanov@gmail.com> Co-authored-by: jasonjjl1999 <43978361+jasonjjl1999@users.noreply.github.com> Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: Jason <jasoli@nvidia.com> Co-authored-by: ericharper <complex451@gmail.com> Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: Ramanathan Arunachalam <ramanathan.arun@rutgers.edu> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Evelina <10428420+ekmb@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: Jocelyn <jocelynh@nvidia.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Sean Naren <snarenthiran@nvidia.com> Co-authored-by: Matvei Novikov <mattyson.so@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> Co-authored-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Taejin Park <tango4j@gmail.com> Co-authored-by: Adi Renduchintala <108822655+arendu@users.noreply.github.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: yaoyu-33 <54727607+yaoyu-33@users.noreply.github.com> Co-authored-by: David <amosalla@asu.edu> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Kipok
added a commit
to Kipok/NeMo
that referenced
this pull request
Jan 31, 2023
* Flashlight Decoder for Nemo (#5790)
* Added initial flashlight decoding files
Signed-off-by: Daniel Egert <degert@nvidia.com>
* Fixed some minor bugs
Signed-off-by: Daniel Egert <degert@nvidia.com>
* Added fixes from auto style thingamajig
Signed-off-by: Daniel Egert <degert@nvidia.com>
* Ran pre-commit and fixed script file formatting
Signed-off-by: Daniel Egert <degert@nvidia.com>
* Minor fixes brought up during PR review
Signed-off-by: Daniel Egert <degert@nvidia.com>
* Added Windows-compatible code to eval_beamsearch_ngram.py
Signed-off-by: Daniel Egert <degert@nvidia.com>
* Added initial flashlight decoding files
Signed-off-by: Daniel Egert <degert@nvidia.com>
* Fixed some minor bugs
Signed-off-by: Daniel Egert <degert@nvidia.com>
* Added fixes from auto style thingamajig
Signed-off-by: Daniel Egert <degert@nvidia.com>
* Ran pre-commit and fixed script file formatting
Signed-off-by: Daniel Egert <degert@nvidia.com>
* Minor fixes brought up during PR review
Signed-off-by: Daniel Egert <degert@nvidia.com>
* Added Windows-compatible code to eval_beamsearch_ngram.py
Signed-off-by: Daniel Egert <degert@nvidia.com>
Signed-off-by: Daniel Egert <degert@nvidia.com>
Co-authored-by: Daniel Egert <degert@nvidia.com>
* update torchmetrics to latest version (#5801) (#5802)
* update torchmetrics to latest version
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
* pin min version for metrics
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com>
* Update radtts' infer path (#5788)
* Update radtts' infer path
Signed-off-by: Jason <jasoli@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* update to newer code
Signed-off-by: Jason <jasoli@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add import
Signed-off-by: Jason <jasoli@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* move code to model
Signed-off-by: Jason <jasoli@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add input and output types property
Signed-off-by: Jason <jasoli@nvidia.com>
* remove comment
Signed-off-by: Jason <jasoli@nvidia.com>
* update volume
Signed-off-by: Jason <jasoli@nvidia.com>
* align volume
Signed-off-by: Jason <jasoli@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Jason <jasoli@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com>
* add constraint info on batch size for tar dataset (#5812)
* add constraint info on batch size for tar dataset
Signed-off-by: Yang Zhang <yangzhang@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* style fix
Signed-off-by: Yang Zhang <yangzhang@nvidia.com>
Signed-off-by: Yang Zhang <yangzhang@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* NeMo Forced Aligner (#5571)
* Merge r1.13.0 main (#5570)
* update branch
Signed-off-by: ericharper <complex451@gmail.com>
* Rename Speech Dataset Processor to Speech Data Processor (#5378)
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Megatron Export Update (#5343)
* export update for Megatron + change ORT optimization
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* updated export_utils to use autocast instead of manually casting >:/
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* removed dtype from LayerNorm
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* added comment
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* reverting changes on FloatCast
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* Cherry-picked changes from megatron-norm
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
* updated asr_model import to cast_utils
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* updated del onnx_model place
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* changed ort optimization to basic -> temp fix
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com>
* Disable sync_batch_comm in validation_step for GPT (#5397)
* disable sync_batch_comm in validation_step
Signed-off-by: ericharper <complex451@gmail.com>
* Read sync_batch_comm from config or default to False
Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com>
* Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error
Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com>
* Empty
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Comment out test
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com>
Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
* Radtts 1.13 (#5451)
* [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358)
* [TTS] add CI test for RADTTS training recipe.
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
* Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) (#5478)
* Initial refactor
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Resolve config before passing to load_from_checkpoint
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fixes for model parallel and nemo restore
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fixes for eval
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Revert config changes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Refactor
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix typo
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Remove comments
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Minor
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix validation reconfiguration
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Remove old comment
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fixes for test_ds
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* export_utils bugfix (#5480)
* updated export_utils
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* Export fixes for Riva (#5496)
* Export fixes for Riva
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
* Cleaning up training_utils
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
* added set_start_method + function param bugfix (#5539)
* added set_start_method + function param bugfix
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* upper bound torchmetrics
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Signed-off-by: ericharper <complex451@gmail.com>
Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: ericharper <complex451@gmail.com>
* remove notebook (#5548)
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: ericharper <complex451@gmail.com>
* update readme
Signed-off-by: ericharper <complex451@gmail.com>
* update branch
Signed-off-by: ericharper <complex451@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com>
Co-authored-by: David <amosalla@asu.edu>
Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com>
Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com>
Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Optimized loop and bugfix in SDE (#5573)
- Fixed bug with loading custom data attributes from JSON in Speech Data Explorer
Signed-off-by: George Zelenfroynd <gzelenfroind@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Update torchmetrics (#5566)
* add task arg
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
* update state
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
Co-authored-by: Taejin Park <tango4j@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* remove useless files. (#5580)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* add initial NFA code
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make use of the specified device during viterbi decoding
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fix CodeQL notes
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fix CodeQL warning
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add an option to defer data setup from ``__init__`` to ``setup`` (#5569)
* Add an option to defer dataloader setup from __init__ to setup
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Updated doc
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make utt_id specified by number of parts of audio_filepath user wishes to use
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* remove audio_sr TODO - reduce risk of silent bugs
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add check that model is CTC
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Remove unused import
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Text generation improvement (UI client, data parallel support) (#5437)
* Squashed commit of the following:
commit a5e124f34be31bd6eafe5e5fdf5bedcd0d50915c
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Thu Oct 13 15:07:42 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit 35b424044fe80c3081e7756ab21244f701716f7e
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 08:04:49 2022 -0700
get rid of base
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 2955210e2311791543538cfbb5ad26b79414c954
Merge: d52edef8c eaf6757ca
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 13:17:02 2022 +0000
Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt
commit d52edef8cd7b36593838fb270047e80f8ccb652e
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 13:16:24 2022 +0000
align with main
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit eaf6757ca5be8e099492f57c81d984429b0ad49c
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Thu Oct 13 13:12:11 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit c4b86d97626ea0721bf8fb4c0a45dec5becc94c9
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 13:10:58 2022 +0000
same as main
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit e335de51bcc0d681c58b568c3d8c238bc5687c3b
Merge: c231086e0 4463a9fe9
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 13:08:09 2022 +0000
Merge branch 'main' into universal_prompt
commit c231086e057f1efaa915f691d84664cb3d5aad85
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Wed Oct 12 19:59:12 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit 6a821a4b49a23dd3408a706a2a3dd393149b0bb1
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 19:56:17 2022 +0000
default to pad
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 9d908e39fef1beed9ba2da4d1a6806161eb7ef25
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 19:55:44 2022 +0000
add the option to pad the tokens
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 876dc395b43fdeeaa2bcbbe13c76523633764c33
Merge: fbb0f4035 fe3c77ee9
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 19:20:47 2022 +0000
Merge branch 'fix_global_init' into universal_prompt
commit fe3c77ee93ab6cf3ea152db68cb6beefcac2a392
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 18:59:49 2022 +0000
fix import again
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit fbb0f4035c6cd6bfefed50a20605503de8c1dccb
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Wed Oct 12 16:00:24 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit 372ca8c0d7988f2339b15888dc72aa21f4fb6937
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 15:58:32 2022 +0000
enable server
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit cbe05d9fbc978f812cfbb671f45f147f300713c4
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 13:07:28 2022 +0000
fix comment error
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 1948048922e726ec6131e44b1a745389f18d4ef2
Merge: 232c2cce3 984f5c09a
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 13:05:30 2022 +0000
Merge branch 'fix_global_init' into universal_prompt
commit 232c2cce34d7a8b902da406706f3dd9b39475091
Merge: 34c8a68df 658243fb6
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 12:50:00 2022 +0000
Merge branch 'fix_global_init' into universal_prompt
commit 984f5c09a6dbf1d1fb5aa30ed9b0df188e66a50f
Merge: 658243fb6 3fda5de46
Author: Yi Dong <43824965+yidong72@users.noreply.github.com>
Date: Wed Oct 12 08:42:11 2022 -0400
Merge branch 'main' into fix_global_init
commit 658243fb6580191b5d60edd30cde16dcc23cbb85
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 12 12:40:57 2022 +0000
fix import error
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 8e0fe1cad05ec288ec122b3cd0e139a96872e08c
Author: Yi Dong <doyend@gmail.com>
Date: Tue Oct 11 22:44:12 2022 +0000
update the fused kernel
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 536cf6bef9447b75843fad630729c47a2fba35f3
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 11 14:44:52 2022 -0700
add the missing file
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 1b437ec41dc5e354453ce0a089bca0171cbcb6c2
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 11 14:43:14 2022 -0700
fix fused softmax
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 7813f60e05f9783af61f8c14ec1cb0c6c4f1f263
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 11 14:16:48 2022 -0700
move global step to base
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 34c8a68df084b18d377e84415d9f07b2cd6673dd
Author: Yi Dong <doyend@gmail.com>
Date: Thu Oct 6 13:50:11 2022 +0000
fix pipeline for eval
Signed-off-by: Yi Dong <doyend@gmail.com>
commit eee5d38218f26660c3ffebe9f615c850c80a1f0d
Author: Yi Dong <doyend@gmail.com>
Date: Thu Oct 6 13:48:22 2022 +0000
fix for pipleline parallel
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 323bca73e7ef6099ee79c0a2fffac7b709ed6c5d
Merge: 125e49947 e3b4c4d1f
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 19:29:13 2022 +0000
Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt
commit 125e4994760448ff75dd9328395813eda1c87547
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 19:29:04 2022 +0000
add share option
Signed-off-by: Yi Dong <doyend@gmail.com>
commit e3b4c4d1f7346c9fa596f3cca6d4df0a9e05c368
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 5 11:43:48 2022 -0700
make sure consolidation works
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit a5c833964ecf05dc460ca1da69275c4019742150
Merge: 2a07ab52d abcb74be2
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 18:40:29 2022 +0000
Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt
commit 2a07ab52d95f15ba666823028c69e23825666c05
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 18:40:23 2022 +0000
added requirement
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 3abecd9dd1611993a87c537636abe7f7e6a9b04c
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 18:39:42 2022 +0000
added a simple web server
Signed-off-by: Yi Dong <doyend@gmail.com>
commit abcb74be2caf1cdec40eb9ba2be4dde4d45a3b4b
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 5 06:54:12 2022 -0700
fix empty val loss
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit b8eb92ac4a0d665570af75e34c9ba3c2e2420c26
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 19:25:30 2022 -0700
text gen working
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit d59f3e3f3a6fd19736d1c5706fed65a3dd4049ba
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 16:08:40 2022 -0700
first change
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 59d077585e6962a669b824af58f64e8a0bea6547
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 15:00:40 2022 -0700
revert
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 12a0f3902d99e9179403644bd951c045df716ca7
Author: Yi Dong <doyend@gmail.com>
Date: Tue Oct 4 21:26:23 2022 +0000
init imp
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 62a15dfd943cc48be495ac61b9f2f00995775c5f
Merge: 82c90d2cd e0cc6b767
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 11:58:26 2022 -0700
Merge branch 'main' into universal_prompt
commit 82c90d2cd0fd156f16a4b899f8c741d598f33990
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 11:17:13 2022 -0700
add sync
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 9819b703eef877d90cd1257bf3610c69de9b4d7e
Author: Yi Dong <yidong@nvidia.com>
Date: Sun Oct 2 17:52:34 2022 -0700
fix save model
Signed-off-by: root <root@luna-0197.selene.nvidia.com>
commit e4937e2fc5fb7d70754c97668416e4a69c3079fe
Author: Yi Dong <yidong@nvidia.com>
Date: Sat Oct 1 18:56:09 2022 +0000
working
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit b73b06d1c7cf5417a6d87cb33d8ed83a57e38b7b
Author: Yi Dong <yidong@nvidia.com>
Date: Sat Oct 1 17:34:03 2022 +0000
calcuate the mask
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 9db3bc13eb65a94a475b837603351da68e3745bc
Author: Yi Dong <yidong@nvidia.com>
Date: Fri Sep 30 23:26:32 2022 +0000
fix bug in datasets
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit f289900375d4412f53f8110be00fec6587627550
Author: Yi Dong <yidong@nvidia.com>
Date: Fri Sep 30 22:29:40 2022 +0000
update the code
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 8e28a1f208aabaab72dbe769e72756baada04d99
Author: Yi Dong <yidong@nvidia.com>
Date: Fri Sep 30 21:52:52 2022 +0000
added new ds
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 8d41315bab7ce90e200a8a7d1023c34f8e046897
Author: Yi Dong <doyend@gmail.com>
Date: Fri Sep 30 18:57:09 2022 +0000
added new files
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 984e0e94e15e16323c1ba1ca2efeabd84f69463f
Merge: cbe8b7ab1 fa6cd8588
Author: Yi Dong <doyend@gmail.com>
Date: Thu Sep 29 21:43:29 2022 +0000
Merge branch 'llm-prompt-learning-improvements' into universal_prompt
commit fa6cd858839277939446afe7275976078d54c512
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Thu Sep 29 16:47:30 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit 78ba46e5d6fde1be53c08e1e30a54cce59824be0
Merge: 7d6d46742 8d670bc77
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Thu Sep 29 09:43:27 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit 7d6d46742170a66758287a207d67e1b1bfd15613
Author: Virginia Adams <vadams@nvidia.com>
Date: Thu Sep 29 16:42:43 2022 +0000
Removed inference step and added sentence peice check to predict step
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 20fd265acd6f7f9912cf52155fe66ccfa6b201a2
Author: Virginia Adams <vadams@nvidia.com>
Date: Thu Sep 29 15:26:32 2022 +0000
fixed first stage check for pipeline parallel T5 pt
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 3637be2b258c8d9028856f9971edb7da4a8121f0
Merge: a3ea722fd 986a76612
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Wed Sep 28 10:23:30 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit a3ea722fdc12fbcc5989b76ef5643a574b763bc4
Merge: 770967a52 971485ce7
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Mon Sep 26 13:35:52 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit 770967a5251a474b6dcc2d44bf9a2076adbcb604
Merge: d23bf6c30 e3ac280a8
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Mon Sep 26 10:17:03 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit d23bf6c30acc0e3f6af9b4e24547669866a34d62
Merge: de6a31651 333d2b749
Author: Virginia Adams <vadams@nvidia.com>
Date: Mon Sep 26 10:05:16 2022 -0700
Merge branch 'llm-prompt-learning-improvements' of https://github.com/NVIDIA/NeMo into llm-prompt-learning-improvements
commit de6a31651e63d88a42b971794d93f18ff5a3cdff
Author: Virginia Adams <vadams@nvidia.com>
Date: Mon Sep 26 17:00:53 2022 +0000
Updated PP check to be on first stage pipeline only
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 333d2b7498e6742ce66436f733c980a74616900c
Merge: 592c0986a a39fc925a
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Fri Sep 23 16:11:21 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit 592c0986a476a91b57b8605d7b70830d7acfa021
Author: Virginia Adams <vadams@nvidia.com>
Date: Fri Sep 23 23:08:41 2022 +0000
Fixed unused import and CI test bug
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit ea9cd82d85638bc60ae4ad7ef105db931c8e3455
Merge: ce4b72c8c b566c2d0e
Author: Virginia Adams <vadams@nvidia.com>
Date: Fri Sep 23 18:57:25 2022 +0000
Merge branch 'llm-prompt-learning-improvements' of https://github.com/NVIDIA/NeMo into llm-prompt-learning-improvements
commit ce4b72c8c52f32be336e323dd78a38089edc3e7c
Author: Virginia Adams <vadams@nvidia.com>
Date: Fri Sep 23 18:57:16 2022 +0000
Switch to import from base class
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit b566c2d0e35a068f758fd1310bc620a47be4590b
Merge: 6621f2854 e872061ac
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Fri Sep 23 10:09:03 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit 6621f28543828a48484a5637f6c9f3ccb23a5b02
Author: Virginia Adams <vadams@nvidia.com>
Date: Wed Sep 14 20:47:35 2022 +0000
python format fix
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 8deafc8987b6af5f7b99a250310f57a40198c37f
Author: Virginia Adams <vadams@nvidia.com>
Date: Wed Sep 14 20:28:02 2022 +0000
Save .nemo on new best val score
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 761bd36969cb465d6a129e9eee6ce1f883d3cf41
Author: Virginia Adams <vadams@nvidia.com>
Date: Wed Sep 14 18:03:19 2022 +0000
Added automatic checkpoint to nemo file method
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 3be4ed57b6cd3ddfe4876d78650dfe8fe794598b
Author: Virginia Adams <vadams@nvidia.com>
Date: Wed Sep 14 02:11:56 2022 +0000
Make GPT use base prompt learning model class:
Signed-off-by: Virginia Adams <vadams@nvidia.com>
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix LGTM
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix validation
Signed-off-by: Yi Dong <yidong@nvidia.com>
* change for the lm eval
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* make text generation work in data parallel environment
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* implement the service with rest service
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* surpress log
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fix
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fixes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Update config
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Restore function needed for NMT
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* handles no answer only
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Fix config
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* added knn to web
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix lgtm.com comments
Signed-off-by: Yi Dong <yidong@nvidia.com>
* output the retrieved context
Signed-off-by: Yi Dong <yidong@nvidia.com>
* allow no neighbor query
Signed-off-by: Yi Dong <yidong@nvidia.com>
* remove the imports
Signed-off-by: Yi Dong <yidong@nvidia.com>
* warn only once
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Change output file format from JSON to JSONL
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* new t0 dataset
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Add T0 data preproc scripts
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Merge and multiprocessing
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix for is_correct
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* fix epoch > 2
Signed-off-by: Yi Dong <yidong@nvidia.com>
* handles multiple dataloader
Signed-off-by: Yi Dong <yidong@nvidia.com>
* remove template
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Refactor T0 dataset
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Add script to merge train folder into individual training files to minimize number of blends
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* added on the fly service
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add combo instance
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* added combo service
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* send weights back to server
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* fix index store
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Minor changes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add reset button
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add add eos
Signed-off-by: Yi Dong <yidong@nvidia.com>
* use a seperate bert service
Signed-off-by: Yi Dong <yidong@nvidia.com>
* no loss of accuracy
Signed-off-by: Yi Dong <yidong@nvidia.com>
* pin the gradio version
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Remove bin compat
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fix header lines
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* evaluate based on text generation
Signed-off-by: Yi Dong <yidong@nvidia.com>
* exact match result aggregation
Signed-off-by: Yi Dong <yidong@nvidia.com>
* working SP and SA
Signed-off-by: Yi Dong <yidong@nvidia.com>
* sync
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix checkpoint
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix eval
Signed-off-by: Yi Dong <yidong@nvidia.com>
* backup states
Signed-off-by: Yi Dong <yidong@nvidia.com>
* backup states reset
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix the bug
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix evaluation for sentence piece
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix a bug
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* potential fix in the future
Signed-off-by: Yi Dong <yidong@nvidia.com>
* remove the universal codes
Signed-off-by: Yi Dong <yidong@nvidia.com>
* remove universal strategy
Signed-off-by: Yi Dong <yidong@nvidia.com>
* address reviewer comment
Signed-off-by: Yi Dong <yidong@nvidia.com>
Signed-off-by: Yi Dong <yidong@nvidia.com>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add align function docstrings and make most args optional
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Remove redundant returns of viterbi and log probs matrices
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Rename h# to <initial_silence>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Update manifest format description in README
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* always remove any spaces from utt_id
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Patch the hanging of threads on very large stderr (#5589) (#5590)
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* O2 style amp for gpt3 ptuning (#5246)
* enable amp o2 plugin
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* only create master param if param requires gradient
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* remove pytorch autocast
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* Update optimizer_with_main_params.py
Signed-off-by: JimmyZhang12 <67203904+JimmyZhang12@users.noreply.github.com>
* create master grad only if param group requires grad
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* fix grad scaler for pp > 1
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
Signed-off-by: JimmyZhang12 <67203904+JimmyZhang12@users.noreply.github.com>
Co-authored-by: Jimmy Zhang <jiemingz@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Better patch hydra (#5591) (#5592)
* Readd buffereing and thread drain to Hydra Launcher
Signed-off-by: smajumdar <titu1994@gmail.com>
* Readd buffereing and thread drain to Hydra Launcher
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Yet another fix with hydra multirun (#5594) (#5595)
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add RETRO model documentation (#5578)
* added retro doc
Signed-off-by: Yi Dong <yidong@nvidia.com>
* finish data part
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added the data format
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added training script
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added training and evaluation steps
Signed-off-by: Yi Dong <yidong@nvidia.com>
* edit the text
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added the images
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix beginning
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix the grammar
Signed-off-by: Yi Dong <yidong@nvidia.com>
* trim it down
Signed-off-by: Yi Dong <yidong@nvidia.com>
* add wandb option
Signed-off-by: Yi Dong <yidong@nvidia.com>
* add reference
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix path
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added the parameters table
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix section
Signed-off-by: Yi Dong <yidong@nvidia.com>
Signed-off-by: Yi Dong <yidong@nvidia.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fix: setup_multiple validation/test data (#5585)
Fix: setup_multiple validation/test data (#5585)
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Move to optimizer based EMA implementation (#5169)
* Move to optimizer
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Fix replacing weights
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Allow swapping of weights be optional
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Save 2 models
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Use different hook
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Expose cpu device
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add clause to see if this fixes issue with O2 optimizer
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Try to get O2 working
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* WIP
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Fixes
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Fixes to tests
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add guard
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Remove import
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add guard
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add comment
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Remove overwrite
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add BatchNorm, currently tests fail
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Fix tests/functionality for batch norm
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Get rid of NLP changes
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* AIStore for ASR datasets (#5462)
AIStore for ASR datasets
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add support for MHA adapters to ASR (#5396)
* Convert AbstractAdapterModule to AbstractAdapterMixin
Signed-off-by: smajumdar <titu1994@gmail.com>
* Temporary fixes to new signature of mixin
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add adapter util for constants, add all mha adapters.
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update name of function
Signed-off-by: smajumdar <titu1994@gmail.com>
* Roll back changes to convASR
Signed-off-by: smajumdar <titu1994@gmail.com>
* Convert AbstractAdapterModule to AbstractAdapterMixin
Signed-off-by: smajumdar <titu1994@gmail.com>
* First draft of Conformer support for MHA attention
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add some preliminary tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add support for projection of the hidden dimension for attention
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add support for squeezeformer
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update train adapter config
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add tests for squeezeformer and unit tests for new modules
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update config for hp search,set limits on modules for conformer and squeezeformer, update adapter mixin, add cache to import_from_class_path
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update location of adapters
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add pre_norm for proper attention learning, Fix the issue with nan/inf in pos_bias_u and pos_bias_v
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update expmanager to clean up checkpoints
Signed-off-by: smajumdar <titu1994@gmail.com>
* Fix style
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add docstrings and update tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add docstrings and update tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add docstrings and update tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Update training scripts
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update config and docs
Signed-off-by: smajumdar <titu1994@gmail.com>
* Expose nemo delete function
Signed-off-by: smajumdar <titu1994@gmail.com>
* Correct adapter partial state saving
Signed-off-by: smajumdar <titu1994@gmail.com>
* Correct a bug with state management of adapter tokens
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Pull down EMA test
Signed-off-by: smajumdar <titu1994@gmail.com>
* Correct name of adapter module utility class
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Remove unused TTS eval functions w/ pesq and pystoi dependencies (#5605) (#5606)
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Co-authored-by: Jocelyn <jocelynh@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Create separator parameter
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Call align function with hydra config
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* update usage example
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Update Dockerfile (#5614) (#5616)
Pinned to use `numba==0.53.1` to avoid crashing in training with `num_workers > 0`. This is just a temporary workaround, still need to fix it in the future.
Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Co-authored-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make separate pretrained_name and model_path parameters
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* make "optional" tags bold in markdown
Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Move non-main functions to utils dir
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Temp workaround: Disable test with cache_audio=True since it is failing in CI (#5607) (#5615)
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Co-authored-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS] fix ranges of char set for accented letters. (#5607)
* [TTS] fix ranges of char set for accented letters.
* remove digits pattern and added unit tests for math operators.
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Change success message to reduce confusion (#5621)
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Update documentation and tutorials for Adapters (#5610)
* Improve docs for adapter and tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* Improve docs for adapter and tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Update test
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Rename test file
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS] add type hints and change varialbe names for tokenizers and g2p (#5602)
* [TTS] add type hints and change variable names for tokenizers and g2p
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* 1. Added missing import for gather_objects. (#5627)
Signed-off-by: Micha Livne <mlivne@nvidia.com>
Signed-off-by: Micha Livne <mlivne@nvidia.com>
Co-authored-by: Micha Livne <mlivne@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][ZH] add fastpitch and hifigan model NGC urls and update NeMo docs. (#5596) (#5625)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fixed RadTTS unit test (#5572)
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* remove tests (#5633)
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][DOC] add notes about automatic conversion to target sampling rates. (#5624) (#5634)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Conformer local attention (#5525)
* local attn and merge
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* optional
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* override
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* incorporate comments
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* update
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* fix
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* comment
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* changes, test
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* changes
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* check att context
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* readme link
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* utils
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* update
Signed-off-by: sam1373 <samuelkriman@gmail.com>
Signed-off-by: sam1373 <samuelkriman@gmail.com>
Signed-off-by: Samuel Kriman <samuelkriman@gmail.com>
Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add core classes and functions for online clustering diarizer part 1 (#5526)
* Add core classes and functions for online clustering diarizer
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add audio to labels code
Signed-off-by: Taejin Park <tango4j@gmail.com>
* resolve type errors
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* added unit=tests for very short audio
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Filled all missing docstrings
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* resolved conflict and added missing docstrings
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fixed unit-test errors
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* fix the wrongly added file - megatron_gpt_model.py
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Fix wrongly included file - megatron_gpt_model.py
Signed-off-by: Taejin Park <tango4j@gmail.com>
* resolve code quality issue
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Fixed unit-test errors and bugs
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* changed total_sec for offline_clustering toy_data in unit-tests
Signed-off-by: Taejin Park <tango4j@gmail.com>
* fixed merging index offset bug
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* only including part 1 files
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* removed unused function
Signed-off-by: Taejin Park <tango4j@gmail.com>
* fixed unused imports
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* divided nmesc_clustering.py into two and reflected first-pass comments
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* adding offline/online_clustering.py
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* fix code QL autocomment
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Removed unused imports
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Update nemo/collections/asr/parts/utils/online_clustering.py
Co-authored-by: Sean Naren <snarenthiran@nvidia.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Reflected comments
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* resolved code scanning issue
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Update nemo/collections/asr/parts/utils/offline_clustering.py
Co-authored-by: Sean Naren <snarenthiran@nvidia.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com>
Co-authored-by: Sean Naren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [STT] Add Esperanto (Eo) ASR Conformer-CTC and Conformer-Transducer models (#5639) (#5641)
* add stt_eo_conformer_ctc_large model
* stt_eo_conformer_transducer_large
Co-authored-by: Andrei Andrusenko <52885736+andrusenkoau@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Removed unused import
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Specify that filepaths need to be absolute
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* replaces any spaces in utt_id with dashes
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make hydra script callable by another script
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* do not specify default model or model_downsample_factor
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [Dockerfile] Remove AIS archive from docker image (#5629)
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Measure audio_sr from audio instead of needing to specify
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][ZH] Disambiguate polyphones with augmented dict and Jieba segmenter for Chinese FastPitch (#5541)
* Chinese TTS replaces default pypinyin dict
* Add jieba word segmenter as an option
Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>
Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make separate parameters for device of transcription and viterbi steps
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add mention of gecko
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [workflow] add exclude labels option to ignore cherry-picks in release changelog. (#5645)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][ZH] bugfix for the tutorial and add NGC CLI installation guide. (#5643) (#5647)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [Add] ASR+VAD Inference Pipeline (#5575)
Added offline ASR+VAD inference pipeline that matches with what's in RIVA, along with some feature-based ASR and classification datasets.
Signed-off-by: stevehuang52 <heh@nvidia.com>
Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* rename separator to ctm_grouping_separator and refactor
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Bert interleaved (#5556)
* Adding SP and SAR support Bert
* Adding Sequence parallel support to Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding Sequence parallel support to Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding SP and SAR support Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding SP and SAR support Bert
* Adding SP and SAR support Bert
* Adding Sequence parallel support to Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding Sequence parallel support to Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding Sequence parallel support to Bert
* Update bert_model.py
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
* Adding tests
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Addressing Eric's comments
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Addressing Eric's comments
* Fix bug fix sequence parallel and Interleaved
* Fix bug fix sequence parallel and Interleaved
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add duration padding support for RADTTS inference (#5650)
* Added duration padding support for RADTTS inference
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Co-authored-by: Kevin Shih <kshih@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add remove_blank_tokens_from_ctm parameter
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Dont save initial_silence line in CTM
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add DLLogger support to exp_manager (#5658)
* Add DLLogger support to exp_manager
Signed-off-by: Alexandre Milesi <milesial@users.noreply.github.com>
* Move dllogger to separate file and check import
Signed-off-by: Alexandre Milesi <milesial@users.noreply.github.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Remove unused import
Signed-off-by: Alexandre Milesi <milesial@users.noreply.github.com>
Signed-off-by: Alexandre Milesi <milesial@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* add minimum_timestamp_duration parameter
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* add suggestion about removing blanks to README
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* reorder args
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* clarify description of ctm_grouping_separator in README
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* update docstring
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][ZH] bugfix for ngc cli installation. (#5652) (#5664)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Port stateless timer to exp manager (#5584)
* Port stateless timer to exp manager
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fixes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fixes and remove from all megatron code
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fixes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Change message
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fix EMA restart by allowing device to be set by the class init (#5668)
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Remove SDP (moved to separate repo) - merge to main (#5630)
* Remove sdp files from tools folder
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add page to docs with new SDP location
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add interface for making amax reduction optional for FP8 (#5447)
* add TE interface for making amax reduction optional
Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS] a…
ericharper
added a commit
that referenced
this pull request
Jan 31, 2023
* Merge r1.13.0 main (#5570)
* update branch
Signed-off-by: ericharper <complex451@gmail.com>
* Rename Speech Dataset Processor to Speech Data Processor (#5378)
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Megatron Export Update (#5343)
* export update for Megatron + change ORT optimization
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* updated export_utils to use autocast instead of manually casting >:/
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* removed dtype from LayerNorm
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* added comment
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* reverting changes on FloatCast
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* Cherry-picked changes from megatron-norm
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
* updated asr_model import to cast_utils
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* updated del onnx_model place
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* changed ort optimization to basic -> temp fix
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com>
* Disable sync_batch_comm in validation_step for GPT (#5397)
* disable sync_batch_comm in validation_step
Signed-off-by: ericharper <complex451@gmail.com>
* Read sync_batch_comm from config or default to False
Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com>
* Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error
Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com>
* Empty
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Comment out test
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com>
Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
* Radtts 1.13 (#5451)
* [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358)
* [TTS] add CI test for RADTTS training recipe.
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
* Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) (#5478)
* Initial refactor
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Resolve config before passing to load_from_checkpoint
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fixes for model parallel and nemo restore
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fixes for eval
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Revert config changes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Refactor
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix typo
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Remove comments
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Minor
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix validation reconfiguration
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Remove old comment
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fixes for test_ds
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* export_utils bugfix (#5480)
* updated export_utils
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* Export fixes for Riva (#5496)
* Export fixes for Riva
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
* Cleaning up training_utils
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
* added set_start_method + function param bugfix (#5539)
* added set_start_method + function param bugfix
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* upper bound torchmetrics
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Signed-off-by: ericharper <complex451@gmail.com>
Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: ericharper <complex451@gmail.com>
* remove notebook (#5548)
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: ericharper <complex451@gmail.com>
* update readme
Signed-off-by: ericharper <complex451@gmail.com>
* update branch
Signed-off-by: ericharper <complex451@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com>
Co-authored-by: David <amosalla@asu.edu>
Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com>
Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com>
Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Optimized loop and bugfix in SDE (#5573)
- Fixed bug with loading custom data attributes from JSON in Speech Data Explorer
Signed-off-by: George Zelenfroynd <gzelenfroind@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Update torchmetrics (#5566)
* add task arg
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
* update state
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
Co-authored-by: Taejin Park <tango4j@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* remove useless files. (#5580)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* add initial NFA code
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make use of the specified device during viterbi decoding
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fix CodeQL notes
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fix CodeQL warning
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add an option to defer data setup from ``__init__`` to ``setup`` (#5569)
* Add an option to defer dataloader setup from __init__ to setup
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Updated doc
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make utt_id specified by number of parts of audio_filepath user wishes to use
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* remove audio_sr TODO - reduce risk of silent bugs
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add check that model is CTC
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Remove unused import
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Text generation improvement (UI client, data parallel support) (#5437)
* Squashed commit of the following:
commit a5e124f34be31bd6eafe5e5fdf5bedcd0d50915c
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Thu Oct 13 15:07:42 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit 35b424044fe80c3081e7756ab21244f701716f7e
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 08:04:49 2022 -0700
get rid of base
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 2955210e2311791543538cfbb5ad26b79414c954
Merge: d52edef8c eaf6757ca
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 13:17:02 2022 +0000
Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt
commit d52edef8cd7b36593838fb270047e80f8ccb652e
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 13:16:24 2022 +0000
align with main
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit eaf6757ca5be8e099492f57c81d984429b0ad49c
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Thu Oct 13 13:12:11 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit c4b86d97626ea0721bf8fb4c0a45dec5becc94c9
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 13:10:58 2022 +0000
same as main
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit e335de51bcc0d681c58b568c3d8c238bc5687c3b
Merge: c231086e0 4463a9fe9
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 13:08:09 2022 +0000
Merge branch 'main' into universal_prompt
commit c231086e057f1efaa915f691d84664cb3d5aad85
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Wed Oct 12 19:59:12 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit 6a821a4b49a23dd3408a706a2a3dd393149b0bb1
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 19:56:17 2022 +0000
default to pad
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 9d908e39fef1beed9ba2da4d1a6806161eb7ef25
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 19:55:44 2022 +0000
add the option to pad the tokens
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 876dc395b43fdeeaa2bcbbe13c76523633764c33
Merge: fbb0f4035 fe3c77ee9
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 19:20:47 2022 +0000
Merge branch 'fix_global_init' into universal_prompt
commit fe3c77ee93ab6cf3ea152db68cb6beefcac2a392
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 18:59:49 2022 +0000
fix import again
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit fbb0f4035c6cd6bfefed50a20605503de8c1dccb
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Wed Oct 12 16:00:24 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit 372ca8c0d7988f2339b15888dc72aa21f4fb6937
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 15:58:32 2022 +0000
enable server
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit cbe05d9fbc978f812cfbb671f45f147f300713c4
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 13:07:28 2022 +0000
fix comment error
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 1948048922e726ec6131e44b1a745389f18d4ef2
Merge: 232c2cce3 984f5c09a
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 13:05:30 2022 +0000
Merge branch 'fix_global_init' into universal_prompt
commit 232c2cce34d7a8b902da406706f3dd9b39475091
Merge: 34c8a68df 658243fb6
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 12:50:00 2022 +0000
Merge branch 'fix_global_init' into universal_prompt
commit 984f5c09a6dbf1d1fb5aa30ed9b0df188e66a50f
Merge: 658243fb6 3fda5de46
Author: Yi Dong <43824965+yidong72@users.noreply.github.com>
Date: Wed Oct 12 08:42:11 2022 -0400
Merge branch 'main' into fix_global_init
commit 658243fb6580191b5d60edd30cde16dcc23cbb85
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 12 12:40:57 2022 +0000
fix import error
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 8e0fe1cad05ec288ec122b3cd0e139a96872e08c
Author: Yi Dong <doyend@gmail.com>
Date: Tue Oct 11 22:44:12 2022 +0000
update the fused kernel
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 536cf6bef9447b75843fad630729c47a2fba35f3
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 11 14:44:52 2022 -0700
add the missing file
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 1b437ec41dc5e354453ce0a089bca0171cbcb6c2
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 11 14:43:14 2022 -0700
fix fused softmax
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 7813f60e05f9783af61f8c14ec1cb0c6c4f1f263
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 11 14:16:48 2022 -0700
move global step to base
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 34c8a68df084b18d377e84415d9f07b2cd6673dd
Author: Yi Dong <doyend@gmail.com>
Date: Thu Oct 6 13:50:11 2022 +0000
fix pipeline for eval
Signed-off-by: Yi Dong <doyend@gmail.com>
commit eee5d38218f26660c3ffebe9f615c850c80a1f0d
Author: Yi Dong <doyend@gmail.com>
Date: Thu Oct 6 13:48:22 2022 +0000
fix for pipleline parallel
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 323bca73e7ef6099ee79c0a2fffac7b709ed6c5d
Merge: 125e49947 e3b4c4d1f
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 19:29:13 2022 +0000
Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt
commit 125e4994760448ff75dd9328395813eda1c87547
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 19:29:04 2022 +0000
add share option
Signed-off-by: Yi Dong <doyend@gmail.com>
commit e3b4c4d1f7346c9fa596f3cca6d4df0a9e05c368
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 5 11:43:48 2022 -0700
make sure consolidation works
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit a5c833964ecf05dc460ca1da69275c4019742150
Merge: 2a07ab52d abcb74be2
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 18:40:29 2022 +0000
Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt
commit 2a07ab52d95f15ba666823028c69e23825666c05
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 18:40:23 2022 +0000
added requirement
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 3abecd9dd1611993a87c537636abe7f7e6a9b04c
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 18:39:42 2022 +0000
added a simple web server
Signed-off-by: Yi Dong <doyend@gmail.com>
commit abcb74be2caf1cdec40eb9ba2be4dde4d45a3b4b
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 5 06:54:12 2022 -0700
fix empty val loss
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit b8eb92ac4a0d665570af75e34c9ba3c2e2420c26
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 19:25:30 2022 -0700
text gen working
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit d59f3e3f3a6fd19736d1c5706fed65a3dd4049ba
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 16:08:40 2022 -0700
first change
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 59d077585e6962a669b824af58f64e8a0bea6547
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 15:00:40 2022 -0700
revert
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 12a0f3902d99e9179403644bd951c045df716ca7
Author: Yi Dong <doyend@gmail.com>
Date: Tue Oct 4 21:26:23 2022 +0000
init imp
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 62a15dfd943cc48be495ac61b9f2f00995775c5f
Merge: 82c90d2cd e0cc6b767
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 11:58:26 2022 -0700
Merge branch 'main' into universal_prompt
commit 82c90d2cd0fd156f16a4b899f8c741d598f33990
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 11:17:13 2022 -0700
add sync
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 9819b703eef877d90cd1257bf3610c69de9b4d7e
Author: Yi Dong <yidong@nvidia.com>
Date: Sun Oct 2 17:52:34 2022 -0700
fix save model
Signed-off-by: root <root@luna-0197.selene.nvidia.com>
commit e4937e2fc5fb7d70754c97668416e4a69c3079fe
Author: Yi Dong <yidong@nvidia.com>
Date: Sat Oct 1 18:56:09 2022 +0000
working
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit b73b06d1c7cf5417a6d87cb33d8ed83a57e38b7b
Author: Yi Dong <yidong@nvidia.com>
Date: Sat Oct 1 17:34:03 2022 +0000
calcuate the mask
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 9db3bc13eb65a94a475b837603351da68e3745bc
Author: Yi Dong <yidong@nvidia.com>
Date: Fri Sep 30 23:26:32 2022 +0000
fix bug in datasets
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit f289900375d4412f53f8110be00fec6587627550
Author: Yi Dong <yidong@nvidia.com>
Date: Fri Sep 30 22:29:40 2022 +0000
update the code
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 8e28a1f208aabaab72dbe769e72756baada04d99
Author: Yi Dong <yidong@nvidia.com>
Date: Fri Sep 30 21:52:52 2022 +0000
added new ds
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 8d41315bab7ce90e200a8a7d1023c34f8e046897
Author: Yi Dong <doyend@gmail.com>
Date: Fri Sep 30 18:57:09 2022 +0000
added new files
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 984e0e94e15e16323c1ba1ca2efeabd84f69463f
Merge: cbe8b7ab1 fa6cd8588
Author: Yi Dong <doyend@gmail.com>
Date: Thu Sep 29 21:43:29 2022 +0000
Merge branch 'llm-prompt-learning-improvements' into universal_prompt
commit fa6cd858839277939446afe7275976078d54c512
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Thu Sep 29 16:47:30 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit 78ba46e5d6fde1be53c08e1e30a54cce59824be0
Merge: 7d6d46742 8d670bc77
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Thu Sep 29 09:43:27 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit 7d6d46742170a66758287a207d67e1b1bfd15613
Author: Virginia Adams <vadams@nvidia.com>
Date: Thu Sep 29 16:42:43 2022 +0000
Removed inference step and added sentence peice check to predict step
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 20fd265acd6f7f9912cf52155fe66ccfa6b201a2
Author: Virginia Adams <vadams@nvidia.com>
Date: Thu Sep 29 15:26:32 2022 +0000
fixed first stage check for pipeline parallel T5 pt
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 3637be2b258c8d9028856f9971edb7da4a8121f0
Merge: a3ea722fd 986a76612
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Wed Sep 28 10:23:30 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit a3ea722fdc12fbcc5989b76ef5643a574b763bc4
Merge: 770967a52 971485ce7
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Mon Sep 26 13:35:52 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit 770967a5251a474b6dcc2d44bf9a2076adbcb604
Merge: d23bf6c30 e3ac280a8
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Mon Sep 26 10:17:03 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit d23bf6c30acc0e3f6af9b4e24547669866a34d62
Merge: de6a31651 333d2b749
Author: Virginia Adams <vadams@nvidia.com>
Date: Mon Sep 26 10:05:16 2022 -0700
Merge branch 'llm-prompt-learning-improvements' of https://github.com/NVIDIA/NeMo into llm-prompt-learning-improvements
commit de6a31651e63d88a42b971794d93f18ff5a3cdff
Author: Virginia Adams <vadams@nvidia.com>
Date: Mon Sep 26 17:00:53 2022 +0000
Updated PP check to be on first stage pipeline only
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 333d2b7498e6742ce66436f733c980a74616900c
Merge: 592c0986a a39fc925a
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Fri Sep 23 16:11:21 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit 592c0986a476a91b57b8605d7b70830d7acfa021
Author: Virginia Adams <vadams@nvidia.com>
Date: Fri Sep 23 23:08:41 2022 +0000
Fixed unused import and CI test bug
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit ea9cd82d85638bc60ae4ad7ef105db931c8e3455
Merge: ce4b72c8c b566c2d0e
Author: Virginia Adams <vadams@nvidia.com>
Date: Fri Sep 23 18:57:25 2022 +0000
Merge branch 'llm-prompt-learning-improvements' of https://github.com/NVIDIA/NeMo into llm-prompt-learning-improvements
commit ce4b72c8c52f32be336e323dd78a38089edc3e7c
Author: Virginia Adams <vadams@nvidia.com>
Date: Fri Sep 23 18:57:16 2022 +0000
Switch to import from base class
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit b566c2d0e35a068f758fd1310bc620a47be4590b
Merge: 6621f2854 e872061ac
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Fri Sep 23 10:09:03 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit 6621f28543828a48484a5637f6c9f3ccb23a5b02
Author: Virginia Adams <vadams@nvidia.com>
Date: Wed Sep 14 20:47:35 2022 +0000
python format fix
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 8deafc8987b6af5f7b99a250310f57a40198c37f
Author: Virginia Adams <vadams@nvidia.com>
Date: Wed Sep 14 20:28:02 2022 +0000
Save .nemo on new best val score
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 761bd36969cb465d6a129e9eee6ce1f883d3cf41
Author: Virginia Adams <vadams@nvidia.com>
Date: Wed Sep 14 18:03:19 2022 +0000
Added automatic checkpoint to nemo file method
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 3be4ed57b6cd3ddfe4876d78650dfe8fe794598b
Author: Virginia Adams <vadams@nvidia.com>
Date: Wed Sep 14 02:11:56 2022 +0000
Make GPT use base prompt learning model class:
Signed-off-by: Virginia Adams <vadams@nvidia.com>
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix LGTM
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix validation
Signed-off-by: Yi Dong <yidong@nvidia.com>
* change for the lm eval
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* make text generation work in data parallel environment
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* implement the service with rest service
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* surpress log
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fix
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fixes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Update config
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Restore function needed for NMT
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* handles no answer only
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Fix config
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* added knn to web
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix lgtm.com comments
Signed-off-by: Yi Dong <yidong@nvidia.com>
* output the retrieved context
Signed-off-by: Yi Dong <yidong@nvidia.com>
* allow no neighbor query
Signed-off-by: Yi Dong <yidong@nvidia.com>
* remove the imports
Signed-off-by: Yi Dong <yidong@nvidia.com>
* warn only once
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Change output file format from JSON to JSONL
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* new t0 dataset
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Add T0 data preproc scripts
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Merge and multiprocessing
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix for is_correct
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* fix epoch > 2
Signed-off-by: Yi Dong <yidong@nvidia.com>
* handles multiple dataloader
Signed-off-by: Yi Dong <yidong@nvidia.com>
* remove template
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Refactor T0 dataset
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Add script to merge train folder into individual training files to minimize number of blends
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* added on the fly service
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add combo instance
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* added combo service
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* send weights back to server
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* fix index store
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Minor changes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add reset button
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add add eos
Signed-off-by: Yi Dong <yidong@nvidia.com>
* use a seperate bert service
Signed-off-by: Yi Dong <yidong@nvidia.com>
* no loss of accuracy
Signed-off-by: Yi Dong <yidong@nvidia.com>
* pin the gradio version
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Remove bin compat
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fix header lines
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* evaluate based on text generation
Signed-off-by: Yi Dong <yidong@nvidia.com>
* exact match result aggregation
Signed-off-by: Yi Dong <yidong@nvidia.com>
* working SP and SA
Signed-off-by: Yi Dong <yidong@nvidia.com>
* sync
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix checkpoint
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix eval
Signed-off-by: Yi Dong <yidong@nvidia.com>
* backup states
Signed-off-by: Yi Dong <yidong@nvidia.com>
* backup states reset
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix the bug
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix evaluation for sentence piece
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix a bug
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* potential fix in the future
Signed-off-by: Yi Dong <yidong@nvidia.com>
* remove the universal codes
Signed-off-by: Yi Dong <yidong@nvidia.com>
* remove universal strategy
Signed-off-by: Yi Dong <yidong@nvidia.com>
* address reviewer comment
Signed-off-by: Yi Dong <yidong@nvidia.com>
Signed-off-by: Yi Dong <yidong@nvidia.com>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add align function docstrings and make most args optional
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Remove redundant returns of viterbi and log probs matrices
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Rename h# to <initial_silence>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Update manifest format description in README
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* always remove any spaces from utt_id
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Patch the hanging of threads on very large stderr (#5589) (#5590)
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* O2 style amp for gpt3 ptuning (#5246)
* enable amp o2 plugin
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* only create master param if param requires gradient
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* remove pytorch autocast
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* Update optimizer_with_main_params.py
Signed-off-by: JimmyZhang12 <67203904+JimmyZhang12@users.noreply.github.com>
* create master grad only if param group requires grad
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* fix grad scaler for pp > 1
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
Signed-off-by: JimmyZhang12 <67203904+JimmyZhang12@users.noreply.github.com>
Co-authored-by: Jimmy Zhang <jiemingz@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Better patch hydra (#5591) (#5592)
* Readd buffereing and thread drain to Hydra Launcher
Signed-off-by: smajumdar <titu1994@gmail.com>
* Readd buffereing and thread drain to Hydra Launcher
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Yet another fix with hydra multirun (#5594) (#5595)
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add RETRO model documentation (#5578)
* added retro doc
Signed-off-by: Yi Dong <yidong@nvidia.com>
* finish data part
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added the data format
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added training script
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added training and evaluation steps
Signed-off-by: Yi Dong <yidong@nvidia.com>
* edit the text
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added the images
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix beginning
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix the grammar
Signed-off-by: Yi Dong <yidong@nvidia.com>
* trim it down
Signed-off-by: Yi Dong <yidong@nvidia.com>
* add wandb option
Signed-off-by: Yi Dong <yidong@nvidia.com>
* add reference
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix path
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added the parameters table
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix section
Signed-off-by: Yi Dong <yidong@nvidia.com>
Signed-off-by: Yi Dong <yidong@nvidia.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fix: setup_multiple validation/test data (#5585)
Fix: setup_multiple validation/test data (#5585)
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Move to optimizer based EMA implementation (#5169)
* Move to optimizer
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Fix replacing weights
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Allow swapping of weights be optional
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Save 2 models
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Use different hook
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Expose cpu device
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add clause to see if this fixes issue with O2 optimizer
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Try to get O2 working
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* WIP
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Fixes
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Fixes to tests
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add guard
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Remove import
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add guard
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add comment
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Remove overwrite
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add BatchNorm, currently tests fail
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Fix tests/functionality for batch norm
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Get rid of NLP changes
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* AIStore for ASR datasets (#5462)
AIStore for ASR datasets
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add support for MHA adapters to ASR (#5396)
* Convert AbstractAdapterModule to AbstractAdapterMixin
Signed-off-by: smajumdar <titu1994@gmail.com>
* Temporary fixes to new signature of mixin
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add adapter util for constants, add all mha adapters.
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update name of function
Signed-off-by: smajumdar <titu1994@gmail.com>
* Roll back changes to convASR
Signed-off-by: smajumdar <titu1994@gmail.com>
* Convert AbstractAdapterModule to AbstractAdapterMixin
Signed-off-by: smajumdar <titu1994@gmail.com>
* First draft of Conformer support for MHA attention
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add some preliminary tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add support for projection of the hidden dimension for attention
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add support for squeezeformer
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update train adapter config
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add tests for squeezeformer and unit tests for new modules
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update config for hp search,set limits on modules for conformer and squeezeformer, update adapter mixin, add cache to import_from_class_path
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update location of adapters
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add pre_norm for proper attention learning, Fix the issue with nan/inf in pos_bias_u and pos_bias_v
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update expmanager to clean up checkpoints
Signed-off-by: smajumdar <titu1994@gmail.com>
* Fix style
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add docstrings and update tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add docstrings and update tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add docstrings and update tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Update training scripts
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update config and docs
Signed-off-by: smajumdar <titu1994@gmail.com>
* Expose nemo delete function
Signed-off-by: smajumdar <titu1994@gmail.com>
* Correct adapter partial state saving
Signed-off-by: smajumdar <titu1994@gmail.com>
* Correct a bug with state management of adapter tokens
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Pull down EMA test
Signed-off-by: smajumdar <titu1994@gmail.com>
* Correct name of adapter module utility class
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Remove unused TTS eval functions w/ pesq and pystoi dependencies (#5605) (#5606)
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Co-authored-by: Jocelyn <jocelynh@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Create separator parameter
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Call align function with hydra config
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* update usage example
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Update Dockerfile (#5614) (#5616)
Pinned to use `numba==0.53.1` to avoid crashing in training with `num_workers > 0`. This is just a temporary workaround, still need to fix it in the future.
Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Co-authored-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make separate pretrained_name and model_path parameters
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* make "optional" tags bold in markdown
Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Move non-main functions to utils dir
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Temp workaround: Disable test with cache_audio=True since it is failing in CI (#5607) (#5615)
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Co-authored-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS] fix ranges of char set for accented letters. (#5607)
* [TTS] fix ranges of char set for accented letters.
* remove digits pattern and added unit tests for math operators.
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Change success message to reduce confusion (#5621)
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Update documentation and tutorials for Adapters (#5610)
* Improve docs for adapter and tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* Improve docs for adapter and tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Update test
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Rename test file
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS] add type hints and change varialbe names for tokenizers and g2p (#5602)
* [TTS] add type hints and change variable names for tokenizers and g2p
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* 1. Added missing import for gather_objects. (#5627)
Signed-off-by: Micha Livne <mlivne@nvidia.com>
Signed-off-by: Micha Livne <mlivne@nvidia.com>
Co-authored-by: Micha Livne <mlivne@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][ZH] add fastpitch and hifigan model NGC urls and update NeMo docs. (#5596) (#5625)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fixed RadTTS unit test (#5572)
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* remove tests (#5633)
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][DOC] add notes about automatic conversion to target sampling rates. (#5624) (#5634)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Conformer local attention (#5525)
* local attn and merge
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* optional
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* override
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* incorporate comments
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* update
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* fix
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* comment
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* changes, test
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* changes
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* check att context
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* readme link
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* utils
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* update
Signed-off-by: sam1373 <samuelkriman@gmail.com>
Signed-off-by: sam1373 <samuelkriman@gmail.com>
Signed-off-by: Samuel Kriman <samuelkriman@gmail.com>
Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add core classes and functions for online clustering diarizer part 1 (#5526)
* Add core classes and functions for online clustering diarizer
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add audio to labels code
Signed-off-by: Taejin Park <tango4j@gmail.com>
* resolve type errors
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* added unit=tests for very short audio
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Filled all missing docstrings
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* resolved conflict and added missing docstrings
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fixed unit-test errors
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* fix the wrongly added file - megatron_gpt_model.py
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Fix wrongly included file - megatron_gpt_model.py
Signed-off-by: Taejin Park <tango4j@gmail.com>
* resolve code quality issue
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Fixed unit-test errors and bugs
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* changed total_sec for offline_clustering toy_data in unit-tests
Signed-off-by: Taejin Park <tango4j@gmail.com>
* fixed merging index offset bug
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* only including part 1 files
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* removed unused function
Signed-off-by: Taejin Park <tango4j@gmail.com>
* fixed unused imports
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* divided nmesc_clustering.py into two and reflected first-pass comments
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* adding offline/online_clustering.py
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* fix code QL autocomment
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Removed unused imports
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Update nemo/collections/asr/parts/utils/online_clustering.py
Co-authored-by: Sean Naren <snarenthiran@nvidia.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Reflected comments
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* resolved code scanning issue
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Update nemo/collections/asr/parts/utils/offline_clustering.py
Co-authored-by: Sean Naren <snarenthiran@nvidia.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com>
Co-authored-by: Sean Naren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [STT] Add Esperanto (Eo) ASR Conformer-CTC and Conformer-Transducer models (#5639) (#5641)
* add stt_eo_conformer_ctc_large model
* stt_eo_conformer_transducer_large
Co-authored-by: Andrei Andrusenko <52885736+andrusenkoau@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Removed unused import
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Specify that filepaths need to be absolute
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* replaces any spaces in utt_id with dashes
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make hydra script callable by another script
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* do not specify default model or model_downsample_factor
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [Dockerfile] Remove AIS archive from docker image (#5629)
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Measure audio_sr from audio instead of needing to specify
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][ZH] Disambiguate polyphones with augmented dict and Jieba segmenter for Chinese FastPitch (#5541)
* Chinese TTS replaces default pypinyin dict
* Add jieba word segmenter as an option
Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>
Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make separate parameters for device of transcription and viterbi steps
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add mention of gecko
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [workflow] add exclude labels option to ignore cherry-picks in release changelog. (#5645)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][ZH] bugfix for the tutorial and add NGC CLI installation guide. (#5643) (#5647)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [Add] ASR+VAD Inference Pipeline (#5575)
Added offline ASR+VAD inference pipeline that matches with what's in RIVA, along with some feature-based ASR and classification datasets.
Signed-off-by: stevehuang52 <heh@nvidia.com>
Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* rename separator to ctm_grouping_separator and refactor
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Bert interleaved (#5556)
* Adding SP and SAR support Bert
* Adding Sequence parallel support to Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding Sequence parallel support to Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding SP and SAR support Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding SP and SAR support Bert
* Adding SP and SAR support Bert
* Adding Sequence parallel support to Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding Sequence parallel support to Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding Sequence parallel support to Bert
* Update bert_model.py
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
* Adding tests
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Addressing Eric's comments
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Addressing Eric's comments
* Fix bug fix sequence parallel and Interleaved
* Fix bug fix sequence parallel and Interleaved
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add duration padding support for RADTTS inference (#5650)
* Added duration padding support for RADTTS inference
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Co-authored-by: Kevin Shih <kshih@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add remove_blank_tokens_from_ctm parameter
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Dont save initial_silence line in CTM
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add DLLogger support to exp_manager (#5658)
* Add DLLogger support to exp_manager
Signed-off-by: Alexandre Milesi <milesial@users.noreply.github.com>
* Move dllogger to separate file and check import
Signed-off-by: Alexandre Milesi <milesial@users.noreply.github.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Remove unused import
Signed-off-by: Alexandre Milesi <milesial@users.noreply.github.com>
Signed-off-by: Alexandre Milesi <milesial@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* add minimum_timestamp_duration parameter
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* add suggestion about removing blanks to README
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* reorder args
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* clarify description of ctm_grouping_separator in README
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* update docstring
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][ZH] bugfix for ngc cli installation. (#5652) (#5664)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Port stateless timer to exp manager (#5584)
* Port stateless timer to exp manager
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fixes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fixes and remove from all megatron code
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fixes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Change message
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fix EMA restart by allowing device to be set by the class init (#5668)
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Remove SDP (moved to separate repo) - merge to main (#5630)
* Remove sdp files from tools folder
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add page to docs with new SDP location
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add interface for making amax reduction optional for FP8 (#5447)
* add TE interface for making amax reduction optional
Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS] add tts dict cust notebook (#5662)
* add tts dict cust notebook
Signed-off-by: ekmb <ebakhturina@nvidia.com>
* review
Signed-off-by: ekmb <ebakhturina@nvidia.com>
* fixed audio links
Signed-off-by: ekmb <ebakhturina@nvidia.com>
* remove old notebook
Signed-off-by: ekmb <ebakhturina@nvidia.com>
* fix typo
Signed-off-by: ekmb <ebakhturina@nvidia.com>
Signed-off-by: ekmb <ebakhturina@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [ASR] Audio processing base, multi-channel enhancement models (#5356)
* Audio processing base model, enc-mask-dec enhancement, tests and modules
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Addressed review comments
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Fixed CodeQL warnings
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Addressed PR comments
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Addressed PR comments:
- renamed AudioProcessingModel to AudioToAudioModel
- various small modifications
- updated unit tests
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Addressed comments
- Moved spectrogram to audio_preprocessing
- Renamed MultichannelFeatures
- Updated config and unit tests
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Expose ClusteringDiarizer device (#5681)
* Expose device for users to set
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Expose device for users to set
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add Beam Search support to ASR transcribe() (#5443)
* Add support for beam decoding via high level API.
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add ctc decoding section
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update ctc transcribe API to return results from beam search
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add argument to preserve arpa file
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update script to use hydra config, add some support for future compute timesteps, add doc for ctc decoding
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update eval script and doc to use new API
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add tests for ctc greedy decoding
Signed-off-by: smajumdar <titu1994@gmail.com>
* Address reviewer comments and add docstrings
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix changes and address comments
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Samuel Kriman <samuelkriman@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Propagate attention_dropout flag for GPT-3 (#5669)
* Propagate attention_dropout flag for GPT-3
Signed-off-by: Mikołaj Błaż <mblaz@nvidia.com>
* Add default to megatron_gpt_config
Signed-off-by: Mikołaj Błaż <mblaz@nvidia.com>
Signed-off-by: Mikołaj Błaż <mblaz@nvidia.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Eric H…
ericharper
added a commit
that referenced
this pull request
Jan 31, 2023
* Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount * new structure for tts datasets in script folder Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * remove cmudict downloading Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * rename mixertts dataset, add vocoder dataset Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add libritts processing Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * update tts dataset and libritts get data Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix bugs in vocoder ds Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add ds * changed vits yaml * rm yaml * fix yaml and model * Added scaler * refactored yaml * managed to run in fp16 * refactoring Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix small bugs and add new todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix optimizers Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * Port Variational Inference with Adversarial Learning (VITS) to NeMo TTS (#6) * Add vits files Add vits_losses.py, vits_modules.py and vits.py. * Move non-vits models to modules * Add vits.yaml * Add _loader to vits.py * Add basic template for vits * Update vits.yaml with vits parameters * Remove extra space * Add top level training script * Add some variables to vits yaml * Add forward and training methods * Fix imports * Added validation step * Log training losses * Update loss calls to use class attributes * Add VITS to models list * Fix all imports * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Fix imports for VITS * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Add parameters from original VITS config * Fix config file * Fix imports and generate spec from audio * Fix incorrect dimensions * Progress update * Fix loss * Fix cuda thing * Fix monotonic align import * Fix typos in vits.py * Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Jason <jasoli@nvidia.com> Signed-off-by: Jason <jasoli@nvidia.com> * make new commit Signed-off-by: Jason <jasoli@nvidia.com> * add copyright headers Signed-off-by: Jason <jasoli@nvidia.com> * style Signed-off-by: Jason <jasoli@nvidia.com> * rename README Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix style without vits_modules Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add numba code, fix style and add todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * small fix * fix some todos * added numba mas * added DDP sampler * specified versions * fixed for new librosa version * added feature loss * added IPA phonemizer * refactored IPA g2p * added vits losses * some ref * fix * added checkpointing * cp * cfg * merged some 1.8.0 fixes * plt fix * fix logging * fix checkpoint loading * refactored inference * fp32 run * update branch Signed-off-by: ericharper <complex451@gmail.com> * update package info Signed-off-by: ericharper <complex451@gmail.com> * new exp * update branch Signed-off-by: ericharper <complex451@gmail.com> * Restored tests previously disabled for 22.03 base (#4109) Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * add augmentation to label models (#4113) * add augmentation to label models Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * duration fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Call register_bert_model after assigning self.bert_model variable (#4116) Signed-off-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> * Tutorial on ITN with Thutmose tagger and small fixes (#4117) * 1. Add tutorial. 2. Move a function to fix import in tutorial. 3. Merge multiple spaces into one space in the final output Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fixes for code review Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * Add tutorial to tutorials.rst Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update the default (#4135) Signed-off-by: ekmb <ebakhturina@nvidia.com> * Draft: Fix restoring from checkpoint for case when `model.common_dataset_parameters.label_vocab_dir` is provided (#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Do not create tmp directory Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix parameter name Signed-off-by: PeganovAnton <peganoff2@mail.ru> * finish cherry-pick op Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix labels errors Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove duplicate stage Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Change target branch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * fix typo (#4140) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Fix/punctuation avoid overwritting tmp files (#4144) * Add draft of fixing tmp files overwritting Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Use built-in tempfile library Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix code style Signed-off-by: PeganovAnton <peganoff2@mail.ru> * bug_fix_diarization_manifest_creation (#4125) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> * fix doc (#4146) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Tacotron2 retrain (#4103) * fix yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Fix for new TTSDataset class Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * inference fix Signed-off-by: treacker <emshabalin@yandex.ru> * removed old code Signed-off-by: treacker <emshabalin@yandex.ru> * updated parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * reverted version update Signed-off-by: treacker <emshabalin@yandex.ru> * refactored parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * Updated Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Update Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Update tacotron.yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: treacker <emshabalin@yandex.ru> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Fixed jenkins Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * Multiprocess improvements (#4127) * initial commit Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * start fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated scp to filelist Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * WaveGlow input type fixes (#4151) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * notebooks' link, typo and import fix (#4158) * redo missing pr 4007 Signed-off-by: fayejf <fayejf07@gmail.com> * remove extremely unreliable links Signed-off-by: fayejf <fayejf07@gmail.com> * Thutmose tagger bug fixes (#4162) * add pretrained ngc model, small fixes Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * 1. fix typos. 2. write magic functions without space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add example of inference with pretrained model Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * changed model location to nemo Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * style fix Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * update speaker docs (#4164) * update speaker docs Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * chunks -> segments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Khz -> kHz Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * changed to vits g2p * refactoring * added cosineLR * Updated whitelist path * added vanilla torch grad scaler * Fixed lightning version * added warmup and wd * switched to cosineLR * refactored data classes for vits * some fixes * fixed import * changeg train loop * fixed scheduler bug * refactoring for exps * Refactored loss logic * Ref for exps * added coqui stuff * exps * bugfix * added side file * bugfix * reverted * fixed sampler behaviour * updated for ptl 1.7.2 * refactored dataloader func * some cleaning * reverted to vanilla loss * modified for pickling * added dataset class * fixed torch version * added autocast for fp training * removed coqui files * 'Fixed tokenizer' * Fix tokenizer * update branch Signed-off-by: ericharper <complex451@gmail.com> * Fix link to inference notebook (#5247) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Update ASR scores table (#5254) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Fix links to speaker identification notebook (#5260) Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> * Minor typo fixes in TTS tutorial (#5266) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Pcla tutorial fixes (#5271) * Fixed typos Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed cell type and tatoeba reference Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed typo Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed branch variable Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix bug into Dialogue tutorial (#5277) * Typo fix (#5288) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix dialogue tutorial bug (#5297) * set add_pooling_layer=False for huggingface bert model * remove add_pooling_layer=False and set find_unused_parameters=True * set num_prompt_tokens to 0 for huggingface * small bugfix for r1.13.0 (#5310) * typo fix Signed-off-by: fayejf <fayejf07@gmail.com> * udpate transcribe Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: fayejf <fayejf07@gmail.com> * Add italian model checkpoints (#5316) Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> * [STT] Add Ru ASR Conformer-CTC and Conformer-Transducer (#5340) * [STT] Add stt_ru_conformer_ctc_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [STT] Add stt_ru_conformer_transducer_large Add stt_ru_conformer_transducer_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Pcla tutorial fixes (#5313) * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * moved `create_text_and_labels` to token_classification_utils.py Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * a lot of refactoring * strict ptl version * strict ptl version * reverted plt version * Added base text2audio class * Fix issue with HF Model upload tutorial (#5359) * Add Gradio App to ASR Docs (#5270) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> (cherry picked from commit e4b6a38) * Fix issue with normalized config for dataset name Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * tutorial fixes (#5354) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Add SDP documentation (#5274) * Add details to SDP README.md Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to WriteManifest processor Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to CreateInitialManifestMLS Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ModifyManifestTextProcessor docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ASRInference docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add base_processor docstrings Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add minimal SDP docs page Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Update tools/speech_dataset_processor/README.md Co-authored-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> * Write simple README for SDP and move complex explanations to docs Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove incorrect type hints Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Make config example less confusing Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Fix typo Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Clarify that YAML file is config file in README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove unused imports Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove SDP docs for now Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove links to docs in SDP README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> * [Bugfix] Added rm -f / wget- nc command in multispeaker sim notebook to r1.13.0 (#5375) * Fix minor error in notebook Signed-off-by: Taejin Park <tango4j@gmail.com> * changed branch name in tutorial notebook Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Taejin Park <tango4j@gmail.com> * Rename Speech Dataset Processor to Speech Data Processor (#5378) Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * fix for num worker 0 causing issues in losses after 1 epoch (#5379) * Fixed bug in notebook (#5382) Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> * Force MHA QKV onto fp32 (#5391) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Added scheduling variety * ref * Fix for prompt table restore error (#5393) * Fix for prompt table restore error Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added more saftey checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Added more condition checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Virginia Adams <vadams@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Fix args (#5410) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * bugfix * import tests * Add temporary fix for CUDA issue in Dockerfile (#5421) Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> * Megatron Export Update (#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added comment Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * updated del onnx_model place Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> * disable pc test (#5426) Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: ekmb <ebakhturina@nvidia.com> * Fix GPT generation when using sentencepiece tokenizer (#5413) * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Disable sync_batch_comm in validation_step for GPT (#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <complex451@gmail.com> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Empty Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Comment out test Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> * Revert "Add temporary fix for CUDA issue in Dockerfile (#5421)" (#5431) This reverts commit 0718b17. * Revert workaround for T5 that sets number of workers to 0 & sync_batch_comm=False (#5420) * Revert workers workaround Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix in config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Fixed discrepancies * updated Jenkisfile * updated Jenkisfile * Cleaning * fixed the onnx bug in conformer for non-streaming models. (#5242) (#5446) Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> * Set sync_batch_comm in other places (#5448) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Radtts 1.13 (#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Radtts 1.13 plus (#5457) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * Fixing RADTTS training - removing view buffer and fixing accuracy issue * Fixes for Torchscript/Triton * Added autocast to radtts UT * using cuda() for training example Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Add num layers check (#5470) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Change to kwargs (#5475) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) (#5478) * Initial refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for eval Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove comments Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Minor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove old comment Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (#5480) * updated export_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * minor bug fix (#5521) Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added set_start_method + function param bugfix (#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <complex451@gmail.com> * remove notebook (#5548) Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: ericharper <complex451@gmail.com> * Remove broadcast (#5558) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * cleaning * Fix all gather while writing to a file during T5 finetuning (#5561) * Gather from data parallel only instead of all ranks Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update readme Signed-off-by: ericharper <complex451@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added copyright * fixed imports * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * cleaning * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fixed filesize check * last cleaning Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated cmudict path * fixed merge bug Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * warnings fix * fix warnings Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * storing * updated version Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * update Jenkinsfile versions Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed issues Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed more issues * more fixes Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * added experimental tag * Clarification updates Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * docstring fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * imports fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix typo Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * excessive comtutations fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * typecheck fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Small refactoring * Small refactoring Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * reversed exp_manager params Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Fixed call for new function signature Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> Signed-off-by: Jason <jasoli@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: PeganovAnton <peganoff2@mail.ru> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> Co-authored-by: jasonjjl1999 <jasonjjl1999@gmail.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Oktai Tatanov <oktai.tatanov@gmail.com> Co-authored-by: jasonjjl1999 <43978361+jasonjjl1999@users.noreply.github.com> Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: Jason <jasoli@nvidia.com> Co-authored-by: ericharper <complex451@gmail.com> Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: Ramanathan Arunachalam <ramanathan.arun@rutgers.edu> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Evelina <10428420+ekmb@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: Jocelyn <jocelynh@nvidia.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Sean Naren <snarenthiran@nvidia.com> Co-authored-by: Matvei Novikov <mattyson.so@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> Co-authored-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Taejin Park <tango4j@gmail.com> Co-authored-by: Adi Renduchintala <108822655+arendu@users.noreply.github.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: yaoyu-33 <54727607+yaoyu-33@users.noreply.github.com> Co-authored-by: David <amosalla@asu.edu> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Kipok
added a commit
to Kipok/NeMo
that referenced
this pull request
Jan 31, 2023
* Merge r1.13.0 main (#5570)
* update branch
Signed-off-by: ericharper <complex451@gmail.com>
* Rename Speech Dataset Processor to Speech Data Processor (#5378)
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Megatron Export Update (#5343)
* export update for Megatron + change ORT optimization
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* updated export_utils to use autocast instead of manually casting >:/
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* removed dtype from LayerNorm
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* added comment
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* reverting changes on FloatCast
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* Cherry-picked changes from megatron-norm
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
* updated asr_model import to cast_utils
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* updated del onnx_model place
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* changed ort optimization to basic -> temp fix
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com>
* Disable sync_batch_comm in validation_step for GPT (#5397)
* disable sync_batch_comm in validation_step
Signed-off-by: ericharper <complex451@gmail.com>
* Read sync_batch_comm from config or default to False
Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com>
* Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error
Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com>
* Empty
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Comment out test
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com>
Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
* Radtts 1.13 (#5451)
* [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358)
* [TTS] add CI test for RADTTS training recipe.
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
* Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) (#5478)
* Initial refactor
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Resolve config before passing to load_from_checkpoint
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fixes for model parallel and nemo restore
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fixes for eval
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Revert config changes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Refactor
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix typo
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Remove comments
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Minor
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix validation reconfiguration
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Remove old comment
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fixes for test_ds
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* export_utils bugfix (#5480)
* updated export_utils
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* Export fixes for Riva (#5496)
* Export fixes for Riva
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
* Cleaning up training_utils
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
* added set_start_method + function param bugfix (#5539)
* added set_start_method + function param bugfix
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* upper bound torchmetrics
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Signed-off-by: ericharper <complex451@gmail.com>
Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: ericharper <complex451@gmail.com>
* remove notebook (#5548)
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: ericharper <complex451@gmail.com>
* update readme
Signed-off-by: ericharper <complex451@gmail.com>
* update branch
Signed-off-by: ericharper <complex451@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com>
Co-authored-by: David <amosalla@asu.edu>
Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com>
Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com>
Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Optimized loop and bugfix in SDE (#5573)
- Fixed bug with loading custom data attributes from JSON in Speech Data Explorer
Signed-off-by: George Zelenfroynd <gzelenfroind@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Update torchmetrics (#5566)
* add task arg
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
* update state
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
Co-authored-by: Taejin Park <tango4j@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* remove useless files. (#5580)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* add initial NFA code
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make use of the specified device during viterbi decoding
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fix CodeQL notes
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fix CodeQL warning
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add an option to defer data setup from ``__init__`` to ``setup`` (#5569)
* Add an option to defer dataloader setup from __init__ to setup
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Updated doc
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make utt_id specified by number of parts of audio_filepath user wishes to use
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* remove audio_sr TODO - reduce risk of silent bugs
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add check that model is CTC
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Remove unused import
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Text generation improvement (UI client, data parallel support) (#5437)
* Squashed commit of the following:
commit a5e124f34be31bd6eafe5e5fdf5bedcd0d50915c
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Thu Oct 13 15:07:42 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit 35b424044fe80c3081e7756ab21244f701716f7e
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 08:04:49 2022 -0700
get rid of base
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 2955210e2311791543538cfbb5ad26b79414c954
Merge: d52edef8c eaf6757ca
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 13:17:02 2022 +0000
Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt
commit d52edef8cd7b36593838fb270047e80f8ccb652e
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 13:16:24 2022 +0000
align with main
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit eaf6757ca5be8e099492f57c81d984429b0ad49c
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Thu Oct 13 13:12:11 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit c4b86d97626ea0721bf8fb4c0a45dec5becc94c9
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 13:10:58 2022 +0000
same as main
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit e335de51bcc0d681c58b568c3d8c238bc5687c3b
Merge: c231086e0 4463a9fe9
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 13:08:09 2022 +0000
Merge branch 'main' into universal_prompt
commit c231086e057f1efaa915f691d84664cb3d5aad85
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Wed Oct 12 19:59:12 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit 6a821a4b49a23dd3408a706a2a3dd393149b0bb1
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 19:56:17 2022 +0000
default to pad
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 9d908e39fef1beed9ba2da4d1a6806161eb7ef25
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 19:55:44 2022 +0000
add the option to pad the tokens
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 876dc395b43fdeeaa2bcbbe13c76523633764c33
Merge: fbb0f4035 fe3c77ee9
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 19:20:47 2022 +0000
Merge branch 'fix_global_init' into universal_prompt
commit fe3c77ee93ab6cf3ea152db68cb6beefcac2a392
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 18:59:49 2022 +0000
fix import again
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit fbb0f4035c6cd6bfefed50a20605503de8c1dccb
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Wed Oct 12 16:00:24 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit 372ca8c0d7988f2339b15888dc72aa21f4fb6937
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 15:58:32 2022 +0000
enable server
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit cbe05d9fbc978f812cfbb671f45f147f300713c4
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 13:07:28 2022 +0000
fix comment error
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 1948048922e726ec6131e44b1a745389f18d4ef2
Merge: 232c2cce3 984f5c09a
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 13:05:30 2022 +0000
Merge branch 'fix_global_init' into universal_prompt
commit 232c2cce34d7a8b902da406706f3dd9b39475091
Merge: 34c8a68df 658243fb6
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 12:50:00 2022 +0000
Merge branch 'fix_global_init' into universal_prompt
commit 984f5c09a6dbf1d1fb5aa30ed9b0df188e66a50f
Merge: 658243fb6 3fda5de46
Author: Yi Dong <43824965+yidong72@users.noreply.github.com>
Date: Wed Oct 12 08:42:11 2022 -0400
Merge branch 'main' into fix_global_init
commit 658243fb6580191b5d60edd30cde16dcc23cbb85
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 12 12:40:57 2022 +0000
fix import error
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 8e0fe1cad05ec288ec122b3cd0e139a96872e08c
Author: Yi Dong <doyend@gmail.com>
Date: Tue Oct 11 22:44:12 2022 +0000
update the fused kernel
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 536cf6bef9447b75843fad630729c47a2fba35f3
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 11 14:44:52 2022 -0700
add the missing file
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 1b437ec41dc5e354453ce0a089bca0171cbcb6c2
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 11 14:43:14 2022 -0700
fix fused softmax
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 7813f60e05f9783af61f8c14ec1cb0c6c4f1f263
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 11 14:16:48 2022 -0700
move global step to base
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 34c8a68df084b18d377e84415d9f07b2cd6673dd
Author: Yi Dong <doyend@gmail.com>
Date: Thu Oct 6 13:50:11 2022 +0000
fix pipeline for eval
Signed-off-by: Yi Dong <doyend@gmail.com>
commit eee5d38218f26660c3ffebe9f615c850c80a1f0d
Author: Yi Dong <doyend@gmail.com>
Date: Thu Oct 6 13:48:22 2022 +0000
fix for pipleline parallel
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 323bca73e7ef6099ee79c0a2fffac7b709ed6c5d
Merge: 125e49947 e3b4c4d1f
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 19:29:13 2022 +0000
Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt
commit 125e4994760448ff75dd9328395813eda1c87547
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 19:29:04 2022 +0000
add share option
Signed-off-by: Yi Dong <doyend@gmail.com>
commit e3b4c4d1f7346c9fa596f3cca6d4df0a9e05c368
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 5 11:43:48 2022 -0700
make sure consolidation works
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit a5c833964ecf05dc460ca1da69275c4019742150
Merge: 2a07ab52d abcb74be2
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 18:40:29 2022 +0000
Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt
commit 2a07ab52d95f15ba666823028c69e23825666c05
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 18:40:23 2022 +0000
added requirement
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 3abecd9dd1611993a87c537636abe7f7e6a9b04c
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 18:39:42 2022 +0000
added a simple web server
Signed-off-by: Yi Dong <doyend@gmail.com>
commit abcb74be2caf1cdec40eb9ba2be4dde4d45a3b4b
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 5 06:54:12 2022 -0700
fix empty val loss
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit b8eb92ac4a0d665570af75e34c9ba3c2e2420c26
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 19:25:30 2022 -0700
text gen working
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit d59f3e3f3a6fd19736d1c5706fed65a3dd4049ba
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 16:08:40 2022 -0700
first change
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 59d077585e6962a669b824af58f64e8a0bea6547
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 15:00:40 2022 -0700
revert
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 12a0f3902d99e9179403644bd951c045df716ca7
Author: Yi Dong <doyend@gmail.com>
Date: Tue Oct 4 21:26:23 2022 +0000
init imp
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 62a15dfd943cc48be495ac61b9f2f00995775c5f
Merge: 82c90d2cd e0cc6b767
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 11:58:26 2022 -0700
Merge branch 'main' into universal_prompt
commit 82c90d2cd0fd156f16a4b899f8c741d598f33990
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 11:17:13 2022 -0700
add sync
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 9819b703eef877d90cd1257bf3610c69de9b4d7e
Author: Yi Dong <yidong@nvidia.com>
Date: Sun Oct 2 17:52:34 2022 -0700
fix save model
Signed-off-by: root <root@luna-0197.selene.nvidia.com>
commit e4937e2fc5fb7d70754c97668416e4a69c3079fe
Author: Yi Dong <yidong@nvidia.com>
Date: Sat Oct 1 18:56:09 2022 +0000
working
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit b73b06d1c7cf5417a6d87cb33d8ed83a57e38b7b
Author: Yi Dong <yidong@nvidia.com>
Date: Sat Oct 1 17:34:03 2022 +0000
calcuate the mask
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 9db3bc13eb65a94a475b837603351da68e3745bc
Author: Yi Dong <yidong@nvidia.com>
Date: Fri Sep 30 23:26:32 2022 +0000
fix bug in datasets
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit f289900375d4412f53f8110be00fec6587627550
Author: Yi Dong <yidong@nvidia.com>
Date: Fri Sep 30 22:29:40 2022 +0000
update the code
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 8e28a1f208aabaab72dbe769e72756baada04d99
Author: Yi Dong <yidong@nvidia.com>
Date: Fri Sep 30 21:52:52 2022 +0000
added new ds
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 8d41315bab7ce90e200a8a7d1023c34f8e046897
Author: Yi Dong <doyend@gmail.com>
Date: Fri Sep 30 18:57:09 2022 +0000
added new files
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 984e0e94e15e16323c1ba1ca2efeabd84f69463f
Merge: cbe8b7ab1 fa6cd8588
Author: Yi Dong <doyend@gmail.com>
Date: Thu Sep 29 21:43:29 2022 +0000
Merge branch 'llm-prompt-learning-improvements' into universal_prompt
commit fa6cd858839277939446afe7275976078d54c512
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Thu Sep 29 16:47:30 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit 78ba46e5d6fde1be53c08e1e30a54cce59824be0
Merge: 7d6d46742 8d670bc77
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Thu Sep 29 09:43:27 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit 7d6d46742170a66758287a207d67e1b1bfd15613
Author: Virginia Adams <vadams@nvidia.com>
Date: Thu Sep 29 16:42:43 2022 +0000
Removed inference step and added sentence peice check to predict step
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 20fd265acd6f7f9912cf52155fe66ccfa6b201a2
Author: Virginia Adams <vadams@nvidia.com>
Date: Thu Sep 29 15:26:32 2022 +0000
fixed first stage check for pipeline parallel T5 pt
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 3637be2b258c8d9028856f9971edb7da4a8121f0
Merge: a3ea722fd 986a76612
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Wed Sep 28 10:23:30 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit a3ea722fdc12fbcc5989b76ef5643a574b763bc4
Merge: 770967a52 971485ce7
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Mon Sep 26 13:35:52 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit 770967a5251a474b6dcc2d44bf9a2076adbcb604
Merge: d23bf6c30 e3ac280a8
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Mon Sep 26 10:17:03 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit d23bf6c30acc0e3f6af9b4e24547669866a34d62
Merge: de6a31651 333d2b749
Author: Virginia Adams <vadams@nvidia.com>
Date: Mon Sep 26 10:05:16 2022 -0700
Merge branch 'llm-prompt-learning-improvements' of https://github.com/NVIDIA/NeMo into llm-prompt-learning-improvements
commit de6a31651e63d88a42b971794d93f18ff5a3cdff
Author: Virginia Adams <vadams@nvidia.com>
Date: Mon Sep 26 17:00:53 2022 +0000
Updated PP check to be on first stage pipeline only
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 333d2b7498e6742ce66436f733c980a74616900c
Merge: 592c0986a a39fc925a
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Fri Sep 23 16:11:21 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit 592c0986a476a91b57b8605d7b70830d7acfa021
Author: Virginia Adams <vadams@nvidia.com>
Date: Fri Sep 23 23:08:41 2022 +0000
Fixed unused import and CI test bug
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit ea9cd82d85638bc60ae4ad7ef105db931c8e3455
Merge: ce4b72c8c b566c2d0e
Author: Virginia Adams <vadams@nvidia.com>
Date: Fri Sep 23 18:57:25 2022 +0000
Merge branch 'llm-prompt-learning-improvements' of https://github.com/NVIDIA/NeMo into llm-prompt-learning-improvements
commit ce4b72c8c52f32be336e323dd78a38089edc3e7c
Author: Virginia Adams <vadams@nvidia.com>
Date: Fri Sep 23 18:57:16 2022 +0000
Switch to import from base class
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit b566c2d0e35a068f758fd1310bc620a47be4590b
Merge: 6621f2854 e872061ac
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Fri Sep 23 10:09:03 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit 6621f28543828a48484a5637f6c9f3ccb23a5b02
Author: Virginia Adams <vadams@nvidia.com>
Date: Wed Sep 14 20:47:35 2022 +0000
python format fix
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 8deafc8987b6af5f7b99a250310f57a40198c37f
Author: Virginia Adams <vadams@nvidia.com>
Date: Wed Sep 14 20:28:02 2022 +0000
Save .nemo on new best val score
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 761bd36969cb465d6a129e9eee6ce1f883d3cf41
Author: Virginia Adams <vadams@nvidia.com>
Date: Wed Sep 14 18:03:19 2022 +0000
Added automatic checkpoint to nemo file method
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 3be4ed57b6cd3ddfe4876d78650dfe8fe794598b
Author: Virginia Adams <vadams@nvidia.com>
Date: Wed Sep 14 02:11:56 2022 +0000
Make GPT use base prompt learning model class:
Signed-off-by: Virginia Adams <vadams@nvidia.com>
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix LGTM
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix validation
Signed-off-by: Yi Dong <yidong@nvidia.com>
* change for the lm eval
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* make text generation work in data parallel environment
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* implement the service with rest service
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* surpress log
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fix
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fixes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Update config
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Restore function needed for NMT
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* handles no answer only
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Fix config
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* added knn to web
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix lgtm.com comments
Signed-off-by: Yi Dong <yidong@nvidia.com>
* output the retrieved context
Signed-off-by: Yi Dong <yidong@nvidia.com>
* allow no neighbor query
Signed-off-by: Yi Dong <yidong@nvidia.com>
* remove the imports
Signed-off-by: Yi Dong <yidong@nvidia.com>
* warn only once
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Change output file format from JSON to JSONL
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* new t0 dataset
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Add T0 data preproc scripts
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Merge and multiprocessing
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix for is_correct
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* fix epoch > 2
Signed-off-by: Yi Dong <yidong@nvidia.com>
* handles multiple dataloader
Signed-off-by: Yi Dong <yidong@nvidia.com>
* remove template
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Refactor T0 dataset
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Add script to merge train folder into individual training files to minimize number of blends
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* added on the fly service
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add combo instance
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* added combo service
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* send weights back to server
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* fix index store
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Minor changes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add reset button
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add add eos
Signed-off-by: Yi Dong <yidong@nvidia.com>
* use a seperate bert service
Signed-off-by: Yi Dong <yidong@nvidia.com>
* no loss of accuracy
Signed-off-by: Yi Dong <yidong@nvidia.com>
* pin the gradio version
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Remove bin compat
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fix header lines
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* evaluate based on text generation
Signed-off-by: Yi Dong <yidong@nvidia.com>
* exact match result aggregation
Signed-off-by: Yi Dong <yidong@nvidia.com>
* working SP and SA
Signed-off-by: Yi Dong <yidong@nvidia.com>
* sync
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix checkpoint
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix eval
Signed-off-by: Yi Dong <yidong@nvidia.com>
* backup states
Signed-off-by: Yi Dong <yidong@nvidia.com>
* backup states reset
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix the bug
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix evaluation for sentence piece
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix a bug
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* potential fix in the future
Signed-off-by: Yi Dong <yidong@nvidia.com>
* remove the universal codes
Signed-off-by: Yi Dong <yidong@nvidia.com>
* remove universal strategy
Signed-off-by: Yi Dong <yidong@nvidia.com>
* address reviewer comment
Signed-off-by: Yi Dong <yidong@nvidia.com>
Signed-off-by: Yi Dong <yidong@nvidia.com>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add align function docstrings and make most args optional
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Remove redundant returns of viterbi and log probs matrices
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Rename h# to <initial_silence>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Update manifest format description in README
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* always remove any spaces from utt_id
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Patch the hanging of threads on very large stderr (#5589) (#5590)
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* O2 style amp for gpt3 ptuning (#5246)
* enable amp o2 plugin
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* only create master param if param requires gradient
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* remove pytorch autocast
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* Update optimizer_with_main_params.py
Signed-off-by: JimmyZhang12 <67203904+JimmyZhang12@users.noreply.github.com>
* create master grad only if param group requires grad
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* fix grad scaler for pp > 1
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
Signed-off-by: JimmyZhang12 <67203904+JimmyZhang12@users.noreply.github.com>
Co-authored-by: Jimmy Zhang <jiemingz@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Better patch hydra (#5591) (#5592)
* Readd buffereing and thread drain to Hydra Launcher
Signed-off-by: smajumdar <titu1994@gmail.com>
* Readd buffereing and thread drain to Hydra Launcher
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Yet another fix with hydra multirun (#5594) (#5595)
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add RETRO model documentation (#5578)
* added retro doc
Signed-off-by: Yi Dong <yidong@nvidia.com>
* finish data part
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added the data format
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added training script
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added training and evaluation steps
Signed-off-by: Yi Dong <yidong@nvidia.com>
* edit the text
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added the images
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix beginning
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix the grammar
Signed-off-by: Yi Dong <yidong@nvidia.com>
* trim it down
Signed-off-by: Yi Dong <yidong@nvidia.com>
* add wandb option
Signed-off-by: Yi Dong <yidong@nvidia.com>
* add reference
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix path
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added the parameters table
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix section
Signed-off-by: Yi Dong <yidong@nvidia.com>
Signed-off-by: Yi Dong <yidong@nvidia.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fix: setup_multiple validation/test data (#5585)
Fix: setup_multiple validation/test data (#5585)
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Move to optimizer based EMA implementation (#5169)
* Move to optimizer
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Fix replacing weights
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Allow swapping of weights be optional
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Save 2 models
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Use different hook
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Expose cpu device
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add clause to see if this fixes issue with O2 optimizer
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Try to get O2 working
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* WIP
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Fixes
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Fixes to tests
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add guard
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Remove import
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add guard
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add comment
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Remove overwrite
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add BatchNorm, currently tests fail
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Fix tests/functionality for batch norm
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Get rid of NLP changes
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* AIStore for ASR datasets (#5462)
AIStore for ASR datasets
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add support for MHA adapters to ASR (#5396)
* Convert AbstractAdapterModule to AbstractAdapterMixin
Signed-off-by: smajumdar <titu1994@gmail.com>
* Temporary fixes to new signature of mixin
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add adapter util for constants, add all mha adapters.
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update name of function
Signed-off-by: smajumdar <titu1994@gmail.com>
* Roll back changes to convASR
Signed-off-by: smajumdar <titu1994@gmail.com>
* Convert AbstractAdapterModule to AbstractAdapterMixin
Signed-off-by: smajumdar <titu1994@gmail.com>
* First draft of Conformer support for MHA attention
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add some preliminary tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add support for projection of the hidden dimension for attention
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add support for squeezeformer
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update train adapter config
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add tests for squeezeformer and unit tests for new modules
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update config for hp search,set limits on modules for conformer and squeezeformer, update adapter mixin, add cache to import_from_class_path
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update location of adapters
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add pre_norm for proper attention learning, Fix the issue with nan/inf in pos_bias_u and pos_bias_v
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update expmanager to clean up checkpoints
Signed-off-by: smajumdar <titu1994@gmail.com>
* Fix style
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add docstrings and update tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add docstrings and update tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add docstrings and update tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Update training scripts
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update config and docs
Signed-off-by: smajumdar <titu1994@gmail.com>
* Expose nemo delete function
Signed-off-by: smajumdar <titu1994@gmail.com>
* Correct adapter partial state saving
Signed-off-by: smajumdar <titu1994@gmail.com>
* Correct a bug with state management of adapter tokens
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Pull down EMA test
Signed-off-by: smajumdar <titu1994@gmail.com>
* Correct name of adapter module utility class
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Remove unused TTS eval functions w/ pesq and pystoi dependencies (#5605) (#5606)
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Co-authored-by: Jocelyn <jocelynh@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Create separator parameter
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Call align function with hydra config
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* update usage example
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Update Dockerfile (#5614) (#5616)
Pinned to use `numba==0.53.1` to avoid crashing in training with `num_workers > 0`. This is just a temporary workaround, still need to fix it in the future.
Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Co-authored-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make separate pretrained_name and model_path parameters
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* make "optional" tags bold in markdown
Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Move non-main functions to utils dir
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Temp workaround: Disable test with cache_audio=True since it is failing in CI (#5607) (#5615)
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Co-authored-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS] fix ranges of char set for accented letters. (#5607)
* [TTS] fix ranges of char set for accented letters.
* remove digits pattern and added unit tests for math operators.
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Change success message to reduce confusion (#5621)
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Update documentation and tutorials for Adapters (#5610)
* Improve docs for adapter and tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* Improve docs for adapter and tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Update test
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Rename test file
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS] add type hints and change varialbe names for tokenizers and g2p (#5602)
* [TTS] add type hints and change variable names for tokenizers and g2p
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* 1. Added missing import for gather_objects. (#5627)
Signed-off-by: Micha Livne <mlivne@nvidia.com>
Signed-off-by: Micha Livne <mlivne@nvidia.com>
Co-authored-by: Micha Livne <mlivne@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][ZH] add fastpitch and hifigan model NGC urls and update NeMo docs. (#5596) (#5625)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fixed RadTTS unit test (#5572)
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* remove tests (#5633)
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][DOC] add notes about automatic conversion to target sampling rates. (#5624) (#5634)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Conformer local attention (#5525)
* local attn and merge
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* optional
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* override
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* incorporate comments
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* update
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* fix
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* comment
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* changes, test
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* changes
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* check att context
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* readme link
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* utils
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* update
Signed-off-by: sam1373 <samuelkriman@gmail.com>
Signed-off-by: sam1373 <samuelkriman@gmail.com>
Signed-off-by: Samuel Kriman <samuelkriman@gmail.com>
Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add core classes and functions for online clustering diarizer part 1 (#5526)
* Add core classes and functions for online clustering diarizer
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add audio to labels code
Signed-off-by: Taejin Park <tango4j@gmail.com>
* resolve type errors
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* added unit=tests for very short audio
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Filled all missing docstrings
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* resolved conflict and added missing docstrings
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fixed unit-test errors
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* fix the wrongly added file - megatron_gpt_model.py
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Fix wrongly included file - megatron_gpt_model.py
Signed-off-by: Taejin Park <tango4j@gmail.com>
* resolve code quality issue
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Fixed unit-test errors and bugs
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* changed total_sec for offline_clustering toy_data in unit-tests
Signed-off-by: Taejin Park <tango4j@gmail.com>
* fixed merging index offset bug
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* only including part 1 files
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* removed unused function
Signed-off-by: Taejin Park <tango4j@gmail.com>
* fixed unused imports
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* divided nmesc_clustering.py into two and reflected first-pass comments
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* adding offline/online_clustering.py
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* fix code QL autocomment
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Removed unused imports
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Update nemo/collections/asr/parts/utils/online_clustering.py
Co-authored-by: Sean Naren <snarenthiran@nvidia.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Reflected comments
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* resolved code scanning issue
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Update nemo/collections/asr/parts/utils/offline_clustering.py
Co-authored-by: Sean Naren <snarenthiran@nvidia.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com>
Co-authored-by: Sean Naren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [STT] Add Esperanto (Eo) ASR Conformer-CTC and Conformer-Transducer models (#5639) (#5641)
* add stt_eo_conformer_ctc_large model
* stt_eo_conformer_transducer_large
Co-authored-by: Andrei Andrusenko <52885736+andrusenkoau@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Removed unused import
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Specify that filepaths need to be absolute
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* replaces any spaces in utt_id with dashes
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make hydra script callable by another script
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* do not specify default model or model_downsample_factor
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [Dockerfile] Remove AIS archive from docker image (#5629)
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Measure audio_sr from audio instead of needing to specify
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][ZH] Disambiguate polyphones with augmented dict and Jieba segmenter for Chinese FastPitch (#5541)
* Chinese TTS replaces default pypinyin dict
* Add jieba word segmenter as an option
Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>
Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make separate parameters for device of transcription and viterbi steps
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add mention of gecko
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [workflow] add exclude labels option to ignore cherry-picks in release changelog. (#5645)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][ZH] bugfix for the tutorial and add NGC CLI installation guide. (#5643) (#5647)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [Add] ASR+VAD Inference Pipeline (#5575)
Added offline ASR+VAD inference pipeline that matches with what's in RIVA, along with some feature-based ASR and classification datasets.
Signed-off-by: stevehuang52 <heh@nvidia.com>
Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* rename separator to ctm_grouping_separator and refactor
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Bert interleaved (#5556)
* Adding SP and SAR support Bert
* Adding Sequence parallel support to Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding Sequence parallel support to Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding SP and SAR support Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding SP and SAR support Bert
* Adding SP and SAR support Bert
* Adding Sequence parallel support to Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding Sequence parallel support to Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding Sequence parallel support to Bert
* Update bert_model.py
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
* Adding tests
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Addressing Eric's comments
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Addressing Eric's comments
* Fix bug fix sequence parallel and Interleaved
* Fix bug fix sequence parallel and Interleaved
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add duration padding support for RADTTS inference (#5650)
* Added duration padding support for RADTTS inference
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Co-authored-by: Kevin Shih <kshih@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add remove_blank_tokens_from_ctm parameter
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Dont save initial_silence line in CTM
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add DLLogger support to exp_manager (#5658)
* Add DLLogger support to exp_manager
Signed-off-by: Alexandre Milesi <milesial@users.noreply.github.com>
* Move dllogger to separate file and check import
Signed-off-by: Alexandre Milesi <milesial@users.noreply.github.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Remove unused import
Signed-off-by: Alexandre Milesi <milesial@users.noreply.github.com>
Signed-off-by: Alexandre Milesi <milesial@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* add minimum_timestamp_duration parameter
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* add suggestion about removing blanks to README
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* reorder args
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* clarify description of ctm_grouping_separator in README
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* update docstring
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][ZH] bugfix for ngc cli installation. (#5652) (#5664)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Port stateless timer to exp manager (#5584)
* Port stateless timer to exp manager
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fixes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fixes and remove from all megatron code
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fixes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Change message
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fix EMA restart by allowing device to be set by the class init (#5668)
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Remove SDP (moved to separate repo) - merge to main (#5630)
* Remove sdp files from tools folder
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add page to docs with new SDP location
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add interface for making amax reduction optional for FP8 (#5447)
* add TE interface for making amax reduction optional
Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS] add tts dict cust notebook (#5662)
* add tts dict cust notebook
Signed-off-by: ekmb <ebakhturina@nvidia.com>
* review
Signed-off-by: ekmb <ebakhturina@nvidia.com>
* fixed audio links
Signed-off-by: ekmb <ebakhturina@nvidia.com>
* remove old notebook
Signed-off-by: ekmb <ebakhturina@nvidia.com>
* fix typo
Signed-off-by: ekmb <ebakhturina@nvidia.com>
Signed-off-by: ekmb <ebakhturina@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [ASR] Audio processing base, multi-channel enhancement models (#5356)
* Audio processing base model, enc-mask-dec enhancement, tests and modules
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Addressed review comments
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Fixed CodeQL warnings
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Addressed PR comments
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Addressed PR comments:
- renamed AudioProcessingModel to AudioToAudioModel
- various small modifications
- updated unit tests
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Addressed comments
- Moved spectrogram to audio_preprocessing
- Renamed MultichannelFeatures
- Updated config and unit tests
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Expose ClusteringDiarizer device (#5681)
* Expose device for users to set
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Expose device for users to set
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add Beam Search support to ASR transcribe() (#5443)
* Add support for beam decoding via high level API.
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add ctc decoding section
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update ctc transcribe API to return results from beam search
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add argument to preserve arpa file
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update script to use hydra config, add some support for future compute timesteps, add doc for ctc decoding
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update eval script and doc to use new API
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add tests for ctc greedy decoding
Signed-off-by: smajumdar <titu1994@gmail.com>
* Address reviewer comments and add docstrings
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix changes and address comments
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Samuel Kriman <samuelkriman@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Propagate attention_dropout flag for GPT-3 (#5669)
* Propagate attention_dropout flag for GPT-3
Signed-off-by: Mikołaj Błaż <mblaz@nvidia.com>
* Add default to megatron_gpt_config
Signed-off-by: Mikołaj Błaż <mblaz@nvidia.com>
Signed-off-by: Mikołaj Błaż <mblaz@nvidia.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Eric H…
Kipok
added a commit
to Kipok/NeMo
that referenced
this pull request
Jan 31, 2023
* Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount * new structure for tts datasets in script folder Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * remove cmudict downloading Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * rename mixertts dataset, add vocoder dataset Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add libritts processing Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * update tts dataset and libritts get data Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix bugs in vocoder ds Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add ds * changed vits yaml * rm yaml * fix yaml and model * Added scaler * refactored yaml * managed to run in fp16 * refactoring Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix small bugs and add new todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix optimizers Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * Port Variational Inference with Adversarial Learning (VITS) to NeMo TTS (NVIDIA-NeMo#6) * Add vits files Add vits_losses.py, vits_modules.py and vits.py. * Move non-vits models to modules * Add vits.yaml * Add _loader to vits.py * Add basic template for vits * Update vits.yaml with vits parameters * Remove extra space * Add top level training script * Add some variables to vits yaml * Add forward and training methods * Fix imports * Added validation step * Log training losses * Update loss calls to use class attributes * Add VITS to models list * Fix all imports * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Fix imports for VITS * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Add parameters from original VITS config * Fix config file * Fix imports and generate spec from audio * Fix incorrect dimensions * Progress update * Fix loss * Fix cuda thing * Fix monotonic align import * Fix typos in vits.py * Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Jason <jasoli@nvidia.com> Signed-off-by: Jason <jasoli@nvidia.com> * make new commit Signed-off-by: Jason <jasoli@nvidia.com> * add copyright headers Signed-off-by: Jason <jasoli@nvidia.com> * style Signed-off-by: Jason <jasoli@nvidia.com> * rename README Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix style without vits_modules Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add numba code, fix style and add todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * small fix * fix some todos * added numba mas * added DDP sampler * specified versions * fixed for new librosa version * added feature loss * added IPA phonemizer * refactored IPA g2p * added vits losses * some ref * fix * added checkpointing * cp * cfg * merged some 1.8.0 fixes * plt fix * fix logging * fix checkpoint loading * refactored inference * fp32 run * update branch Signed-off-by: ericharper <complex451@gmail.com> * update package info Signed-off-by: ericharper <complex451@gmail.com> * new exp * update branch Signed-off-by: ericharper <complex451@gmail.com> * Restored tests previously disabled for 22.03 base (NVIDIA-NeMo#4109) Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * add augmentation to label models (NVIDIA-NeMo#4113) * add augmentation to label models Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * duration fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Call register_bert_model after assigning self.bert_model variable (NVIDIA-NeMo#4116) Signed-off-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> * Tutorial on ITN with Thutmose tagger and small fixes (NVIDIA-NeMo#4117) * 1. Add tutorial. 2. Move a function to fix import in tutorial. 3. Merge multiple spaces into one space in the final output Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fixes for code review Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * Add tutorial to tutorials.rst Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * cleaned up TN/ ITN doc (NVIDIA-NeMo#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Check implicit grad acc in GLUE dataset building (NVIDIA-NeMo#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update the default (NVIDIA-NeMo#4135) Signed-off-by: ekmb <ebakhturina@nvidia.com> * Draft: Fix restoring from checkpoint for case when `model.common_dataset_parameters.label_vocab_dir` is provided (NVIDIA-NeMo#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Do not create tmp directory Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix parameter name Signed-off-by: PeganovAnton <peganoff2@mail.ru> * finish cherry-pick op Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix labels errors Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove duplicate stage Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Change target branch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * fix typo (NVIDIA-NeMo#4140) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Fix/punctuation avoid overwritting tmp files (NVIDIA-NeMo#4144) * Add draft of fixing tmp files overwritting Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Use built-in tempfile library Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix code style Signed-off-by: PeganovAnton <peganoff2@mail.ru> * bug_fix_diarization_manifest_creation (NVIDIA-NeMo#4125) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> * fix doc (NVIDIA-NeMo#4146) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Tacotron2 retrain (NVIDIA-NeMo#4103) * fix yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Fix for new TTSDataset class Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * inference fix Signed-off-by: treacker <emshabalin@yandex.ru> * removed old code Signed-off-by: treacker <emshabalin@yandex.ru> * updated parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * reverted version update Signed-off-by: treacker <emshabalin@yandex.ru> * refactored parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * Updated Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Update Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Update tacotron.yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * cleaned up TN/ ITN doc (NVIDIA-NeMo#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: treacker <emshabalin@yandex.ru> * Check implicit grad acc in GLUE dataset building (NVIDIA-NeMo#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Fixed jenkins Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * Multiprocess improvements (NVIDIA-NeMo#4127) * initial commit Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * start fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated scp to filelist Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * WaveGlow input type fixes (NVIDIA-NeMo#4151) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * notebooks' link, typo and import fix (NVIDIA-NeMo#4158) * redo missing pr 4007 Signed-off-by: fayejf <fayejf07@gmail.com> * remove extremely unreliable links Signed-off-by: fayejf <fayejf07@gmail.com> * Thutmose tagger bug fixes (NVIDIA-NeMo#4162) * add pretrained ngc model, small fixes Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * 1. fix typos. 2. write magic functions without space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add example of inference with pretrained model Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * changed model location to nemo Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * style fix Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * update speaker docs (NVIDIA-NeMo#4164) * update speaker docs Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * chunks -> segments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Khz -> kHz Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * changed to vits g2p * refactoring * added cosineLR * Updated whitelist path * added vanilla torch grad scaler * Fixed lightning version * added warmup and wd * switched to cosineLR * refactored data classes for vits * some fixes * fixed import * changeg train loop * fixed scheduler bug * refactoring for exps * Refactored loss logic * Ref for exps * added coqui stuff * exps * bugfix * added side file * bugfix * reverted * fixed sampler behaviour * updated for ptl 1.7.2 * refactored dataloader func * some cleaning * reverted to vanilla loss * modified for pickling * added dataset class * fixed torch version * added autocast for fp training * removed coqui files * 'Fixed tokenizer' * Fix tokenizer * update branch Signed-off-by: ericharper <complex451@gmail.com> * Fix link to inference notebook (NVIDIA-NeMo#5247) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Update ASR scores table (NVIDIA-NeMo#5254) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Fix links to speaker identification notebook (NVIDIA-NeMo#5260) Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> * Minor typo fixes in TTS tutorial (NVIDIA-NeMo#5266) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Pcla tutorial fixes (NVIDIA-NeMo#5271) * Fixed typos Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed cell type and tatoeba reference Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed typo Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed branch variable Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix bug into Dialogue tutorial (NVIDIA-NeMo#5277) * Typo fix (NVIDIA-NeMo#5288) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix dialogue tutorial bug (NVIDIA-NeMo#5297) * set add_pooling_layer=False for huggingface bert model * remove add_pooling_layer=False and set find_unused_parameters=True * set num_prompt_tokens to 0 for huggingface * small bugfix for r1.13.0 (NVIDIA-NeMo#5310) * typo fix Signed-off-by: fayejf <fayejf07@gmail.com> * udpate transcribe Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: fayejf <fayejf07@gmail.com> * Add italian model checkpoints (NVIDIA-NeMo#5316) Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> * [STT] Add Ru ASR Conformer-CTC and Conformer-Transducer (NVIDIA-NeMo#5340) * [STT] Add stt_ru_conformer_ctc_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [STT] Add stt_ru_conformer_transducer_large Add stt_ru_conformer_transducer_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Pcla tutorial fixes (NVIDIA-NeMo#5313) * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * moved `create_text_and_labels` to token_classification_utils.py Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * a lot of refactoring * strict ptl version * strict ptl version * reverted plt version * Added base text2audio class * Fix issue with HF Model upload tutorial (NVIDIA-NeMo#5359) * Add Gradio App to ASR Docs (NVIDIA-NeMo#5270) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> (cherry picked from commit e4b6a38) * Fix issue with normalized config for dataset name Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * tutorial fixes (NVIDIA-NeMo#5354) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Add SDP documentation (NVIDIA-NeMo#5274) * Add details to SDP README.md Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to WriteManifest processor Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to CreateInitialManifestMLS Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ModifyManifestTextProcessor docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ASRInference docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add base_processor docstrings Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add minimal SDP docs page Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Update tools/speech_dataset_processor/README.md Co-authored-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> * Write simple README for SDP and move complex explanations to docs Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove incorrect type hints Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Make config example less confusing Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Fix typo Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Clarify that YAML file is config file in README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove unused imports Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove SDP docs for now Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove links to docs in SDP README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> * [Bugfix] Added rm -f / wget- nc command in multispeaker sim notebook to r1.13.0 (NVIDIA-NeMo#5375) * Fix minor error in notebook Signed-off-by: Taejin Park <tango4j@gmail.com> * changed branch name in tutorial notebook Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Taejin Park <tango4j@gmail.com> * Rename Speech Dataset Processor to Speech Data Processor (NVIDIA-NeMo#5378) Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * fix for num worker 0 causing issues in losses after 1 epoch (NVIDIA-NeMo#5379) * Fixed bug in notebook (NVIDIA-NeMo#5382) Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> * Force MHA QKV onto fp32 (NVIDIA-NeMo#5391) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Added scheduling variety * ref * Fix for prompt table restore error (NVIDIA-NeMo#5393) * Fix for prompt table restore error Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added more saftey checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Added more condition checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Virginia Adams <vadams@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Fix args (NVIDIA-NeMo#5410) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * bugfix * import tests * Add temporary fix for CUDA issue in Dockerfile (NVIDIA-NeMo#5421) Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> * Megatron Export Update (NVIDIA-NeMo#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added comment Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * updated del onnx_model place Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> * disable pc test (NVIDIA-NeMo#5426) Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: ekmb <ebakhturina@nvidia.com> * Fix GPT generation when using sentencepiece tokenizer (NVIDIA-NeMo#5413) * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Disable sync_batch_comm in validation_step for GPT (NVIDIA-NeMo#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <complex451@gmail.com> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Empty Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Comment out test Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> * Revert "Add temporary fix for CUDA issue in Dockerfile (NVIDIA-NeMo#5421)" (NVIDIA-NeMo#5431) This reverts commit 0718b17. * Revert workaround for T5 that sets number of workers to 0 & sync_batch_comm=False (NVIDIA-NeMo#5420) * Revert workers workaround Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix in config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Fixed discrepancies * updated Jenkisfile * updated Jenkisfile * Cleaning * fixed the onnx bug in conformer for non-streaming models. (NVIDIA-NeMo#5242) (NVIDIA-NeMo#5446) Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> * Set sync_batch_comm in other places (NVIDIA-NeMo#5448) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Radtts 1.13 (NVIDIA-NeMo#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (NVIDIA-NeMo#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Radtts 1.13 plus (NVIDIA-NeMo#5457) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (NVIDIA-NeMo#5358) * Fixing RADTTS training - removing view buffer and fixing accuracy issue * Fixes for Torchscript/Triton * Added autocast to radtts UT * using cuda() for training example Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Add num layers check (NVIDIA-NeMo#5470) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Change to kwargs (NVIDIA-NeMo#5475) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (NVIDIA-NeMo#5339) (NVIDIA-NeMo#5478) * Initial refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for eval Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove comments Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Minor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove old comment Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (NVIDIA-NeMo#5480) * updated export_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (NVIDIA-NeMo#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * minor bug fix (NVIDIA-NeMo#5521) Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added set_start_method + function param bugfix (NVIDIA-NeMo#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <complex451@gmail.com> * remove notebook (NVIDIA-NeMo#5548) Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: ericharper <complex451@gmail.com> * Remove broadcast (NVIDIA-NeMo#5558) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * cleaning * Fix all gather while writing to a file during T5 finetuning (NVIDIA-NeMo#5561) * Gather from data parallel only instead of all ranks Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update readme Signed-off-by: ericharper <complex451@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added copyright * fixed imports * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * cleaning * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fixed filesize check * last cleaning Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated cmudict path * fixed merge bug Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * warnings fix * fix warnings Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * storing * updated version Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * update Jenkinsfile versions Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed issues Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed more issues * more fixes Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * added experimental tag * Clarification updates Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * docstring fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * imports fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix typo Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * excessive comtutations fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * typecheck fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Small refactoring * Small refactoring Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * reversed exp_manager params Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Fixed call for new function signature Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> Signed-off-by: Jason <jasoli@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: PeganovAnton <peganoff2@mail.ru> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> Co-authored-by: jasonjjl1999 <jasonjjl1999@gmail.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Oktai Tatanov <oktai.tatanov@gmail.com> Co-authored-by: jasonjjl1999 <43978361+jasonjjl1999@users.noreply.github.com> Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: Jason <jasoli@nvidia.com> Co-authored-by: ericharper <complex451@gmail.com> Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: Ramanathan Arunachalam <ramanathan.arun@rutgers.edu> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Evelina <10428420+ekmb@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: Jocelyn <jocelynh@nvidia.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Sean Naren <snarenthiran@nvidia.com> Co-authored-by: Matvei Novikov <mattyson.so@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> Co-authored-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Taejin Park <tango4j@gmail.com> Co-authored-by: Adi Renduchintala <108822655+arendu@users.noreply.github.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: yaoyu-33 <54727607+yaoyu-33@users.noreply.github.com> Co-authored-by: David <amosalla@asu.edu> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
titu1994
pushed a commit
to titu1994/NeMo
that referenced
this pull request
Mar 24, 2023
* update branch Signed-off-by: ericharper <complex451@gmail.com> * Rename Speech Dataset Processor to Speech Data Processor (NVIDIA-NeMo#5378) Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Megatron Export Update (NVIDIA-NeMo#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added comment Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * updated del onnx_model place Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> * Disable sync_batch_comm in validation_step for GPT (NVIDIA-NeMo#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <complex451@gmail.com> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Empty Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Comment out test Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> * Radtts 1.13 (NVIDIA-NeMo#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (NVIDIA-NeMo#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (NVIDIA-NeMo#5339) (NVIDIA-NeMo#5478) * Initial refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for eval Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove comments Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Minor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove old comment Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (NVIDIA-NeMo#5480) * updated export_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (NVIDIA-NeMo#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * added set_start_method + function param bugfix (NVIDIA-NeMo#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <complex451@gmail.com> * remove notebook (NVIDIA-NeMo#5548) Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: ericharper <complex451@gmail.com> * update readme Signed-off-by: ericharper <complex451@gmail.com> * update branch Signed-off-by: ericharper <complex451@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * revert Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: David <amosalla@asu.edu> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
titu1994
added a commit
to titu1994/NeMo
that referenced
this pull request
Mar 24, 2023
* Merge r1.13.0 main (#5570)
* update branch
Signed-off-by: ericharper <complex451@gmail.com>
* Rename Speech Dataset Processor to Speech Data Processor (#5378)
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Megatron Export Update (#5343)
* export update for Megatron + change ORT optimization
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* updated export_utils to use autocast instead of manually casting >:/
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* removed dtype from LayerNorm
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* added comment
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* reverting changes on FloatCast
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* Cherry-picked changes from megatron-norm
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
* updated asr_model import to cast_utils
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* updated del onnx_model place
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* changed ort optimization to basic -> temp fix
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com>
* Disable sync_batch_comm in validation_step for GPT (#5397)
* disable sync_batch_comm in validation_step
Signed-off-by: ericharper <complex451@gmail.com>
* Read sync_batch_comm from config or default to False
Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com>
* Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error
Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com>
* Empty
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Comment out test
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com>
Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
* Radtts 1.13 (#5451)
* [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358)
* [TTS] add CI test for RADTTS training recipe.
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
* Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) (#5478)
* Initial refactor
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Resolve config before passing to load_from_checkpoint
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fixes for model parallel and nemo restore
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fixes for eval
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Revert config changes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Refactor
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix typo
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Remove comments
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Minor
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix validation reconfiguration
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Remove old comment
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fixes for test_ds
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* export_utils bugfix (#5480)
* updated export_utils
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* Export fixes for Riva (#5496)
* Export fixes for Riva
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
* Cleaning up training_utils
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
* added set_start_method + function param bugfix (#5539)
* added set_start_method + function param bugfix
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* upper bound torchmetrics
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Signed-off-by: ericharper <complex451@gmail.com>
Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: ericharper <complex451@gmail.com>
* remove notebook (#5548)
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: ericharper <complex451@gmail.com>
* update readme
Signed-off-by: ericharper <complex451@gmail.com>
* update branch
Signed-off-by: ericharper <complex451@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
* revert
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com>
Co-authored-by: David <amosalla@asu.edu>
Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com>
Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com>
Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com>
Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Optimized loop and bugfix in SDE (#5573)
- Fixed bug with loading custom data attributes from JSON in Speech Data Explorer
Signed-off-by: George Zelenfroynd <gzelenfroind@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Update torchmetrics (#5566)
* add task arg
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
* update state
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
Co-authored-by: Taejin Park <tango4j@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* remove useless files. (#5580)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* add initial NFA code
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make use of the specified device during viterbi decoding
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fix CodeQL notes
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fix CodeQL warning
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add an option to defer data setup from ``__init__`` to ``setup`` (#5569)
* Add an option to defer dataloader setup from __init__ to setup
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Updated doc
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make utt_id specified by number of parts of audio_filepath user wishes to use
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* remove audio_sr TODO - reduce risk of silent bugs
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add check that model is CTC
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Remove unused import
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Text generation improvement (UI client, data parallel support) (#5437)
* Squashed commit of the following:
commit a5e124f34be31bd6eafe5e5fdf5bedcd0d50915c
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Thu Oct 13 15:07:42 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit 35b424044fe80c3081e7756ab21244f701716f7e
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 08:04:49 2022 -0700
get rid of base
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 2955210e2311791543538cfbb5ad26b79414c954
Merge: d52edef8c eaf6757ca
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 13:17:02 2022 +0000
Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt
commit d52edef8cd7b36593838fb270047e80f8ccb652e
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 13:16:24 2022 +0000
align with main
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit eaf6757ca5be8e099492f57c81d984429b0ad49c
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Thu Oct 13 13:12:11 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit c4b86d97626ea0721bf8fb4c0a45dec5becc94c9
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 13:10:58 2022 +0000
same as main
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit e335de51bcc0d681c58b568c3d8c238bc5687c3b
Merge: c231086e0 4463a9fe9
Author: Yi Dong <yidong@nvidia.com>
Date: Thu Oct 13 13:08:09 2022 +0000
Merge branch 'main' into universal_prompt
commit c231086e057f1efaa915f691d84664cb3d5aad85
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Wed Oct 12 19:59:12 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit 6a821a4b49a23dd3408a706a2a3dd393149b0bb1
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 19:56:17 2022 +0000
default to pad
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 9d908e39fef1beed9ba2da4d1a6806161eb7ef25
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 19:55:44 2022 +0000
add the option to pad the tokens
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 876dc395b43fdeeaa2bcbbe13c76523633764c33
Merge: fbb0f4035 fe3c77ee9
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 19:20:47 2022 +0000
Merge branch 'fix_global_init' into universal_prompt
commit fe3c77ee93ab6cf3ea152db68cb6beefcac2a392
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 18:59:49 2022 +0000
fix import again
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit fbb0f4035c6cd6bfefed50a20605503de8c1dccb
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Wed Oct 12 16:00:24 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit 372ca8c0d7988f2339b15888dc72aa21f4fb6937
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 15:58:32 2022 +0000
enable server
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit cbe05d9fbc978f812cfbb671f45f147f300713c4
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 13:07:28 2022 +0000
fix comment error
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 1948048922e726ec6131e44b1a745389f18d4ef2
Merge: 232c2cce3 984f5c09a
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 13:05:30 2022 +0000
Merge branch 'fix_global_init' into universal_prompt
commit 232c2cce34d7a8b902da406706f3dd9b39475091
Merge: 34c8a68df 658243fb6
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 12 12:50:00 2022 +0000
Merge branch 'fix_global_init' into universal_prompt
commit 984f5c09a6dbf1d1fb5aa30ed9b0df188e66a50f
Merge: 658243fb6 3fda5de46
Author: Yi Dong <43824965+yidong72@users.noreply.github.com>
Date: Wed Oct 12 08:42:11 2022 -0400
Merge branch 'main' into fix_global_init
commit 658243fb6580191b5d60edd30cde16dcc23cbb85
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 12 12:40:57 2022 +0000
fix import error
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 8e0fe1cad05ec288ec122b3cd0e139a96872e08c
Author: Yi Dong <doyend@gmail.com>
Date: Tue Oct 11 22:44:12 2022 +0000
update the fused kernel
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 536cf6bef9447b75843fad630729c47a2fba35f3
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 11 14:44:52 2022 -0700
add the missing file
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 1b437ec41dc5e354453ce0a089bca0171cbcb6c2
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 11 14:43:14 2022 -0700
fix fused softmax
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 7813f60e05f9783af61f8c14ec1cb0c6c4f1f263
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 11 14:16:48 2022 -0700
move global step to base
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 34c8a68df084b18d377e84415d9f07b2cd6673dd
Author: Yi Dong <doyend@gmail.com>
Date: Thu Oct 6 13:50:11 2022 +0000
fix pipeline for eval
Signed-off-by: Yi Dong <doyend@gmail.com>
commit eee5d38218f26660c3ffebe9f615c850c80a1f0d
Author: Yi Dong <doyend@gmail.com>
Date: Thu Oct 6 13:48:22 2022 +0000
fix for pipleline parallel
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 323bca73e7ef6099ee79c0a2fffac7b709ed6c5d
Merge: 125e49947 e3b4c4d1f
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 19:29:13 2022 +0000
Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt
commit 125e4994760448ff75dd9328395813eda1c87547
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 19:29:04 2022 +0000
add share option
Signed-off-by: Yi Dong <doyend@gmail.com>
commit e3b4c4d1f7346c9fa596f3cca6d4df0a9e05c368
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 5 11:43:48 2022 -0700
make sure consolidation works
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit a5c833964ecf05dc460ca1da69275c4019742150
Merge: 2a07ab52d abcb74be2
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 18:40:29 2022 +0000
Merge branch 'universal_prompt' of github.com:NVIDIA/NeMo into universal_prompt
commit 2a07ab52d95f15ba666823028c69e23825666c05
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 18:40:23 2022 +0000
added requirement
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 3abecd9dd1611993a87c537636abe7f7e6a9b04c
Author: Yi Dong <doyend@gmail.com>
Date: Wed Oct 5 18:39:42 2022 +0000
added a simple web server
Signed-off-by: Yi Dong <doyend@gmail.com>
commit abcb74be2caf1cdec40eb9ba2be4dde4d45a3b4b
Author: Yi Dong <yidong@nvidia.com>
Date: Wed Oct 5 06:54:12 2022 -0700
fix empty val loss
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit b8eb92ac4a0d665570af75e34c9ba3c2e2420c26
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 19:25:30 2022 -0700
text gen working
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit d59f3e3f3a6fd19736d1c5706fed65a3dd4049ba
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 16:08:40 2022 -0700
first change
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 59d077585e6962a669b824af58f64e8a0bea6547
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 15:00:40 2022 -0700
revert
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 12a0f3902d99e9179403644bd951c045df716ca7
Author: Yi Dong <doyend@gmail.com>
Date: Tue Oct 4 21:26:23 2022 +0000
init imp
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 62a15dfd943cc48be495ac61b9f2f00995775c5f
Merge: 82c90d2cd e0cc6b767
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 11:58:26 2022 -0700
Merge branch 'main' into universal_prompt
commit 82c90d2cd0fd156f16a4b899f8c741d598f33990
Author: Yi Dong <yidong@nvidia.com>
Date: Tue Oct 4 11:17:13 2022 -0700
add sync
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 9819b703eef877d90cd1257bf3610c69de9b4d7e
Author: Yi Dong <yidong@nvidia.com>
Date: Sun Oct 2 17:52:34 2022 -0700
fix save model
Signed-off-by: root <root@luna-0197.selene.nvidia.com>
commit e4937e2fc5fb7d70754c97668416e4a69c3079fe
Author: Yi Dong <yidong@nvidia.com>
Date: Sat Oct 1 18:56:09 2022 +0000
working
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit b73b06d1c7cf5417a6d87cb33d8ed83a57e38b7b
Author: Yi Dong <yidong@nvidia.com>
Date: Sat Oct 1 17:34:03 2022 +0000
calcuate the mask
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 9db3bc13eb65a94a475b837603351da68e3745bc
Author: Yi Dong <yidong@nvidia.com>
Date: Fri Sep 30 23:26:32 2022 +0000
fix bug in datasets
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit f289900375d4412f53f8110be00fec6587627550
Author: Yi Dong <yidong@nvidia.com>
Date: Fri Sep 30 22:29:40 2022 +0000
update the code
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 8e28a1f208aabaab72dbe769e72756baada04d99
Author: Yi Dong <yidong@nvidia.com>
Date: Fri Sep 30 21:52:52 2022 +0000
added new ds
Signed-off-by: Yi Dong <yidong@nvidia.com>
commit 8d41315bab7ce90e200a8a7d1023c34f8e046897
Author: Yi Dong <doyend@gmail.com>
Date: Fri Sep 30 18:57:09 2022 +0000
added new files
Signed-off-by: Yi Dong <doyend@gmail.com>
commit 984e0e94e15e16323c1ba1ca2efeabd84f69463f
Merge: cbe8b7ab1 fa6cd8588
Author: Yi Dong <doyend@gmail.com>
Date: Thu Sep 29 21:43:29 2022 +0000
Merge branch 'llm-prompt-learning-improvements' into universal_prompt
commit fa6cd858839277939446afe7275976078d54c512
Author: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Date: Thu Sep 29 16:47:30 2022 +0000
[pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
commit 78ba46e5d6fde1be53c08e1e30a54cce59824be0
Merge: 7d6d46742 8d670bc77
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Thu Sep 29 09:43:27 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit 7d6d46742170a66758287a207d67e1b1bfd15613
Author: Virginia Adams <vadams@nvidia.com>
Date: Thu Sep 29 16:42:43 2022 +0000
Removed inference step and added sentence peice check to predict step
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 20fd265acd6f7f9912cf52155fe66ccfa6b201a2
Author: Virginia Adams <vadams@nvidia.com>
Date: Thu Sep 29 15:26:32 2022 +0000
fixed first stage check for pipeline parallel T5 pt
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 3637be2b258c8d9028856f9971edb7da4a8121f0
Merge: a3ea722fd 986a76612
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Wed Sep 28 10:23:30 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit a3ea722fdc12fbcc5989b76ef5643a574b763bc4
Merge: 770967a52 971485ce7
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Mon Sep 26 13:35:52 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit 770967a5251a474b6dcc2d44bf9a2076adbcb604
Merge: d23bf6c30 e3ac280a8
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Mon Sep 26 10:17:03 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit d23bf6c30acc0e3f6af9b4e24547669866a34d62
Merge: de6a31651 333d2b749
Author: Virginia Adams <vadams@nvidia.com>
Date: Mon Sep 26 10:05:16 2022 -0700
Merge branch 'llm-prompt-learning-improvements' of https://github.com/NVIDIA/NeMo into llm-prompt-learning-improvements
commit de6a31651e63d88a42b971794d93f18ff5a3cdff
Author: Virginia Adams <vadams@nvidia.com>
Date: Mon Sep 26 17:00:53 2022 +0000
Updated PP check to be on first stage pipeline only
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 333d2b7498e6742ce66436f733c980a74616900c
Merge: 592c0986a a39fc925a
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Fri Sep 23 16:11:21 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit 592c0986a476a91b57b8605d7b70830d7acfa021
Author: Virginia Adams <vadams@nvidia.com>
Date: Fri Sep 23 23:08:41 2022 +0000
Fixed unused import and CI test bug
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit ea9cd82d85638bc60ae4ad7ef105db931c8e3455
Merge: ce4b72c8c b566c2d0e
Author: Virginia Adams <vadams@nvidia.com>
Date: Fri Sep 23 18:57:25 2022 +0000
Merge branch 'llm-prompt-learning-improvements' of https://github.com/NVIDIA/NeMo into llm-prompt-learning-improvements
commit ce4b72c8c52f32be336e323dd78a38089edc3e7c
Author: Virginia Adams <vadams@nvidia.com>
Date: Fri Sep 23 18:57:16 2022 +0000
Switch to import from base class
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit b566c2d0e35a068f758fd1310bc620a47be4590b
Merge: 6621f2854 e872061ac
Author: Virginia Adams <78445382+vadam5@users.noreply.github.com>
Date: Fri Sep 23 10:09:03 2022 -0700
Merge branch 'main' into llm-prompt-learning-improvements
commit 6621f28543828a48484a5637f6c9f3ccb23a5b02
Author: Virginia Adams <vadams@nvidia.com>
Date: Wed Sep 14 20:47:35 2022 +0000
python format fix
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 8deafc8987b6af5f7b99a250310f57a40198c37f
Author: Virginia Adams <vadams@nvidia.com>
Date: Wed Sep 14 20:28:02 2022 +0000
Save .nemo on new best val score
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 761bd36969cb465d6a129e9eee6ce1f883d3cf41
Author: Virginia Adams <vadams@nvidia.com>
Date: Wed Sep 14 18:03:19 2022 +0000
Added automatic checkpoint to nemo file method
Signed-off-by: Virginia Adams <vadams@nvidia.com>
commit 3be4ed57b6cd3ddfe4876d78650dfe8fe794598b
Author: Virginia Adams <vadams@nvidia.com>
Date: Wed Sep 14 02:11:56 2022 +0000
Make GPT use base prompt learning model class:
Signed-off-by: Virginia Adams <vadams@nvidia.com>
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix LGTM
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix validation
Signed-off-by: Yi Dong <yidong@nvidia.com>
* change for the lm eval
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* make text generation work in data parallel environment
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* implement the service with rest service
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* surpress log
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fix
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fixes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Update config
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Restore function needed for NMT
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* handles no answer only
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Fix config
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* added knn to web
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix lgtm.com comments
Signed-off-by: Yi Dong <yidong@nvidia.com>
* output the retrieved context
Signed-off-by: Yi Dong <yidong@nvidia.com>
* allow no neighbor query
Signed-off-by: Yi Dong <yidong@nvidia.com>
* remove the imports
Signed-off-by: Yi Dong <yidong@nvidia.com>
* warn only once
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Change output file format from JSON to JSONL
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* new t0 dataset
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Add T0 data preproc scripts
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Merge and multiprocessing
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix for is_correct
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* fix epoch > 2
Signed-off-by: Yi Dong <yidong@nvidia.com>
* handles multiple dataloader
Signed-off-by: Yi Dong <yidong@nvidia.com>
* remove template
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Refactor T0 dataset
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Add script to merge train folder into individual training files to minimize number of blends
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* added on the fly service
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add combo instance
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* added combo service
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* send weights back to server
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* fix index store
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Minor changes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add reset button
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add add eos
Signed-off-by: Yi Dong <yidong@nvidia.com>
* use a seperate bert service
Signed-off-by: Yi Dong <yidong@nvidia.com>
* no loss of accuracy
Signed-off-by: Yi Dong <yidong@nvidia.com>
* pin the gradio version
Signed-off-by: Yi Dong <yidong@nvidia.com>
* Remove bin compat
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fix header lines
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* evaluate based on text generation
Signed-off-by: Yi Dong <yidong@nvidia.com>
* exact match result aggregation
Signed-off-by: Yi Dong <yidong@nvidia.com>
* working SP and SA
Signed-off-by: Yi Dong <yidong@nvidia.com>
* sync
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix checkpoint
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix eval
Signed-off-by: Yi Dong <yidong@nvidia.com>
* backup states
Signed-off-by: Yi Dong <yidong@nvidia.com>
* backup states reset
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix the bug
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix evaluation for sentence piece
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix a bug
Signed-off-by: Yi Dong <yidong@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* potential fix in the future
Signed-off-by: Yi Dong <yidong@nvidia.com>
* remove the universal codes
Signed-off-by: Yi Dong <yidong@nvidia.com>
* remove universal strategy
Signed-off-by: Yi Dong <yidong@nvidia.com>
* address reviewer comment
Signed-off-by: Yi Dong <yidong@nvidia.com>
Signed-off-by: Yi Dong <yidong@nvidia.com>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add align function docstrings and make most args optional
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Remove redundant returns of viterbi and log probs matrices
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Rename h# to <initial_silence>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Update manifest format description in README
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* always remove any spaces from utt_id
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Patch the hanging of threads on very large stderr (#5589) (#5590)
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* O2 style amp for gpt3 ptuning (#5246)
* enable amp o2 plugin
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* only create master param if param requires gradient
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* remove pytorch autocast
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* Update optimizer_with_main_params.py
Signed-off-by: JimmyZhang12 <67203904+JimmyZhang12@users.noreply.github.com>
* create master grad only if param group requires grad
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
* fix grad scaler for pp > 1
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
Signed-off-by: JimmyZhang12 <67203904+JimmyZhang12@users.noreply.github.com>
Co-authored-by: Jimmy Zhang <jiemingz@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Better patch hydra (#5591) (#5592)
* Readd buffereing and thread drain to Hydra Launcher
Signed-off-by: smajumdar <titu1994@gmail.com>
* Readd buffereing and thread drain to Hydra Launcher
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Yet another fix with hydra multirun (#5594) (#5595)
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Somshubra Majumdar <titu1994@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add RETRO model documentation (#5578)
* added retro doc
Signed-off-by: Yi Dong <yidong@nvidia.com>
* finish data part
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added the data format
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added training script
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added training and evaluation steps
Signed-off-by: Yi Dong <yidong@nvidia.com>
* edit the text
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added the images
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix beginning
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix the grammar
Signed-off-by: Yi Dong <yidong@nvidia.com>
* trim it down
Signed-off-by: Yi Dong <yidong@nvidia.com>
* add wandb option
Signed-off-by: Yi Dong <yidong@nvidia.com>
* add reference
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix path
Signed-off-by: Yi Dong <yidong@nvidia.com>
* added the parameters table
Signed-off-by: Yi Dong <yidong@nvidia.com>
* fix section
Signed-off-by: Yi Dong <yidong@nvidia.com>
Signed-off-by: Yi Dong <yidong@nvidia.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fix: setup_multiple validation/test data (#5585)
Fix: setup_multiple validation/test data (#5585)
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Move to optimizer based EMA implementation (#5169)
* Move to optimizer
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Fix replacing weights
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Allow swapping of weights be optional
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Save 2 models
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Use different hook
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Expose cpu device
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add clause to see if this fixes issue with O2 optimizer
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Try to get O2 working
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* WIP
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Fixes
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Fixes to tests
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add guard
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Remove import
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add guard
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add comment
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Remove overwrite
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Add BatchNorm, currently tests fail
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Fix tests/functionality for batch norm
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Get rid of NLP changes
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* AIStore for ASR datasets (#5462)
AIStore for ASR datasets
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add support for MHA adapters to ASR (#5396)
* Convert AbstractAdapterModule to AbstractAdapterMixin
Signed-off-by: smajumdar <titu1994@gmail.com>
* Temporary fixes to new signature of mixin
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add adapter util for constants, add all mha adapters.
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update name of function
Signed-off-by: smajumdar <titu1994@gmail.com>
* Roll back changes to convASR
Signed-off-by: smajumdar <titu1994@gmail.com>
* Convert AbstractAdapterModule to AbstractAdapterMixin
Signed-off-by: smajumdar <titu1994@gmail.com>
* First draft of Conformer support for MHA attention
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add some preliminary tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add support for projection of the hidden dimension for attention
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add support for squeezeformer
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update train adapter config
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add tests for squeezeformer and unit tests for new modules
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update config for hp search,set limits on modules for conformer and squeezeformer, update adapter mixin, add cache to import_from_class_path
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update location of adapters
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add pre_norm for proper attention learning, Fix the issue with nan/inf in pos_bias_u and pos_bias_v
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update expmanager to clean up checkpoints
Signed-off-by: smajumdar <titu1994@gmail.com>
* Fix style
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add docstrings and update tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add docstrings and update tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add docstrings and update tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Update training scripts
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update config and docs
Signed-off-by: smajumdar <titu1994@gmail.com>
* Expose nemo delete function
Signed-off-by: smajumdar <titu1994@gmail.com>
* Correct adapter partial state saving
Signed-off-by: smajumdar <titu1994@gmail.com>
* Correct a bug with state management of adapter tokens
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Pull down EMA test
Signed-off-by: smajumdar <titu1994@gmail.com>
* Correct name of adapter module utility class
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Remove unused TTS eval functions w/ pesq and pystoi dependencies (#5605) (#5606)
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com>
Co-authored-by: Jocelyn <jocelynh@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Create separator parameter
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Call align function with hydra config
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* update usage example
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Update Dockerfile (#5614) (#5616)
Pinned to use `numba==0.53.1` to avoid crashing in training with `num_workers > 0`. This is just a temporary workaround, still need to fix it in the future.
Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Co-authored-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make separate pretrained_name and model_path parameters
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* make "optional" tags bold in markdown
Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Move non-main functions to utils dir
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Temp workaround: Disable test with cache_audio=True since it is failing in CI (#5607) (#5615)
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Co-authored-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS] fix ranges of char set for accented letters. (#5607)
* [TTS] fix ranges of char set for accented letters.
* remove digits pattern and added unit tests for math operators.
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Change success message to reduce confusion (#5621)
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Update documentation and tutorials for Adapters (#5610)
* Improve docs for adapter and tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* Improve docs for adapter and tests
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Update test
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Rename test file
Signed-off-by: smajumdar <titu1994@gmail.com>
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS] add type hints and change varialbe names for tokenizers and g2p (#5602)
* [TTS] add type hints and change variable names for tokenizers and g2p
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* 1. Added missing import for gather_objects. (#5627)
Signed-off-by: Micha Livne <mlivne@nvidia.com>
Signed-off-by: Micha Livne <mlivne@nvidia.com>
Co-authored-by: Micha Livne <mlivne@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][ZH] add fastpitch and hifigan model NGC urls and update NeMo docs. (#5596) (#5625)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fixed RadTTS unit test (#5572)
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* remove tests (#5633)
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: ericharper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][DOC] add notes about automatic conversion to target sampling rates. (#5624) (#5634)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Conformer local attention (#5525)
* local attn and merge
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* optional
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* override
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* incorporate comments
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* update
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* fix
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* comment
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* changes, test
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* changes
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* check att context
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* readme link
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* utils
Signed-off-by: sam1373 <samuelkriman@gmail.com>
* update
Signed-off-by: sam1373 <samuelkriman@gmail.com>
Signed-off-by: sam1373 <samuelkriman@gmail.com>
Signed-off-by: Samuel Kriman <samuelkriman@gmail.com>
Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add core classes and functions for online clustering diarizer part 1 (#5526)
* Add core classes and functions for online clustering diarizer
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* add audio to labels code
Signed-off-by: Taejin Park <tango4j@gmail.com>
* resolve type errors
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* added unit=tests for very short audio
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Filled all missing docstrings
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* resolved conflict and added missing docstrings
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fixed unit-test errors
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* fix the wrongly added file - megatron_gpt_model.py
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Fix wrongly included file - megatron_gpt_model.py
Signed-off-by: Taejin Park <tango4j@gmail.com>
* resolve code quality issue
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Fixed unit-test errors and bugs
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* changed total_sec for offline_clustering toy_data in unit-tests
Signed-off-by: Taejin Park <tango4j@gmail.com>
* fixed merging index offset bug
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* only including part 1 files
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* removed unused function
Signed-off-by: Taejin Park <tango4j@gmail.com>
* fixed unused imports
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* divided nmesc_clustering.py into two and reflected first-pass comments
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* adding offline/online_clustering.py
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* fix code QL autocomment
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Removed unused imports
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Update nemo/collections/asr/parts/utils/online_clustering.py
Co-authored-by: Sean Naren <snarenthiran@nvidia.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Reflected comments
Signed-off-by: Taejin Park <tango4j@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* resolved code scanning issue
Signed-off-by: Taejin Park <tango4j@gmail.com>
* Update nemo/collections/asr/parts/utils/offline_clustering.py
Co-authored-by: Sean Naren <snarenthiran@nvidia.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>
Signed-off-by: Taejin Park <tango4j@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com>
Co-authored-by: Sean Naren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [STT] Add Esperanto (Eo) ASR Conformer-CTC and Conformer-Transducer models (#5639) (#5641)
* add stt_eo_conformer_ctc_large model
* stt_eo_conformer_transducer_large
Co-authored-by: Andrei Andrusenko <52885736+andrusenkoau@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Removed unused import
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Specify that filepaths need to be absolute
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* replaces any spaces in utt_id with dashes
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make hydra script callable by another script
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* do not specify default model or model_downsample_factor
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [Dockerfile] Remove AIS archive from docker image (#5629)
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Measure audio_sr from audio instead of needing to specify
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][ZH] Disambiguate polyphones with augmented dict and Jieba segmenter for Chinese FastPitch (#5541)
* Chinese TTS replaces default pypinyin dict
* Add jieba word segmenter as an option
Signed-off-by: Yuekai Zhang <zhangyuekai@foxmail.com>
Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Make separate parameters for device of transcription and viterbi steps
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add mention of gecko
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [workflow] add exclude labels option to ignore cherry-picks in release changelog. (#5645)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][ZH] bugfix for the tutorial and add NGC CLI installation guide. (#5643) (#5647)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [Add] ASR+VAD Inference Pipeline (#5575)
Added offline ASR+VAD inference pipeline that matches with what's in RIVA, along with some feature-based ASR and classification datasets.
Signed-off-by: stevehuang52 <heh@nvidia.com>
Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* rename separator to ctm_grouping_separator and refactor
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Bert interleaved (#5556)
* Adding SP and SAR support Bert
* Adding Sequence parallel support to Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding Sequence parallel support to Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding SP and SAR support Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding SP and SAR support Bert
* Adding SP and SAR support Bert
* Adding Sequence parallel support to Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding Sequence parallel support to Bert
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Adding Sequence parallel support to Bert
* Update bert_model.py
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
* Adding tests
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* Adding interleaved pipeline parallelism
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Addressing Eric's comments
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Addressing Eric's comments
* Fix bug fix sequence parallel and Interleaved
* Fix bug fix sequence parallel and Interleaved
Signed-off-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add duration padding support for RADTTS inference (#5650)
* Added duration padding support for RADTTS inference
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Co-authored-by: Kevin Shih <kshih@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add remove_blank_tokens_from_ctm parameter
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Dont save initial_silence line in CTM
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add DLLogger support to exp_manager (#5658)
* Add DLLogger support to exp_manager
Signed-off-by: Alexandre Milesi <milesial@users.noreply.github.com>
* Move dllogger to separate file and check import
Signed-off-by: Alexandre Milesi <milesial@users.noreply.github.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Remove unused import
Signed-off-by: Alexandre Milesi <milesial@users.noreply.github.com>
Signed-off-by: Alexandre Milesi <milesial@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* add minimum_timestamp_duration parameter
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* add suggestion about removing blanks to README
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* reorder args
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* clarify description of ctm_grouping_separator in README
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* update docstring
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS][ZH] bugfix for ngc cli installation. (#5652) (#5664)
Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Port stateless timer to exp manager (#5584)
* Port stateless timer to exp manager
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fixes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fixes and remove from all megatron code
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* Fixes
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Change message
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Fix EMA restart by allowing device to be set by the class init (#5668)
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Remove SDP (moved to separate repo) - merge to main (#5630)
* Remove sdp files from tools folder
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add page to docs with new SDP location
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add interface for making amax reduction optional for FP8 (#5447)
* add TE interface for making amax reduction optional
Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [TTS] add tts dict cust notebook (#5662)
* add tts dict cust notebook
Signed-off-by: ekmb <ebakhturina@nvidia.com>
* review
Signed-off-by: ekmb <ebakhturina@nvidia.com>
* fixed audio links
Signed-off-by: ekmb <ebakhturina@nvidia.com>
* remove old notebook
Signed-off-by: ekmb <ebakhturina@nvidia.com>
* fix typo
Signed-off-by: ekmb <ebakhturina@nvidia.com>
Signed-off-by: ekmb <ebakhturina@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* [ASR] Audio processing base, multi-channel enhancement models (#5356)
* Audio processing base model, enc-mask-dec enhancement, tests and modules
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Addressed review comments
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Fixed CodeQL warnings
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Addressed PR comments
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Addressed PR comments:
- renamed AudioProcessingModel to AudioToAudioModel
- various small modifications
- updated unit tests
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
* Addressed comments
- Moved spectrogram to audio_preprocessing
- Renamed MultichannelFeatures
- Updated config and unit tests
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Expose ClusteringDiarizer device (#5681)
* Expose device for users to set
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
* Expose device for users to set
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: SeanNaren <snarenthiran@nvidia.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Add Beam Search support to ASR transcribe() (#5443)
* Add support for beam decoding via high level API.
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add ctc decoding section
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update ctc transcribe API to return results from beam search
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add argument to preserve arpa file
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update script to use hydra config, add some support for future compute timesteps, add doc for ctc decoding
Signed-off-by: smajumdar <titu1994@gmail.com>
* Update eval script and doc to use new API
Signed-off-by: smajumdar <titu1994@gmail.com>
* Add tests for ctc greedy decoding
Signed-off-by: smajumdar <titu1994@gmail.com>
* Address reviewer comments and add docstrings
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
* Fix changes and address comments
Signed-off-by: smajumdar <titu1994@gmail.com>
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
Signed-off-by: smajumdar <titu1994@gmail.com>
Co-authored-by: Samuel Kriman <samuelkriman@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com>
* Propagate attention_dropout flag for GPT-3 (#5669)
* Propagate attention_dropout flag for GPT-3
Signed-off-by: Mikołaj Błaż <mblaz@nvidia.com>
* Add default to megatron_gpt_config
Signed-off-by: Mikołaj Błaż <mblaz@nvidia.com>
Signed-off-by: Mikołaj Błaż <mblaz@nvidia.com>
Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com>
Co-authored-by: Eric H…
titu1994
added a commit
to titu1994/NeMo
that referenced
this pull request
Mar 24, 2023
* Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount * new structure for tts datasets in script folder Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * remove cmudict downloading Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * rename mixertts dataset, add vocoder dataset Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add libritts processing Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * update tts dataset and libritts get data Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix bugs in vocoder ds Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add ds * changed vits yaml * rm yaml * fix yaml and model * Added scaler * refactored yaml * managed to run in fp16 * refactoring Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix small bugs and add new todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix optimizers Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * Port Variational Inference with Adversarial Learning (VITS) to NeMo TTS (NVIDIA-NeMo#6) * Add vits files Add vits_losses.py, vits_modules.py and vits.py. * Move non-vits models to modules * Add vits.yaml * Add _loader to vits.py * Add basic template for vits * Update vits.yaml with vits parameters * Remove extra space * Add top level training script * Add some variables to vits yaml * Add forward and training methods * Fix imports * Added validation step * Log training losses * Update loss calls to use class attributes * Add VITS to models list * Fix all imports * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Fix imports for VITS * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Add parameters from original VITS config * Fix config file * Fix imports and generate spec from audio * Fix incorrect dimensions * Progress update * Fix loss * Fix cuda thing * Fix monotonic align import * Fix typos in vits.py * Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Jason <jasoli@nvidia.com> Signed-off-by: Jason <jasoli@nvidia.com> * make new commit Signed-off-by: Jason <jasoli@nvidia.com> * add copyright headers Signed-off-by: Jason <jasoli@nvidia.com> * style Signed-off-by: Jason <jasoli@nvidia.com> * rename README Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix style without vits_modules Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add numba code, fix style and add todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * small fix * fix some todos * added numba mas * added DDP sampler * specified versions * fixed for new librosa version * added feature loss * added IPA phonemizer * refactored IPA g2p * added vits losses * some ref * fix * added checkpointing * cp * cfg * merged some 1.8.0 fixes * plt fix * fix logging * fix checkpoint loading * refactored inference * fp32 run * update branch Signed-off-by: ericharper <complex451@gmail.com> * update package info Signed-off-by: ericharper <complex451@gmail.com> * new exp * update branch Signed-off-by: ericharper <complex451@gmail.com> * Restored tests previously disabled for 22.03 base (NVIDIA-NeMo#4109) Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * add augmentation to label models (NVIDIA-NeMo#4113) * add augmentation to label models Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * duration fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Call register_bert_model after assigning self.bert_model variable (NVIDIA-NeMo#4116) Signed-off-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> * Tutorial on ITN with Thutmose tagger and small fixes (NVIDIA-NeMo#4117) * 1. Add tutorial. 2. Move a function to fix import in tutorial. 3. Merge multiple spaces into one space in the final output Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fixes for code review Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * Add tutorial to tutorials.rst Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * cleaned up TN/ ITN doc (NVIDIA-NeMo#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Check implicit grad acc in GLUE dataset building (NVIDIA-NeMo#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update the default (NVIDIA-NeMo#4135) Signed-off-by: ekmb <ebakhturina@nvidia.com> * Draft: Fix restoring from checkpoint for case when `model.common_dataset_parameters.label_vocab_dir` is provided (NVIDIA-NeMo#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Do not create tmp directory Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix parameter name Signed-off-by: PeganovAnton <peganoff2@mail.ru> * finish cherry-pick op Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix labels errors Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove duplicate stage Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Change target branch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * fix typo (NVIDIA-NeMo#4140) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Fix/punctuation avoid overwritting tmp files (NVIDIA-NeMo#4144) * Add draft of fixing tmp files overwritting Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Use built-in tempfile library Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix code style Signed-off-by: PeganovAnton <peganoff2@mail.ru> * bug_fix_diarization_manifest_creation (NVIDIA-NeMo#4125) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> * fix doc (NVIDIA-NeMo#4146) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Tacotron2 retrain (NVIDIA-NeMo#4103) * fix yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Fix for new TTSDataset class Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * inference fix Signed-off-by: treacker <emshabalin@yandex.ru> * removed old code Signed-off-by: treacker <emshabalin@yandex.ru> * updated parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * reverted version update Signed-off-by: treacker <emshabalin@yandex.ru> * refactored parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * Updated Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Update Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Update tacotron.yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * cleaned up TN/ ITN doc (NVIDIA-NeMo#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: treacker <emshabalin@yandex.ru> * Check implicit grad acc in GLUE dataset building (NVIDIA-NeMo#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Fixed jenkins Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * Multiprocess improvements (NVIDIA-NeMo#4127) * initial commit Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * start fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated scp to filelist Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * WaveGlow input type fixes (NVIDIA-NeMo#4151) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * notebooks' link, typo and import fix (NVIDIA-NeMo#4158) * redo missing pr 4007 Signed-off-by: fayejf <fayejf07@gmail.com> * remove extremely unreliable links Signed-off-by: fayejf <fayejf07@gmail.com> * Thutmose tagger bug fixes (NVIDIA-NeMo#4162) * add pretrained ngc model, small fixes Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * 1. fix typos. 2. write magic functions without space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add example of inference with pretrained model Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * changed model location to nemo Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * style fix Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * update speaker docs (NVIDIA-NeMo#4164) * update speaker docs Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * chunks -> segments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Khz -> kHz Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * changed to vits g2p * refactoring * added cosineLR * Updated whitelist path * added vanilla torch grad scaler * Fixed lightning version * added warmup and wd * switched to cosineLR * refactored data classes for vits * some fixes * fixed import * changeg train loop * fixed scheduler bug * refactoring for exps * Refactored loss logic * Ref for exps * added coqui stuff * exps * bugfix * added side file * bugfix * reverted * fixed sampler behaviour * updated for ptl 1.7.2 * refactored dataloader func * some cleaning * reverted to vanilla loss * modified for pickling * added dataset class * fixed torch version * added autocast for fp training * removed coqui files * 'Fixed tokenizer' * Fix tokenizer * update branch Signed-off-by: ericharper <complex451@gmail.com> * Fix link to inference notebook (NVIDIA-NeMo#5247) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Update ASR scores table (NVIDIA-NeMo#5254) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Fix links to speaker identification notebook (NVIDIA-NeMo#5260) Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> * Minor typo fixes in TTS tutorial (NVIDIA-NeMo#5266) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Pcla tutorial fixes (NVIDIA-NeMo#5271) * Fixed typos Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed cell type and tatoeba reference Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed typo Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed branch variable Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix bug into Dialogue tutorial (NVIDIA-NeMo#5277) * Typo fix (NVIDIA-NeMo#5288) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix dialogue tutorial bug (NVIDIA-NeMo#5297) * set add_pooling_layer=False for huggingface bert model * remove add_pooling_layer=False and set find_unused_parameters=True * set num_prompt_tokens to 0 for huggingface * small bugfix for r1.13.0 (NVIDIA-NeMo#5310) * typo fix Signed-off-by: fayejf <fayejf07@gmail.com> * udpate transcribe Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: fayejf <fayejf07@gmail.com> * Add italian model checkpoints (NVIDIA-NeMo#5316) Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> * [STT] Add Ru ASR Conformer-CTC and Conformer-Transducer (NVIDIA-NeMo#5340) * [STT] Add stt_ru_conformer_ctc_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [STT] Add stt_ru_conformer_transducer_large Add stt_ru_conformer_transducer_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Pcla tutorial fixes (NVIDIA-NeMo#5313) * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * moved `create_text_and_labels` to token_classification_utils.py Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * a lot of refactoring * strict ptl version * strict ptl version * reverted plt version * Added base text2audio class * Fix issue with HF Model upload tutorial (NVIDIA-NeMo#5359) * Add Gradio App to ASR Docs (NVIDIA-NeMo#5270) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> (cherry picked from commit e4b6a38) * Fix issue with normalized config for dataset name Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * tutorial fixes (NVIDIA-NeMo#5354) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Add SDP documentation (NVIDIA-NeMo#5274) * Add details to SDP README.md Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to WriteManifest processor Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to CreateInitialManifestMLS Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ModifyManifestTextProcessor docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ASRInference docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add base_processor docstrings Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add minimal SDP docs page Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Update tools/speech_dataset_processor/README.md Co-authored-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> * Write simple README for SDP and move complex explanations to docs Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove incorrect type hints Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Make config example less confusing Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Fix typo Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Clarify that YAML file is config file in README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove unused imports Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove SDP docs for now Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove links to docs in SDP README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> * [Bugfix] Added rm -f / wget- nc command in multispeaker sim notebook to r1.13.0 (NVIDIA-NeMo#5375) * Fix minor error in notebook Signed-off-by: Taejin Park <tango4j@gmail.com> * changed branch name in tutorial notebook Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Taejin Park <tango4j@gmail.com> * Rename Speech Dataset Processor to Speech Data Processor (NVIDIA-NeMo#5378) Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * fix for num worker 0 causing issues in losses after 1 epoch (NVIDIA-NeMo#5379) * Fixed bug in notebook (NVIDIA-NeMo#5382) Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> * Force MHA QKV onto fp32 (NVIDIA-NeMo#5391) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Added scheduling variety * ref * Fix for prompt table restore error (NVIDIA-NeMo#5393) * Fix for prompt table restore error Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added more saftey checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Added more condition checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Virginia Adams <vadams@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Fix args (NVIDIA-NeMo#5410) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * bugfix * import tests * Add temporary fix for CUDA issue in Dockerfile (NVIDIA-NeMo#5421) Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> * Megatron Export Update (NVIDIA-NeMo#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added comment Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * updated del onnx_model place Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> * disable pc test (NVIDIA-NeMo#5426) Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: ekmb <ebakhturina@nvidia.com> * Fix GPT generation when using sentencepiece tokenizer (NVIDIA-NeMo#5413) * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Disable sync_batch_comm in validation_step for GPT (NVIDIA-NeMo#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <complex451@gmail.com> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Empty Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Comment out test Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> * Revert "Add temporary fix for CUDA issue in Dockerfile (NVIDIA-NeMo#5421)" (NVIDIA-NeMo#5431) This reverts commit 0718b17. * Revert workaround for T5 that sets number of workers to 0 & sync_batch_comm=False (NVIDIA-NeMo#5420) * Revert workers workaround Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix in config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Fixed discrepancies * updated Jenkisfile * updated Jenkisfile * Cleaning * fixed the onnx bug in conformer for non-streaming models. (NVIDIA-NeMo#5242) (NVIDIA-NeMo#5446) Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> * Set sync_batch_comm in other places (NVIDIA-NeMo#5448) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Radtts 1.13 (NVIDIA-NeMo#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (NVIDIA-NeMo#5358) * [TTS] add CI test for RADTTS training recipe. Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Radtts 1.13 plus (NVIDIA-NeMo#5457) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (NVIDIA-NeMo#5358) * Fixing RADTTS training - removing view buffer and fixing accuracy issue * Fixes for Torchscript/Triton * Added autocast to radtts UT * using cuda() for training example Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Add num layers check (NVIDIA-NeMo#5470) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Change to kwargs (NVIDIA-NeMo#5475) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (NVIDIA-NeMo#5339) (NVIDIA-NeMo#5478) * Initial refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for eval Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove comments Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Minor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove old comment Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (NVIDIA-NeMo#5480) * updated export_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (NVIDIA-NeMo#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * minor bug fix (NVIDIA-NeMo#5521) Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added set_start_method + function param bugfix (NVIDIA-NeMo#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <complex451@gmail.com> * remove notebook (NVIDIA-NeMo#5548) Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: ericharper <complex451@gmail.com> * Remove broadcast (NVIDIA-NeMo#5558) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * cleaning * Fix all gather while writing to a file during T5 finetuning (NVIDIA-NeMo#5561) * Gather from data parallel only instead of all ranks Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update readme Signed-off-by: ericharper <complex451@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added copyright * fixed imports * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * cleaning * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fixed filesize check * last cleaning Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated cmudict path * fixed merge bug Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * warnings fix * fix warnings Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * storing * updated version Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * update Jenkinsfile versions Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed issues Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed more issues * more fixes Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * added experimental tag * Clarification updates Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * docstring fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * imports fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix typo Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * excessive comtutations fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * typecheck fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Small refactoring * Small refactoring Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * reversed exp_manager params Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Fixed call for new function signature Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> Signed-off-by: Jason <jasoli@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: PeganovAnton <peganoff2@mail.ru> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> Co-authored-by: jasonjjl1999 <jasonjjl1999@gmail.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Oktai Tatanov <oktai.tatanov@gmail.com> Co-authored-by: jasonjjl1999 <43978361+jasonjjl1999@users.noreply.github.com> Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: Jason <jasoli@nvidia.com> Co-authored-by: ericharper <complex451@gmail.com> Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: Ramanathan Arunachalam <ramanathan.arun@rutgers.edu> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Evelina <10428420+ekmb@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: Jocelyn <jocelynh@nvidia.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Sean Naren <snarenthiran@nvidia.com> Co-authored-by: Matvei Novikov <mattyson.so@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> Co-authored-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Taejin Park <tango4j@gmail.com> Co-authored-by: Adi Renduchintala <108822655+arendu@users.noreply.github.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: yaoyu-33 <54727607+yaoyu-33@users.noreply.github.com> Co-authored-by: David <amosalla@asu.edu> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Signed-off-by: Elena Rastorgueva erastorgueva@nvidia.com
What does this PR do ?
Rename Speech Dataset Processor to Speech Data Processor.
Add a one line overview of what this PR aims to accomplish.
Update directory name and contents of README to reflect name change.
Collection: [Note which collection this PR will affect]
tools/speech_data_processor
Changelog
Usage
# Add a code snippet demonstrating how to use thisBefore your PR is "Ready for review"
Pre checks:
PR Type:
If you haven't finished some of the above items you can still open "Draft" PR.
Who can review?
Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.
Additional Information