https://github.com/NVIDIA/NeMo/blob/main/CONTRIBUTING.md
10. F-strings are preferred to formatted strings.
I noticed that this script uses the str.format() method in many places. Following the NeMo contributing guide, I suggest using f-strings instead, for example: print(f"creating and writing chunks to {CWD}")
str.format() appeared in Python 2.6.
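For illustration, here is a minimal sketch of the change I have in mind. Only `CWD` comes from the snippet above; the directory value and the two helper names are made up for the example and are not taken from the script.

```python
import os

# Hypothetical working directory; the real script defines its own CWD.
CWD = os.path.join("data", "chunks")


def message_with_format(cwd: str) -> str:
    # Older style: str.format(), available since Python 2.6.
    return "creating and writing chunks to {}".format(cwd)


def message_with_fstring(cwd: str) -> str:
    # Preferred style per the contributing guide: an f-string (Python 3.6+).
    return f"creating and writing chunks to {cwd}"


# Both styles build the same text, so the switch does not change behavior.
print(message_with_fstring(CWD))
```

Since the output is identical, the change should be safe to make mechanically across the script.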
Thanks tj, updated the code to use f-strings.
I think this is a good time to do the "scp -> filelist" refactor, although you will need to rename the file.
Two problems:
(1) What you are using is not an SCP file.
(2) Even if it were an SCP file, a description of the SCP file format is very hard to find, and NeMo has no explanation of it.
I believe this script is important for preparing datasets to train speaker embedding models, so I think changing the name is necessary.
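To make the rename concrete, here is a rough sketch of what I mean by a filelist, assuming it is a plain text file with one audio path per line. The function name `read_filelist`, the file name, and the sample paths are all hypothetical and not taken from the script.

```python
import os
import tempfile
from typing import List


def read_filelist(path: str) -> List[str]:
    """Return the non-empty lines of a filelist, one audio path per line."""
    with open(path, "r", encoding="utf-8") as f:
        return [line.strip() for line in f if line.strip()]


# Build a tiny example filelist in a temporary directory (paths are made up).
tmp_dir = tempfile.mkdtemp()
filelist_path = os.path.join(tmp_dir, "train_filelist.txt")
with open(filelist_path, "w", encoding="utf-8") as f:
    f.write("/data/spk001/utt1.wav\n/data/spk001/utt2.wav\n\n/data/spk002/utt1.wav\n")

audio_paths = read_filelist(filelist_path)
print(f"{len(audio_paths)} audio files listed in {filelist_path}")
```

As far as I understand, a Kaldi-style SCP file instead pairs an utterance ID with a path on each line, which is why "filelist" seems like the more accurate name here.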
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
d2fc2c3 to 2006a80
    "embs": NeuralType(('B', 'D'), AcousticEncodedRepresentation()),
}

@typecheck()
You can replace the last two lines of your forward() with a call to forward_for_export(). Do that in a later PR.
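Roughly what I have in mind, as a sketch only. `TinyDecoder`, its `scale` parameter, and the list-based pooling are hypothetical stand-ins so the example runs without any dependencies; this is not the NeMo module under review.

```python
from typing import List, Tuple


class TinyDecoder:
    """Hypothetical stand-in for the module under review; not NeMo code."""

    def __init__(self, scale: float = 2.0):
        self.scale = scale

    def forward_for_export(self, encoder_output: List[List[float]]) -> Tuple[List[float], List[float]]:
        # The shared computation lives in exactly one place.
        pooled = [sum(frame) / len(frame) for frame in encoder_output]
        embs = [self.scale * p for p in pooled]
        return pooled, embs

    def forward(self, encoder_output: List[List[float]]) -> Tuple[List[float], List[float]]:
        # Checks that only matter outside export stay in forward().
        if not encoder_output:
            raise ValueError("encoder_output must not be empty")
        # Delegate instead of repeating the last lines of the computation.
        return self.forward_for_export(encoder_output)


decoder = TinyDecoder()
print(decoder.forward([[1.0, 3.0]]))
```

With this shape, the training path and the export path cannot drift apart, because both go through the same code.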
Multiprocess improvements (#4127)
* initial commit
* start fix
* improve multiprocessing speed while creating speaker dataset
* updated scp to filelist
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
* update branch Signed-off-by: ericharper <complex451@gmail.com> * update package info Signed-off-by: ericharper <complex451@gmail.com> * cleaned up TN/ ITN doc (NVIDIA-NeMo#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Draft: Fix restoring from checkpoint for case when `model.common_dataset_parameters.label_vocab_dir` is provided (NVIDIA-NeMo#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Do not create tmp directory Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix parameter name Signed-off-by: PeganovAnton <peganoff2@mail.ru> * finish cherry-pick op Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix labels errors Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove duplicate stage Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Change target branch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * fix doc (NVIDIA-NeMo#4146) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Tacotron2 retrain (NVIDIA-NeMo#4103) * fix yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Fix for new TTSDataset class Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * inference fix Signed-off-by: treacker <emshabalin@yandex.ru> * removed old code Signed-off-by: treacker <emshabalin@yandex.ru> * updated parser 
logic Signed-off-by: treacker <emshabalin@yandex.ru> * reverted version update Signed-off-by: treacker <emshabalin@yandex.ru> * refactored parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * Updated Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Update Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Update tacotron.yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * cleaned up TN/ ITN doc (NVIDIA-NeMo#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: treacker <emshabalin@yandex.ru> * Check implicit grad acc in GLUE dataset building (NVIDIA-NeMo#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Fixed jenkins Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * Multiprocess improvements (NVIDIA-NeMo#4127) * initial commit Signed-off-by: nithinraok 
<nithinrao.koluguri@gmail.com> * start fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated scp to filelist Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * notebooks' link, typo and import fix (NVIDIA-NeMo#4158) * redo missing pr 4007 Signed-off-by: fayejf <fayejf07@gmail.com> * remove extremely unreliable links Signed-off-by: fayejf <fayejf07@gmail.com> * update speaker docs (NVIDIA-NeMo#4164) * update speaker docs Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * chunks -> segments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Khz -> kHz Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * small fix (NVIDIA-NeMo#4180) Signed-off-by: fayejf <fayejf07@gmail.com> * fix the server key value problem (NVIDIA-NeMo#4196) Signed-off-by: Yi Dong <yidong@nvidia.com> * Fix/punctuation/trainer required for setting test data (NVIDIA-NeMo#4199) * Draft of fix Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add warnings and replace globa_step with current_epoch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Small improvements to warnings Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Error and warning messages improvements Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Replace self.trainer with self._trainer Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Update ContextNet version (NVIDIA-NeMo#4207) Signed-off-by: smajumdar <smajumdar@nvidia.com> * fix bugs for dialogue tutorial (NVIDIA-NeMo#4211) Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Dialogue tutorial fix (NVIDIA-NeMo#4214) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Add docs for Thutmose Tagger (NVIDIA-NeMo#4173) * Add docs for Thutmose Tagger 
Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add level in docs Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * delete folder to avoid error with running when folder exists from previous run Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> * Dialogue tutorial fix (NVIDIA-NeMo#4218) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * restore previously deleted files Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * style fix Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Dialogue tutorial fix (NVIDIA-NeMo#4221) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * restore previously deleted files Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * style fix Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * fix syntax error in ipynb-file (NVIDIA-NeMo#4228) Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix json serialize (NVIDIA-NeMo#4235) Signed-off-by: Yi Dong <yidong@nvidia.com> * Prompt Learning Typo Fixes (NVIDIA-NeMo#4238) * Prompt tuning notebook typo fixes Signed-off-by: Virginia Adams <vadams@nvidia.com> * Update tutorials.rst * Update prompt_learning.rst * Update prompt_learning.rst * fixing bug 3642622 (NVIDIA-NeMo#4250) * fixing bug 3642622 Signed-off-by: Ghasem Pasandi <gpasandi@nvidia.com> * fixing bug 3642622 Signed-off-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Ghasem Pasandi 
<gpasandi@nvidia.com> * fix broken link in the tutorial (NVIDIA-NeMo#4257) Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * Typo fix, branch change, better download messagae (NVIDIA-NeMo#4262) Signed-off-by: Virginia Adams <vadams@nvidia.com> * Raise error if bicleaner is not installed in NMT Data preprocesing notebook (NVIDIA-NeMo#4264) * Raise error if bicleaner is not installed Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Clear cells Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix missing validation dataset, whitelist certain keywords for datasets (NVIDIA-NeMo#4269) * Fix missing validation dataset, whitelist certain keywords for datasets Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix missing validation dataset, whitelist certain keywords for datasets Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update asr configs with num_workers and pin_memory (NVIDIA-NeMo#4270) Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix epoch end (NVIDIA-NeMo#4265) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Eric Harper <complex451@gmail.com> * Set Save on train end to false (NVIDIA-NeMo#4274) * Set Save on train end to false Signed-off-by: Virginia Adams <vadams@nvidia.com> * Update prompt_learning.rst * Update prompt_learning.rst * Update YAML (NVIDIA-NeMo#4261) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Updated config to fix CI test OOM error (NVIDIA-NeMo#4279) * Updated config to fix CI test issue Signed-off-by: Virginia Adams <vadams@nvidia.com> * Increased num workers Signed-off-by: Virginia Adams <vadams@nvidia.com> * verbose k2 install, skip if failed (NVIDIA-NeMo#4289) Signed-off-by: Aleksandr Laptev <alaptev@nvidia.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> * Changed total virtual prompt tokens (NVIDIA-NeMo#4295) * Changed total virtual prompt tokens 
Signed-off-by: Virginia Adams <vadams@nvidia.com> * put number of workers back Signed-off-by: Virginia Adams <vadams@nvidia.com> * upper bound lightning Signed-off-by: ericharper <complex451@gmail.com> * update branch Signed-off-by: ericharper <complex451@gmail.com> * update config Signed-off-by: ericharper <complex451@gmail.com> * remove duplicate test Signed-off-by: ericharper <complex451@gmail.com> * fix tn test cases Signed-off-by: ericharper <complex451@gmail.com> * add another safe.directory Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: treacker <36159472+treacker@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Yi Dong <43824965+yidong72@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: Ghasem <35242805+pasandi20@users.noreply.github.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Aleksandr Laptev <laptevsasha12@gmail.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> Signed-off-by: stevehuang52 <heh@nvidia.com>
* Update container to 22.05 (#4329) * update container to 22.05 Signed-off-by: ericharper <complex451@gmail.com> * try adding safe directory Signed-off-by: ericharper <complex451@gmail.com> * try env var Signed-off-by: ericharper <complex451@gmail.com> * printenv Signed-off-by: ericharper <complex451@gmail.com> * try GIT_BRANCH Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> * remove dbug statements Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: stevehuang52 <heh@nvidia.com> * Merge r1.9.0 main (#4331) * update branch Signed-off-by: ericharper <complex451@gmail.com> * update package info Signed-off-by: ericharper <complex451@gmail.com> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Draft: Fix restoring from checkpoint for case when `model.common_dataset_parameters.label_vocab_dir` is provided (#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Do not create tmp directory Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix parameter name Signed-off-by: PeganovAnton <peganoff2@mail.ru> * finish cherry-pick op Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix labels errors Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove duplicate stage Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Change target branch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * fix doc (#4146) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Tacotron2 retrain (#4103) * fix yaml 
Signed-off-by: treacker <emshabalin@yandex.ru> * Fix for new TTSDataset class Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * inference fix Signed-off-by: treacker <emshabalin@yandex.ru> * removed old code Signed-off-by: treacker <emshabalin@yandex.ru> * updated parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * reverted version update Signed-off-by: treacker <emshabalin@yandex.ru> * refactored parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * Updated Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Update Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Update tacotron.yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: treacker <emshabalin@yandex.ru> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: 
treacker <emshabalin@yandex.ru> * Fixed jenkins Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * Multiprocess improvements (#4127) * initial commit Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * start fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated scp to filelist Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * notebooks' link, typo and import fix (#4158) * redo missing pr 4007 Signed-off-by: fayejf <fayejf07@gmail.com> * remove extremely unreliable links Signed-off-by: fayejf <fayejf07@gmail.com> * update speaker docs (#4164) * update speaker docs Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * chunks -> segments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Khz -> kHz Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * small fix (#4180) Signed-off-by: fayejf <fayejf07@gmail.com> * fix the server key value problem (#4196) Signed-off-by: Yi Dong <yidong@nvidia.com> * Fix/punctuation/trainer required for setting test data (#4199) * Draft of fix Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add warnings and replace globa_step with current_epoch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Small improvements to warnings Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Error and warning messages improvements Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Replace self.trainer with self._trainer Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Update ContextNet version (#4207) Signed-off-by: smajumdar <smajumdar@nvidia.com> * fix bugs for dialogue 
tutorial (#4211) Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Dialogue tutorial fix (#4214) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Add docs for Thutmose Tagger (#4173) * Add docs for Thutmose Tagger Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add level in docs Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * delete folder to avoid error with running when folder exists from previous run Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> * Dialogue tutorial fix (#4218) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * restore previously deleted files Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * style fix Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Dialogue tutorial fix (#4221) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * restore previously deleted files Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * style fix Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * fix syntax error in ipynb-file (#4228) Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix json serialize (#4235) Signed-off-by: Yi Dong <yidong@nvidia.com> * Prompt Learning Typo Fixes (#4238) * Prompt tuning notebook typo fixes Signed-off-by: Virginia 
Adams <vadams@nvidia.com> * Update tutorials.rst * Update prompt_learning.rst * Update prompt_learning.rst * fixing bug 3642622 (#4250) * fixing bug 3642622 Signed-off-by: Ghasem Pasandi <gpasandi@nvidia.com> * fixing bug 3642622 Signed-off-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> * fix broken link in the tutorial (#4257) Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * Typo fix, branch change, better download messagae (#4262) Signed-off-by: Virginia Adams <vadams@nvidia.com> * Raise error if bicleaner is not installed in NMT Data preprocesing notebook (#4264) * Raise error if bicleaner is not installed Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Clear cells Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix missing validation dataset, whitelist certain keywords for datasets (#4269) * Fix missing validation dataset, whitelist certain keywords for datasets Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix missing validation dataset, whitelist certain keywords for datasets Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update asr configs with num_workers and pin_memory (#4270) Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix epoch end (#4265) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Eric Harper <complex451@gmail.com> * Set Save on train end to false (#4274) * Set Save on train end to false Signed-off-by: Virginia Adams <vadams@nvidia.com> * Update prompt_learning.rst * Update prompt_learning.rst * Update YAML (#4261) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Updated config to fix CI test OOM error (#4279) * Updated config to fix CI test issue Signed-off-by: Virginia Adams <vadams@nvidia.com> * Increased num workers Signed-off-by: Virginia Adams <vadams@nvidia.com> * verbose k2 install, skip if failed (#4289) 
Signed-off-by: Aleksandr Laptev <alaptev@nvidia.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> * Changed total virtual prompt tokens (#4295) * Changed total virtual prompt tokens Signed-off-by: Virginia Adams <vadams@nvidia.com> * put number of workers back Signed-off-by: Virginia Adams <vadams@nvidia.com> * upper bound lightning Signed-off-by: ericharper <complex451@gmail.com> * update branch Signed-off-by: ericharper <complex451@gmail.com> * update config Signed-off-by: ericharper <complex451@gmail.com> * remove duplicate test Signed-off-by: ericharper <complex451@gmail.com> * fix tn test cases Signed-off-by: ericharper <complex451@gmail.com> * add another safe.directory Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: treacker <36159472+treacker@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Yi Dong <43824965+yidong72@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: Ghasem <35242805+pasandi20@users.noreply.github.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Aleksandr Laptev <laptevsasha12@gmail.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> Signed-off-by: stevehuang52 <heh@nvidia.com> * fix full_randn bucket hang Signed-off-by: stevehuang52 <heh@nvidia.com> * remove unused variables 
Signed-off-by: stevehuang52 <heh@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: treacker <36159472+treacker@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Yi Dong <43824965+yidong72@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: Ghasem <35242805+pasandi20@users.noreply.github.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Aleksandr Laptev <laptevsasha12@gmail.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com>
* Update container to 22.05 (NVIDIA-NeMo#4329) * update container to 22.05 Signed-off-by: ericharper <complex451@gmail.com> * try adding safe directory Signed-off-by: ericharper <complex451@gmail.com> * try env var Signed-off-by: ericharper <complex451@gmail.com> * printenv Signed-off-by: ericharper <complex451@gmail.com> * try GIT_BRANCH Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> * remove dbug statements Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: stevehuang52 <heh@nvidia.com> * Merge r1.9.0 main (NVIDIA-NeMo#4331) * update branch Signed-off-by: ericharper <complex451@gmail.com> * update package info Signed-off-by: ericharper <complex451@gmail.com> * cleaned up TN/ ITN doc (NVIDIA-NeMo#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Draft: Fix restoring from checkpoint for case when `model.common_dataset_parameters.label_vocab_dir` is provided (NVIDIA-NeMo#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Do not create tmp directory Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix parameter name Signed-off-by: PeganovAnton <peganoff2@mail.ru> * finish cherry-pick op Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix labels errors Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove duplicate stage Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Change target branch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * fix doc (NVIDIA-NeMo#4146) Signed-off-by: Yang Zhang 
<yangzhang@nvidia.com> * Tacotron2 retrain (NVIDIA-NeMo#4103) * fix yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Fix for new TTSDataset class Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * inference fix Signed-off-by: treacker <emshabalin@yandex.ru> * removed old code Signed-off-by: treacker <emshabalin@yandex.ru> * updated parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * reverted version update Signed-off-by: treacker <emshabalin@yandex.ru> * refactored parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * Updated Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Update Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Update tacotron.yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * cleaned up TN/ ITN doc (NVIDIA-NeMo#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: treacker <emshabalin@yandex.ru> * Check implicit grad acc in GLUE dataset building (NVIDIA-NeMo#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: treacker 
<emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Fixed jenkins Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * Multiprocess improvements (NVIDIA-NeMo#4127) * initial commit Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * start fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated scp to filelist Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * notebooks' link, typo and import fix (NVIDIA-NeMo#4158) * redo missing pr 4007 Signed-off-by: fayejf <fayejf07@gmail.com> * remove extremely unreliable links Signed-off-by: fayejf <fayejf07@gmail.com> * update speaker docs (NVIDIA-NeMo#4164) * update speaker docs Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * chunks -> segments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Khz -> kHz Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * small fix (NVIDIA-NeMo#4180) Signed-off-by: fayejf <fayejf07@gmail.com> * fix the server key value problem (NVIDIA-NeMo#4196) Signed-off-by: Yi Dong <yidong@nvidia.com> * Fix/punctuation/trainer required for setting test data (NVIDIA-NeMo#4199) * Draft of fix Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add warnings and replace globa_step with current_epoch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Small improvements to warnings Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Error and warning messages improvements Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Replace 
self.trainer with self._trainer Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Update ContextNet version (NVIDIA-NeMo#4207) Signed-off-by: smajumdar <smajumdar@nvidia.com> * fix bugs for dialogue tutorial (NVIDIA-NeMo#4211) Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Dialogue tutorial fix (NVIDIA-NeMo#4214) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Add docs for Thutmose Tagger (NVIDIA-NeMo#4173) * Add docs for Thutmose Tagger Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add level in docs Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * delete folder to avoid error with running when folder exists from previous run Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> * Dialogue tutorial fix (NVIDIA-NeMo#4218) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * restore previously deleted files Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * style fix Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Dialogue tutorial fix (NVIDIA-NeMo#4221) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * restore previously deleted files Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * style fix Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * fix syntax error in ipynb-file (NVIDIA-NeMo#4228) Signed-off-by: Alexandra 
Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix json serialize (NVIDIA-NeMo#4235) Signed-off-by: Yi Dong <yidong@nvidia.com> * Prompt Learning Typo Fixes (NVIDIA-NeMo#4238) * Prompt tuning notebook typo fixes Signed-off-by: Virginia Adams <vadams@nvidia.com> * Update tutorials.rst * Update prompt_learning.rst * Update prompt_learning.rst * fixing bug 3642622 (NVIDIA-NeMo#4250) * fixing bug 3642622 Signed-off-by: Ghasem Pasandi <gpasandi@nvidia.com> * fixing bug 3642622 Signed-off-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> * fix broken link in the tutorial (NVIDIA-NeMo#4257) Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * Typo fix, branch change, better download messagae (NVIDIA-NeMo#4262) Signed-off-by: Virginia Adams <vadams@nvidia.com> * Raise error if bicleaner is not installed in NMT Data preprocesing notebook (NVIDIA-NeMo#4264) * Raise error if bicleaner is not installed Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Clear cells Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix missing validation dataset, whitelist certain keywords for datasets (NVIDIA-NeMo#4269) * Fix missing validation dataset, whitelist certain keywords for datasets Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix missing validation dataset, whitelist certain keywords for datasets Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update asr configs with num_workers and pin_memory (NVIDIA-NeMo#4270) Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix epoch end (NVIDIA-NeMo#4265) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Eric Harper <complex451@gmail.com> * Set Save on train end to false (NVIDIA-NeMo#4274) * Set Save on train end to false Signed-off-by: Virginia Adams <vadams@nvidia.com> * Update prompt_learning.rst * Update 
prompt_learning.rst * Update YAML (NVIDIA-NeMo#4261) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Updated config to fix CI test OOM error (NVIDIA-NeMo#4279) * Updated config to fix CI test issue Signed-off-by: Virginia Adams <vadams@nvidia.com> * Increased num workers Signed-off-by: Virginia Adams <vadams@nvidia.com> * verbose k2 install, skip if failed (NVIDIA-NeMo#4289) Signed-off-by: Aleksandr Laptev <alaptev@nvidia.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> * Changed total virtual prompt tokens (NVIDIA-NeMo#4295) * Changed total virtual prompt tokens Signed-off-by: Virginia Adams <vadams@nvidia.com> * put number of workers back Signed-off-by: Virginia Adams <vadams@nvidia.com> * upper bound lightning Signed-off-by: ericharper <complex451@gmail.com> * update branch Signed-off-by: ericharper <complex451@gmail.com> * update config Signed-off-by: ericharper <complex451@gmail.com> * remove duplicate test Signed-off-by: ericharper <complex451@gmail.com> * fix tn test cases Signed-off-by: ericharper <complex451@gmail.com> * add another safe.directory Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: treacker <36159472+treacker@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Yi Dong <43824965+yidong72@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> Co-authored-by: Virginia Adams 
<78445382+vadam5@users.noreply.github.com> Co-authored-by: Ghasem <35242805+pasandi20@users.noreply.github.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Aleksandr Laptev <laptevsasha12@gmail.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> Signed-off-by: stevehuang52 <heh@nvidia.com> * fix full_randn bucket hang Signed-off-by: stevehuang52 <heh@nvidia.com> * remove unused variables Signed-off-by: stevehuang52 <heh@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: treacker <36159472+treacker@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Yi Dong <43824965+yidong72@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: Ghasem <35242805+pasandi20@users.noreply.github.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Aleksandr Laptev <laptevsasha12@gmail.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com>
* stateless RNNT working Signed-off-by: Hainan Xu <hainanx@nvidia.com> * batch decode working Signed-off-by: Hainan Xu <hainanx@nvidia.com> * working backup Signed-off-by: Hainan Xu <hainanx@nvidia.com> * good working version Signed-off-by: Hainan Xu <hainanx@nvidia.com> * temporarily make norm layer have affine Signed-off-by: Hainan Xu <hainanx@nvidia.com> * temp Signed-off-by: Hainan Xu <hainanx@nvidia.com> * temp Signed-off-by: Hainan Xu <hainanx@nvidia.com> * [TTS] add staticmethod decoration for BetaBinomialInterpolator (#4319) Signed-off-by: Xuesong Yang <xuesongyxs@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * [TTS] remove redundant lines and declare global variables and capture (#4320) exception of non-supported windows. Signed-off-by: Xuesong Yang <xuesongyxs@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Finetune T5 on the prefix-lm objective (#4328) * Add script and yaml config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix yaml config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Style Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Update yaml to remove hardcoded model path Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Fuse bias with geglu in ParallelMLP (#4213) * add code of fused_bias_geglu * call fused_bias_geglu in ParallelMLP * fix some bugs * change biad_gelu_activation to bias_activation_fusion * fix the setting of bias_actication_fusion for T5 * delete bias_gelu_fusion from T5 example config * push reformatted files * hto4h gemms fusion * remove hto4h gemms fusion * push reformatted files * disable bias_activation_fusion while activation is not geglu * add bias_activation_fusion in yaml config file * add bias_gelu_fusion in T5 config yaml file to pass CI test * change bias_gelu_fusion to bias_activation_fusion for 
T5 CI test * recover latest change Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Support larger datasets for question answering (#4205) * refactor dialogue state tracking for modelling/dataset interoperability Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style changes Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style raised by lgtm Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style formatting Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update template to include description of intent Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * changes based on requests in review Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add compatibility with assistant dataset Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove dialogue_state_tracking Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update huggingface utils for dialogue Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * rename dialogue_state_tracking_hybrid to dialogue_state_tracking_sgdqa Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix nemo/collections/nlp/models/dialogue_state_tracking_sgdqa/__init__.py Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo Signed-off-by: Zhilin Wang 
<zhilinw@nvidia.com> * add docstrings for assistant data processsor Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins for SGDGEN local checkpoint Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * use local vocab file for Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * patch for Jenkins CI using local file Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add slot filling prediction and metrics Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused code Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * refactor metrics code out of Dialogue GPT Model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate backward compatible support for IntentSlotClassificationModel (bert model) Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * save prediction file for IntentSlotClassification Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update dialogue gpt model training for megatron gpt Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove batch generate for HF GPT2, which causes lower performance Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add few shot capability to dialogue gpt model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile and remove unused import Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update code description and clarity Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address PR comments Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate compatibility with ZeroShotIntentModel Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * rename folder to dialogue due to increased scope and further refactor for clarity Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * added dialogue GPT for sequence generation task (e.g. 
answer extender) Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add CI test for DialogueGPTGenerationModel Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate DialogueS2SGenerationModel for generation task (e.g. answer extender) Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * modify huggingface utils to support HF t5/BART models Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused imports Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update bleu metric Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix bleu metric style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * debug bleu metric Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * debug bleu metric Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update based on PR #3893 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update 2 based on PR #3893 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update 3 based on PR #3893 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate sgd generation based on user user utterance and system slot-values to generate system utterance Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add validation model saving capabilities Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * cleaned up code for SGD Based Answer extender Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Dialogue Generation CI Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix Jenkins CI issue" Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add support for design dataset Signed-off-by: Zhilin Wang 
<zhilinw@nvidia.com> * remove unnecessary imports Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * support megatron for dialogue_s2s_generation_model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * reduce loaded samples in MSMarcoDataProcessor to 64 when cfg.model.dataset.debug_mode=True Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update CI Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update checkpoint and predictions filename to include epoch number Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate HF BART MNLI into zero shot intent model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate Dialogue Nearest Neighbour Model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * refactor Dialogue SGD Data Processor to make interface for models cleaner Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Dialogue S2S Generation model for DialogueSGDDataProcessor interface Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * support sgd and drive thru datasets by zero shot model and nearest neighbour model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add prediction saving code to nearest neighbour and zero shot intent models Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo in sgd data processor Signed-off-by: Zhilin Wang 
<zhilinw@nvidia.com> * integrate Dialogue Mellon QA Data Processor Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update mellon qa Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update dialogue.py to remove outdated info Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update dialogue_config.yaml Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update dialogue_config.yaml Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add dialogue docs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address review comments Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix for cfg Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * make dependency on apex optional Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * change NLPDDPluggin calling logic to make it possible to run without apex Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add first draft of tutorial Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * reduce ms marco size by removing lines without wellFormedAnswers Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address pr comments Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update colab tutorial link in dialogue docs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * include unit test and some refactor to facilitate unit test Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address pr issues Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove typos in dialogue tutorial Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * support larger files for question answering 
Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unnecessary artifacts to reduce memory use Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * put 0 tensor to device Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update link within dialogue tutorial Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * restore previously delete files Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error handling when loss = nan Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update nan handling Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update spanning loss func Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update spanning loss Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix type error raised in qa_dataset.py Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add error checking message Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * revert back to float32 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * revert back to float32 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update exp logging Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update loading of large file from pickle to json Signed-off-by: 
Zhilin Wang <zhilinw@nvidia.com> * update loading of large file from pickle to json Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * limit number of negative samples Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * revert post processing Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * revert post processing Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused methods and style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add more documentation Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused imports Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * changes base on PR review Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> Co-authored-by: Zhilin Wang <zhilinw@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Fix bugs in indexed dataset exam script (#4325) * fix the typo Signed-off-by: Yi Dong <yidong@nvidia.com> * add neighbors option Signed-off-by: Yi Dong <yidong@nvidia.com> * change the argument name Signed-off-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Micha Livne <michalivne@users.noreply.github.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Adding docs for ASR SSL (#4303) * Initial commit for SSL docs Signed-off-by: Krishna Puvvada <kpuvvada@nvidia.com> * ssl docs update-1 Signed-off-by: Krishna Puvvada <kpuvvada@nvidia.com> * ssl docs update-2 Signed-off-by: Krishna Puvvada <kpuvvada@nvidia.com> Co-authored-by: Krishna Puvvada <kpuvvada@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Fuse grad division into async grad allreduce (#4327) * O2 runs but O1 does not Signed-off-by: ericharper <complex451@gmail.com> * disable async for O1 Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> * 
update async flag in configure_optimizers Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> * update _require if using async Signed-off-by: ericharper <complex451@gmail.com> * clean comments Signed-off-by: ericharper <complex451@gmail.com> * always all_reduce Signed-off-by: ericharper <complex451@gmail.com> * add async grad allreduce and chunk optimization to T5 * push reformatted files after style check * set chunk size as 0 while async grad allreduce is off * more experiments show that 125MB is a better default chunk size for most cases * add grad_allreduce_chunk_size_mb for GPT-3 * at the end of each training step, wait until all async grad allreduce works are done * replace individual allreduce work.wait() with a single dGPU evice synchroonization * add code of fused_bias_geglu * call fused_bias_geglu in ParallelMLP * record the status of each allreduce work seems too much for perf * add more comments * push a reformatted file * fix some bugs * change biad_gelu_activation to bias_activation_fusion * fix the setting of bias_actication_fusion for T5 * delete bias_gelu_fusion from T5 example config * push reformatted files * fuse grad scale with allreduce * push reformatted files * hto4h gemms fusion * remove hto4h gemms fusion * add grad_scale_ar_fusion into GPT-3 * push reformatted files * push reformatted files * rename grad_scale_ar_fusion to grad_div_ar_fusion * disable bias_activation_fusion while activation is not geglu * add bias_activation_fusion in yaml config file * add bias_gelu_fusion in T5 config yaml file to pass CI test * change bias_gelu_fusion to bias_activation_fusion for T5 CI test * recover latest change * add grad_div_ar_fusion in config yaml file * remove a redundant float() Co-authored-by: ericharper <complex451@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Hainan Xu 
<hainanx@nvidia.com> * Update container to 22.05 (#4329) * update container to 22.05 Signed-off-by: ericharper <complex451@gmail.com> * try adding safe directory Signed-off-by: ericharper <complex451@gmail.com> * try env var Signed-off-by: ericharper <complex451@gmail.com> * printenv Signed-off-by: ericharper <complex451@gmail.com> * try GIT_BRANCH Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> * remove dbug statements Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Torchaudio installation fix (#4330) * separate installer added Signed-off-by: Aleksandr Laptev <alaptev@nvidia.com> * apply suggestions, minor fixes Signed-off-by: Aleksandr Laptev <alaptev@nvidia.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * [TTS] enforced pin_memory = True (#4341) Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Merge r1.9.0 main (#4331) * update branch Signed-off-by: ericharper <complex451@gmail.com> * update package info Signed-off-by: ericharper <complex451@gmail.com> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Draft: Fix restoring from checkpoint for case when `model.common_dataset_parameters.label_vocab_dir` is provided (#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Do not create tmp directory Signed-off-by: 
PeganovAnton <peganoff2@mail.ru> * Fix parameter name Signed-off-by: PeganovAnton <peganoff2@mail.ru> * finish cherry-pick op Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix labels errors Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove duplicate stage Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Change target branch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * fix doc (#4146) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Tacotron2 retrain (#4103) * fix yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Fix for new TTSDataset class Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * inference fix Signed-off-by: treacker <emshabalin@yandex.ru> * removed old code Signed-off-by: treacker <emshabalin@yandex.ru> * updated parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * reverted version update Signed-off-by: treacker <emshabalin@yandex.ru> * refactored parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * Updated Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Update Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Update tacotron.yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang 
Zhang <yangzhang@nvidia.com> Signed-off-by: treacker <emshabalin@yandex.ru> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Fixed jenkins Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * Multiprocess improvements (#4127) * initial commit Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * start fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated scp to filelist Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * notebooks' link, typo and import fix (#4158) * redo missing pr 4007 Signed-off-by: fayejf <fayejf07@gmail.com> * remove extremely unreliable links Signed-off-by: fayejf <fayejf07@gmail.com> * update speaker docs (#4164) * update speaker docs Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * chunks -> segments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Khz -> kHz Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * small fix (#4180) Signed-off-by: fayejf <fayejf07@gmail.com> * fix the server key value problem (#4196) Signed-off-by: Yi Dong <yidong@nvidia.com> * Fix/punctuation/trainer required for setting test data (#4199) * Draft of fix Signed-off-by: PeganovAnton 
<peganoff2@mail.ru> * Add warnings and replace globa_step with current_epoch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Small improvements to warnings Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Error and warning messages improvements Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Replace self.trainer with self._trainer Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Update ContextNet version (#4207) Signed-off-by: smajumdar <smajumdar@nvidia.com> * fix bugs for dialogue tutorial (#4211) Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Dialogue tutorial fix (#4214) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Add docs for Thutmose Tagger (#4173) * Add docs for Thutmose Tagger Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add level in docs Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * delete folder to avoid error with running when folder exists from previous run Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> * Dialogue tutorial fix (#4218) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * restore previously deleted files Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * style fix Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Dialogue tutorial fix (#4221) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * restore previously deleted files Signed-off-by: Zhilin Wang 
<wangzhilin12061996@hotmail.com> * style fix Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * fix syntax error in ipynb-file (#4228) Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix json serialize (#4235) Signed-off-by: Yi Dong <yidong@nvidia.com> * Prompt Learning Typo Fixes (#4238) * Prompt tuning notebook typo fixes Signed-off-by: Virginia Adams <vadams@nvidia.com> * Update tutorials.rst * Update prompt_learning.rst * Update prompt_learning.rst * fixing bug 3642622 (#4250) * fixing bug 3642622 Signed-off-by: Ghasem Pasandi <gpasandi@nvidia.com> * fixing bug 3642622 Signed-off-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> * fix broken link in the tutorial (#4257) Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * Typo fix, branch change, better download messagae (#4262) Signed-off-by: Virginia Adams <vadams@nvidia.com> * Raise error if bicleaner is not installed in NMT Data preprocesing notebook (#4264) * Raise error if bicleaner is not installed Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Clear cells Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix missing validation dataset, whitelist certain keywords for datasets (#4269) * Fix missing validation dataset, whitelist certain keywords for datasets Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix missing validation dataset, whitelist certain keywords for datasets Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update asr configs with num_workers and pin_memory (#4270) Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix epoch end (#4265) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Eric Harper <complex451@gmail.com> * Set Save on train end to 
false (#4274) * Set Save on train end to false Signed-off-by: Virginia Adams <vadams@nvidia.com> * Update prompt_learning.rst * Update prompt_learning.rst * Update YAML (#4261) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Updated config to fix CI test OOM error (#4279) * Updated config to fix CI test issue Signed-off-by: Virginia Adams <vadams@nvidia.com> * Increased num workers Signed-off-by: Virginia Adams <vadams@nvidia.com> * verbose k2 install, skip if failed (#4289) Signed-off-by: Aleksandr Laptev <alaptev@nvidia.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> * Changed total virtual prompt tokens (#4295) * Changed total virtual prompt tokens Signed-off-by: Virginia Adams <vadams@nvidia.com> * put number of workers back Signed-off-by: Virginia Adams <vadams@nvidia.com> * upper bound lightning Signed-off-by: ericharper <complex451@gmail.com> * update branch Signed-off-by: ericharper <complex451@gmail.com> * update config Signed-off-by: ericharper <complex451@gmail.com> * remove duplicate test Signed-off-by: ericharper <complex451@gmail.com> * fix tn test cases Signed-off-by: ericharper <complex451@gmail.com> * add another safe.directory Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: treacker <36159472+treacker@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Yi Dong <43824965+yidong72@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> 
Co-authored-by: ekmb <ebakhturina@nvidia.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: Ghasem <35242805+pasandi20@users.noreply.github.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Aleksandr Laptev <laptevsasha12@gmail.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * [TTS] Training Fastpitch on German text and phonemes and finetuning HiFi-GAN on predicted mels (#4266) * initial commit Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * cleared notebook outputs Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * formatting errors Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * formatting Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * addressed comments Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * addressed comments on tutorial Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * updated tutorial Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * updated grammar and fastpitch description Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * updated with feedback Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * updated with feedback Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * removed unused imports Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Speedup the speech commands dataset processing script (#4347) * Add multiprocessing support to the google speech commands dataset processing script Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * fix number of args error with __extract_all_files function Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * fix styling issues Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * fix bugs with silence set construction and update librosa output write to use soundfile write Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * add docstrings 
and return values in __construct_filepaths as dictionary Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * fix wrong requirement (#4349) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Refactored path to manifest (#4251) Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * IPA support for TTS (#4310) * IPA tokenizer and G2P untested draft Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Add IPA CMUdict and new heteronyms list Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Add draft FastPitch IPA config Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Minor bugfixes for IPA training Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Add phoneme_probability to IPA G2P Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Updates to IPA FastPitch training config Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Update IPA dict and heteronyms file Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Adjust default lr for IPA FastPitch to 1e-3 Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Rename IPA CMUdict to reflect date Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Add docstrings for IPA tokenizer and G2P, update CMUdict path for config Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Fix IPA vocab ordering, add options to uppercase graphemes and remove stress symbols Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Mark IPA classes as experimental Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Update apostrophe-S cases Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Tn install (#4055) * remove conda pynini requirement Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * remove remnants Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * merge with main 
Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * removing nlp collection dependency from text processing and thus breaking cyclyc imports Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix wrong requirement Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix bug in vi Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * update jenkins folders Signed-off-by: ekmb <ebakhturina@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * fix tutorial (#4352) Signed-off-by: stevehuang52 <heh@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * fix the post ln (#4350) Signed-off-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * [Fix] Hanging for Fully Randomized Bucketing (#4348) * Update container to 22.05 (#4329) * update container to 22.05 Signed-off-by: ericharper <complex451@gmail.com> * try adding safe directory Signed-off-by: ericharper <complex451@gmail.com> * try env var Signed-off-by: ericharper <complex451@gmail.com> * printenv Signed-off-by: ericharper <complex451@gmail.com> * try GIT_BRANCH Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> * remove dbug statements Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: stevehuang52 <heh@nvidia.com> * Merge r1.9.0 main (#4331) * update branch Signed-off-by: ericharper <complex451@gmail.com> * update package info Signed-off-by: ericharper <complex451@gmail.com> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Draft: Fix restoring from checkpoint for case when 
`model.common_dataset_parameters.label_vocab_dir` is provided (#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Do not create tmp directory Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix parameter name Signed-off-by: PeganovAnton <peganoff2@mail.ru> * finish cherry-pick op Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix labels errors Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove duplicate stage Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Change target branch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * fix doc (#4146) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Tacotron2 retrain (#4103) * fix yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Fix for new TTSDataset class Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * inference fix Signed-off-by: treacker <emshabalin@yandex.ru> * removed old code Signed-off-by: treacker <emshabalin@yandex.ru> * updated parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * reverted version update Signed-off-by: treacker <emshabalin@yandex.ru> * refactored parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * Updated Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Update Jenkinsfile 
Signed-off-by: treacker <emshabalin@yandex.ru> * Update tacotron.yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: treacker <emshabalin@yandex.ru> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Fixed jenkins Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * Multiprocess improvements (#4127) * initial commit Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * start fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated scp to filelist Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * notebooks' link, typo and import fix (#4158) * redo missing pr 4007 Signed-off-by: fayejf <fayejf07@gmail.com> * remove extremely unreliable links Signed-off-by: fayejf <fayejf07@gmail.com> * update speaker docs (#4164) * update speaker docs Signed-off-by: nithinraok 
<nithinrao.koluguri@gmail.com> * chunks -> segments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Khz -> kHz Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * small fix (#4180) Signed-off-by: fayejf <fayejf07@gmail.com> * fix the server key value problem (#4196) Signed-off-by: Yi Dong <yidong@nvidia.com> * Fix/punctuation/trainer required for setting test data (#4199) * Draft of fix Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add warnings and replace globa_step with current_epoch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Small improvements to warnings Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Error and warning messages improvements Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Replace self.trainer with self._trainer Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Update ContextNet version (#4207) Signed-off-by: smajumdar <smajumdar@nvidia.com> * fix bugs for dialogue tutorial (#4211) Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Dialogue tutorial fix (#4214) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Add docs for Thutmose Tagger (#4173) * Add docs for Thutmose Tagger Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add level in docs Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * delete folder to avoid error with running when folder exists from previous run Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> * Dialogue tutorial fix (#4218) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * restore previously deleted files Signed-off-by: 
Zhilin Wang <wangzhilin12061996@hotmail.com> * style fix Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Dialogue tutorial fix (#4221) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * restore previously deleted files Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * style fix Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * fix syntax error in ipynb-file (#4228) Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix json serialize (#4235) Signed-off-by: Yi Dong <yidong@nvidia.com> * Prompt Learning Typo Fixes (#4238) * Prompt tuning notebook typo fixes Signed-off-by: Virginia Adams <vadams@nvidia.com> * Update tutorials.rst * Update prompt_learning.rst * Update prompt_learning.rst * fixing bug 3642622 (#4250) * fixing bug 3642622 Signed-off-by: Ghasem Pasandi <gpasandi@nvidia.com> * fixing bug 3642622 Signed-off-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> * fix broken link in the tutorial (#4257) Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * Typo fix, branch change, better download messagae (#4262) Signed-off-by: Virginia Adams <vadams@nvidia.com> * Raise error if bicleaner is not installed in NMT Data preprocesing notebook (#4264) * Raise error if bicleaner is not installed Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Clear cells Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix missing validation dataset, whitelist certain keywords for datasets (#4269) * Fix missing validation dataset, whitelist certain keywords for datasets Signed-off-by: smajumdar 
<smajumdar@nvidia.com> * Fix missing validation dataset, whitelist certain keywords for datasets Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update asr configs with num_workers and pin_memory (#4270) Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix epoch end (#4265) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Eric Harper <complex451@gmail.com> * Set Save on train end to false (#4274) * Set Save on train end to false Signed-off-by: Virginia Adams <vadams@nvidia.com> * Update prompt_learning.rst * Update prompt_learning.rst * Update YAML (#4261) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Updated config to fix CI test OOM error (#4279) * Updated config to fix CI test issue Signed-off-by: Virginia Adams <vadams@nvidia.com> * Increased num workers Signed-off-by: Virginia Adams <vadams@nvidia.com> * verbose k2 install, skip if failed (#4289) Signed-off-by: Aleksandr Laptev <alaptev@nvidia.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> * Changed total virtual prompt tokens (#4295) * Changed total virtual prompt tokens Signed-off-by: Virginia Adams <vadams@nvidia.com> * put number of workers back Signed-off-by: Virginia Adams <vadams@nvidia.com> * upper bound lightning Signed-off-by: ericharper <complex451@gmail.com> * update branch Signed-off-by: ericharper <complex451@gmail.com> * update config Signed-off-by: ericharper <complex451@gmail.com> * remove duplicate test Signed-off-by: ericharper <complex451@gmail.com> * fix tn test cases Signed-off-by: ericharper <complex451@gmail.com> * add another safe.directory Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: treacker <36159472+treacker@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: 
Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Yi Dong <43824965+yidong72@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: Ghasem <35242805+pasandi20@users.noreply.github.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Aleksandr Laptev <laptevsasha12@gmail.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> Signed-off-by: stevehuang52 <heh@nvidia.com> * fix full_randn bucket hang Signed-off-by: stevehuang52 <heh@nvidia.com> * remove unused variables Signed-off-by: stevehuang52 <heh@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: treacker <36159472+treacker@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Yi Dong <43824965+yidong72@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: Ghasem <35242805+pasandi20@users.noreply.github.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Aleksandr Laptev 
<laptevsasha12@gmail.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Bits of RADTTS support (#4343) * Bits of RADTTS support Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Fixed args mismatch Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Style Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Addressed review comments Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * More review comments Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Prompt Learning Pipeline Parallel (#4291) * Added get_forward_output_and_loss_func and updated train/val steps Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added preprocess flag before prompt table/encoder access Signed-off-by: Virginia Adams <vadams@nvidia.com> * Made two optimizer groups, one for frozen, one for soft prompt Signed-off-by: Virginia Adams <vadams@nvidia.com> * Pipeline parallel working Signed-off-by: Virginia Adams <vadams@nvidia.com> * Still figuring out setting lr/sched for one param group Signed-off-by: Virginia Adams <vadams@nvidia.com> * Set betas to zero Signed-off-by: Virginia Adams <vadams@nvidia.com> * Only unfreeze one sublayer with lr 0.0 Signed-off-by: Virginia Adams <vadams@nvidia.com> * Pipeline parallel working w/ one optimizer Signed-off-by: Virginia Adams <vadams@nvidia.com> * Trying to fix Jenkins file Signed-off-by: Virginia Adams <vadams@nvidia.com> * Trying to fix Jenkins file Signed-off-by: Virginia Adams <vadams@nvidia.com> * Getting updated jenkins test to work Signed-off-by: Virginia Adams <vadams@nvidia.com> * Getting updated jenkins test to work Signed-off-by: Virginia Adams <vadams@nvidia.com> * added prompt learning tp and pp CI tests Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added amp_o2 model compatibility Signed-off-by: Virginia Adams <vadams@nvidia.com> * Made CI test 
smaller Signed-off-by: Virginia Adams <vadams@nvidia.com> * Still trying to get Jenkins to work Signed-off-by: Virginia Adams <vadams@nvidia.com> * Still trying to get Jenkins to work Signed-off-by: Virginia Adams <vadams@nvidia.com> * Temporarily moving prompt learning CI test to beginning Signed-off-by: Virginia Adams <vadams@nvidia.com> * Changing the layer being unfrozen Signed-off-by: Virginia Adams <vadams@nvidia.com> * debug jenkins Signed-off-by: Virginia Adams <vadams@nvidia.com> * Move pp unfreeze to init Signed-off-by: Virginia Adams <vadams@nvidia.com> * Try to make Jenkins test parallel Signed-off-by: Virginia Adams <vadams@nvidia.com> * Fix python formatting Signed-off-by: Virginia Adams <vadams@nvidia.com> * Moved prompt learning tests back to where they belong Signed-off-by: Virginia Adams <vadams@nvidia.com> * add back checkpoint convertion CI test Signed-off-by: Virginia Adams <vadams@nvidia.com> * Revert "add back checkpoint convertion CI test" This reverts commit 61e2ffcdefe964c8e74b74d8c10906ae29f32b6d. 
* Add back checkpoint conversion test Signed-off-by: Virginia Adams <vadams@nvidia.com> * Setting requires grad to True everywhere Signed-off-by: Virginia Adams <vadams@nvidia.com> * Updated config comments and simplified param group code Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added comment on frozen_model having lr=0.0 Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added configure optimizers methods Signed-off-by: Virginia Adams <vadams@nvidia.com> * Set amp_o2 to false Signed-off-by: Virginia Adams <vadams@nvidia.com> * removed o2 code Signed-off-by: Virginia Adams <vadams@nvidia.com> * Python formatting fix Signed-off-by: Virginia Adams <vadams@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * auto switch conformer encoder adapter in_features (#4354) Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Dataloader, collector, loss and metric for multiscale diarization decoder (#4187) * First commit Signed-off-by: Taejin Park <tango4j@gmail.com> * Checked funtionality and imports Signed-off-by: Taejin Park <tango4j@gmail.com> * fixed import issues Signed-off-by: Taejin Park <tango4j@gmail.com> * Removed the changed made by mistake Signed-off-by: Taejin Park <tango4j@gmail.com> * Style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Fixed LGTM errors 001 Signed-off-by: Taejin Park <tango4j@gmail.com> * Fixed LGTM and style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Changed docstrings Signed-off-by: Taejin Park <tango4j@gmail.com> * LGTM again Signed-off-by: Taejin Park <tango4j@gmail.com> * Removed unnecessary torch setting lines Signed-off-by: Taejin Park <tango4j@gmail.com> * Style fix and isort Signed-off-by: Taejin Park <tango4j@gmail.com> * jbalam-nv comments reflected Signed-off-by: Taejin Park <tango4j@gmail.com> * style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Reflected comments and created 
_diar_label.py Signed-off-by: Taejin Park <tango4j@gmail.com> * Typo fix and style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Fixed target_spks[0] index error Signed-off-by: Taejin Park <tango4j@gmail.com> * style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * LGTM unused import IterDataset Signed-off-by: Taejin Park <tango4j@gmail.com> * revert collection doc year Signed-off-by: Taejin Park <tango4j@gmail.com> * Code format error in collections.py Signed-off-by: Taejin Park <tango4j@gmail.com> * fix collections space format error Signed-off-by: Taejin Park <tango4j@gmail.com> * merged main correctly Signed-off-by: Taejin Park <tango4j@gmail.com> * style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Reflected all comments and tested Signed-off-by: Taejin Park <tango4j@gmail.com> * style fix and LGTM Signed-off-by: Taejin Park <tango4j@gmail.com> * rttm_filepath to rttm_file and removed self included funcs, tested Signed-off-by: Taejin Park <tango4j@gmail.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Add ASR CTC Decoding module (#4342) * Initial commit Signed-off-by: smajumdar <smajumdar@nvidia.com> * Full support for decoding strategy Signed-off-by: smajumdar <smajumdar@nvidia.com> * Temp Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix labels of y_sequence Signed-off-by: smajumdar <smajumdar@nvidia.com> * Set support for sentencepiece subword merging Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix char and word based token merge alignment Signed-off-by: smajumdar <smajumdar@nvidia.com> * Revert incorrect change Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update docstring Signed-off-by: smajumdar <smajumdar@nvidia.com> * Improve compatibility with greedy tokens and log probs Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update scripts to use decoding strategy Signed-off-by: smajumdar <smajumdar@nvidia.com> * Add tests and docs Signed-off-by: smajumdar 
<smajumdar@nvidia.com> * Add tests and docs Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix speaker decoder timestamps Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix speaker decoder timestamps Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix decoding of ctc models Signed-off-by: smajumdar <smajumdar@nvidia.com> * Address reviewer comments Signed-off-by: smajumdar <smajumdar@nvidia.com> * Address reviewer comments Signed-off-by: smajumdar <smajumdar@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Option to disable mp in VAD via num_workers=1 (#4317) * Option to disable mp in VAD via num_workers=1 In certain environments python multiprocessing can deadlock. This adds a convenient version to disable by setting num_workers to 1. Signed-off-by: Georg Kucsko <gkucsko@gmail.com> * add none handling Signed-off-by: Georg Kucsko <gkucsko@gmail.com> * additional none handling Signed-off-by: Georg Kucsko <gkucsko@gmail.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * remove redundant bias expand (#4382) * remove redundant bias expand Signed-off-by: Xiaowei Ren <xren@nvidia.com> * delete redundant code Signed-off-by: Xiaowei Ren <xren@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Add option for specifying wandb save_dir from config (#4379) * give option to user to specify wandb save dir via config Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * create save_dir directory for wandb logger if not exists Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * update save_dir get method with a default value Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Quick wav2vec fix. In-place operation adding convolutional positions to encoder was overwriting leaf history. Wasn't caught on previous torch versions. 
(#4383) Signed-off-by: tbartley94 <tbartley@nvidia.com> Co-authored-by: tbartley94 <tbartley@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * [Bugfix][TTS] wrong order of returned tuple for general_collate_fn. (#4388) Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Merge r1.10.0 main (#4398) * update branch Signed-off-by: ericharper <complex451@gmail.com> * Set headscale false (#4364) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Add wandb as dependency (#4365) Signed-off-by: smajumdar <smajumdar@nvidia.com> * Raise trainer error (#4356) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Micha Livne <michalivne@users.noreply.github.com> * Set headscale false (#4364) (#4366) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: smajumdar <smajumdar@nvidia.com> * Finetuning changes for BART (#4003) * Temp Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Checkpoint converter to nemo for bart Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Style Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Micha Livne <michalivne@users.noreply.github.com> * Make position embedding expansion specific to a batch to avoid checkpoint size mismatches (#4357) * Style Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix logging warning Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Micha Livne <michalivne@users.noreply.github.com> * Fix electronic bug, new time ITN rule (#4355) * fix electronic bug Signed-off-by: ekmb <ebakhturina@nvidia.com> * add new itn time rule Signed-off-by: ekmb <ebakhturina@nvidia.com> * revert domain changes Signed-off-by: ekmb <ebakhturina@nvidia.com> * remove repetition Signed-off-by: ekmb <ebakhturina@nvidia.com> * Correct support for dataclasses in default module 
dim (#4372) * Correct support for dataclasses in default module dim Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix path for save of results Signed-off-by: smajumdar <smajumdar@nvidia.com> * fix pad id bug (#4377) Signed-off-by: Yi Dong <yidong@nvidia.com> * Question answering bug fix (#4381) * refactor dialogue state tracking for modelling/dataset interoperability Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style changes Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style raised by lgtm Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style formatting Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update template to include description of intent Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * changes based on requests in review Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add compatibility with assistant dataset Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove dialogue_state_tracking Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update huggingface utils for dialogue Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * rename dialogue_state_tracking_hybrid to dialogue_state_tracking_sgdqa Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix nemo/collections/nlp/models/dialogue_state_tracking_sgdqa/__init__.py Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN 
Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add docstrings for assistant data processsor Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins for SGDGEN local checkpoint Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * use local vocab file for Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * patch for Jenkins CI using local file Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add slot filling prediction and metrics Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused code Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix …
* stateless RNNT working Signed-off-by: Hainan Xu <hainanx@nvidia.com> * batch decode working Signed-off-by: Hainan Xu <hainanx@nvidia.com> * working backup Signed-off-by: Hainan Xu <hainanx@nvidia.com> * good working version Signed-off-by: Hainan Xu <hainanx@nvidia.com> * temporarily make norm layer have affine Signed-off-by: Hainan Xu <hainanx@nvidia.com> * temp Signed-off-by: Hainan Xu <hainanx@nvidia.com> * temp Signed-off-by: Hainan Xu <hainanx@nvidia.com> * [TTS] add staticmethod decoration for BetaBinomialInterpolator (#4319) Signed-off-by: Xuesong Yang <xuesongyxs@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * [TTS] remove redundant lines and declare global variables and capture (#4320) exception of non-supported windows. Signed-off-by: Xuesong Yang <xuesongyxs@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Finetune T5 on the prefix-lm objective (#4328) * Add script and yaml config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix yaml config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Style Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Update yaml to remove hardcoded model path Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Fuse bias with geglu in ParallelMLP (#4213) * add code of fused_bias_geglu * call fused_bias_geglu in ParallelMLP * fix some bugs * change biad_gelu_activation to bias_activation_fusion * fix the setting of bias_actication_fusion for T5 * delete bias_gelu_fusion from T5 example config * push reformatted files * hto4h gemms fusion * remove hto4h gemms fusion * push reformatted files * disable bias_activation_fusion while activation is not geglu * add bias_activation_fusion in yaml config file * add bias_gelu_fusion in T5 config yaml file to pass CI test * change bias_gelu_fusion to bias_activation_fusion for 
T5 CI test * recover latest change Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Support larger datasets for question answering (#4205) * refactor dialogue state tracking for modelling/dataset interoperability Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style changes Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style raised by lgtm Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style formatting Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update template to include description of intent Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * changes based on requests in review Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add compatibility with assistant dataset Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove dialogue_state_tracking Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update huggingface utils for dialogue Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * rename dialogue_state_tracking_hybrid to dialogue_state_tracking_sgdqa Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix nemo/collections/nlp/models/dialogue_state_tracking_sgdqa/__init__.py Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo Signed-off-by: Zhilin Wang 
<zhilinw@nvidia.com> * add docstrings for assistant data processsor Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins for SGDGEN local checkpoint Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * use local vocab file for Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * patch for Jenkins CI using local file Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add slot filling prediction and metrics Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused code Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * refactor metrics code out of Dialogue GPT Model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate backward compatible support for IntentSlotClassificationModel (bert model) Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * save prediction file for IntentSlotClassification Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update dialogue gpt model training for megatron gpt Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove batch generate for HF GPT2, which causes lower performance Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add few shot capability to dialogue gpt model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile and remove unused import Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update code description and clarity Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address PR comments Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate compatibility with ZeroShotIntentModel Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * rename folder to dialogue due to increased scope and further refactor for clarity Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * added dialogue GPT for sequence generation task (e.g. 
answer extender) Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add CI test for DialogueGPTGenerationModel Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate DialogueS2SGenerationModel for generation task (e.g. answer extender) Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * modify huggingface utils to support HF t5/BART models Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused imports Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update bleu metric Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix bleu metric style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * debug bleu metric Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * debug bleu metric Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update based on PR #3893 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update 2 based on PR #3893 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update 3 based on PR #3893 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate sgd generation based on user user utterance and system slot-values to generate system utterance Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add validation model saving capabilities Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * cleaned up code for SGD Based Answer extender Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Dialogue Generation CI Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix Jenkins CI issue" Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add support for design dataset Signed-off-by: Zhilin Wang 
<zhilinw@nvidia.com> * remove unnecessary imports Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * support megatron for dialogue_s2s_generation_model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * reduce loaded samples in MSMarcoDataProcessor to 64 when cfg.model.dataset.debug_mode=True Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update CI Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update checkpoint and predictions filename to include epoch number Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate HF BART MNLI into zero shot intent model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate Dialogue Nearest Neighbour Model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * refactor Dialogue SGD Data Processor to make interface for models cleaner Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Dialogue S2S Generation model for DialogueSGDDataProcessor interface Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * support sgd and drive thru datasets by zero shot model and nearest neighbour model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add prediction saving code to nearest neighbour and zero shot intent models Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo in sgd data processor Signed-off-by: Zhilin Wang 
<zhilinw@nvidia.com> * integrate Dialogue Mellon QA Data Processor Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update mellon qa Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update dialogue.py to remove outdated info Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update dialogue_config.yaml Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update dialogue_config.yaml Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add dialogue docs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address review comments Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix for cfg Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * make dependency on apex optional Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * change NLPDDPluggin calling logic to make it possible to run without apex Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add first draft of tutorial Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * reduce ms marco size by removing lines without wellFormedAnswers Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address pr comments Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update colab tutorial link in dialogue docs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * include unit test and some refactor to facilitate unit test Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address pr issues Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove typos in dialogue tutorial Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * support larger files for question answering 
Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unnecessary artifacts to reduce memory use Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * put 0 tensor to device Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update link within dialogue tutorial Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * restore previously delete files Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error handling when loss = nan Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update nan handling Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update spanning loss func Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update spanning loss Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix type error raised in qa_dataset.py Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add error checking message Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * revert back to float32 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * revert back to float32 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update exp logging Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update loading of large file from pickle to json Signed-off-by: 
Zhilin Wang <zhilinw@nvidia.com> * update loading of large file from pickle to json Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * limit number of negative samples Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * revert post processing Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * revert post processing Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused methods and style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add more documentation Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused imports Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * changes base on PR review Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> Co-authored-by: Zhilin Wang <zhilinw@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Fix bugs in indexed dataset exam script (#4325) * fix the typo Signed-off-by: Yi Dong <yidong@nvidia.com> * add neighbors option Signed-off-by: Yi Dong <yidong@nvidia.com> * change the argument name Signed-off-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Micha Livne <michalivne@users.noreply.github.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Adding docs for ASR SSL (#4303) * Initial commit for SSL docs Signed-off-by: Krishna Puvvada <kpuvvada@nvidia.com> * ssl docs update-1 Signed-off-by: Krishna Puvvada <kpuvvada@nvidia.com> * ssl docs update-2 Signed-off-by: Krishna Puvvada <kpuvvada@nvidia.com> Co-authored-by: Krishna Puvvada <kpuvvada@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Fuse grad division into async grad allreduce (#4327) * O2 runs but O1 does not Signed-off-by: ericharper <complex451@gmail.com> * disable async for O1 Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> * 
update async flag in configure_optimizers Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> * update _require if using async Signed-off-by: ericharper <complex451@gmail.com> * clean comments Signed-off-by: ericharper <complex451@gmail.com> * always all_reduce Signed-off-by: ericharper <complex451@gmail.com> * add async grad allreduce and chunk optimization to T5 * push reformatted files after style check * set chunk size as 0 while async grad allreduce is off * more experiments show that 125MB is a better default chunk size for most cases * add grad_allreduce_chunk_size_mb for GPT-3 * at the end of each training step, wait until all async grad allreduce works are done * replace individual allreduce work.wait() with a single GPU device synchronization * add code of fused_bias_geglu * call fused_bias_geglu in ParallelMLP * record the status of each allreduce work seems too much for perf * add more comments * push a reformatted file * fix some bugs * change biad_gelu_activation to bias_activation_fusion * fix the setting of bias_actication_fusion for T5 * delete bias_gelu_fusion from T5 example config * push reformatted files * fuse grad scale with allreduce * push reformatted files * hto4h gemms fusion * remove hto4h gemms fusion * add grad_scale_ar_fusion into GPT-3 * push reformatted files * push reformatted files * rename grad_scale_ar_fusion to grad_div_ar_fusion * disable bias_activation_fusion while activation is not geglu * add bias_activation_fusion in yaml config file * add bias_gelu_fusion in T5 config yaml file to pass CI test * change bias_gelu_fusion to bias_activation_fusion for T5 CI test * recover latest change * add grad_div_ar_fusion in config yaml file * remove a redundant float() Co-authored-by: ericharper <complex451@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Hainan Xu 
<hainanx@nvidia.com> * Update container to 22.05 (#4329) * update container to 22.05 Signed-off-by: ericharper <complex451@gmail.com> * try adding safe directory Signed-off-by: ericharper <complex451@gmail.com> * try env var Signed-off-by: ericharper <complex451@gmail.com> * printenv Signed-off-by: ericharper <complex451@gmail.com> * try GIT_BRANCH Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> * remove dbug statements Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Torchaudio installation fix (#4330) * separate installer added Signed-off-by: Aleksandr Laptev <alaptev@nvidia.com> * apply suggestions, minor fixes Signed-off-by: Aleksandr Laptev <alaptev@nvidia.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * [TTS] enforced pin_memory = True (#4341) Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Merge r1.9.0 main (#4331) * update branch Signed-off-by: ericharper <complex451@gmail.com> * update package info Signed-off-by: ericharper <complex451@gmail.com> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Draft: Fix restoring from checkpoint for case when `model.common_dataset_parameters.label_vocab_dir` is provided (#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Do not create tmp directory Signed-off-by: 
PeganovAnton <peganoff2@mail.ru> * Fix parameter name Signed-off-by: PeganovAnton <peganoff2@mail.ru> * finish cherry-pick op Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix labels errors Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove duplicate stage Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Change target branch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * fix doc (#4146) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Tacotron2 retrain (#4103) * fix yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Fix for new TTSDataset class Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * inference fix Signed-off-by: treacker <emshabalin@yandex.ru> * removed old code Signed-off-by: treacker <emshabalin@yandex.ru> * updated parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * reverted version update Signed-off-by: treacker <emshabalin@yandex.ru> * refactored parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * Updated Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Update Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Update tacotron.yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang 
Zhang <yangzhang@nvidia.com> Signed-off-by: treacker <emshabalin@yandex.ru> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Fixed jenkins Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * Multiprocess improvements (#4127) * initial commit Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * start fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated scp to filelist Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * notebooks' link, typo and import fix (#4158) * redo missing pr 4007 Signed-off-by: fayejf <fayejf07@gmail.com> * remove extremely unreliable links Signed-off-by: fayejf <fayejf07@gmail.com> * update speaker docs (#4164) * update speaker docs Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * chunks -> segments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Khz -> kHz Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * small fix (#4180) Signed-off-by: fayejf <fayejf07@gmail.com> * fix the server key value problem (#4196) Signed-off-by: Yi Dong <yidong@nvidia.com> * Fix/punctuation/trainer required for setting test data (#4199) * Draft of fix Signed-off-by: PeganovAnton 
<peganoff2@mail.ru> * Add warnings and replace globa_step with current_epoch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Small improvements to warnings Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Error and warning messages improvements Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Replace self.trainer with self._trainer Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Update ContextNet version (#4207) Signed-off-by: smajumdar <smajumdar@nvidia.com> * fix bugs for dialogue tutorial (#4211) Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Dialogue tutorial fix (#4214) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Add docs for Thutmose Tagger (#4173) * Add docs for Thutmose Tagger Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add level in docs Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * delete folder to avoid error with running when folder exists from previous run Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> * Dialogue tutorial fix (#4218) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * restore previously deleted files Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * style fix Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Dialogue tutorial fix (#4221) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * restore previously deleted files Signed-off-by: Zhilin Wang 
<wangzhilin12061996@hotmail.com> * style fix Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * fix syntax error in ipynb-file (#4228) Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix json serialize (#4235) Signed-off-by: Yi Dong <yidong@nvidia.com> * Prompt Learning Typo Fixes (#4238) * Prompt tuning notebook typo fixes Signed-off-by: Virginia Adams <vadams@nvidia.com> * Update tutorials.rst * Update prompt_learning.rst * Update prompt_learning.rst * fixing bug 3642622 (#4250) * fixing bug 3642622 Signed-off-by: Ghasem Pasandi <gpasandi@nvidia.com> * fixing bug 3642622 Signed-off-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> * fix broken link in the tutorial (#4257) Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * Typo fix, branch change, better download messagae (#4262) Signed-off-by: Virginia Adams <vadams@nvidia.com> * Raise error if bicleaner is not installed in NMT Data preprocesing notebook (#4264) * Raise error if bicleaner is not installed Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Clear cells Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix missing validation dataset, whitelist certain keywords for datasets (#4269) * Fix missing validation dataset, whitelist certain keywords for datasets Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix missing validation dataset, whitelist certain keywords for datasets Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update asr configs with num_workers and pin_memory (#4270) Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix epoch end (#4265) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Eric Harper <complex451@gmail.com> * Set Save on train end to 
false (#4274) * Set Save on train end to false Signed-off-by: Virginia Adams <vadams@nvidia.com> * Update prompt_learning.rst * Update prompt_learning.rst * Update YAML (#4261) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Updated config to fix CI test OOM error (#4279) * Updated config to fix CI test issue Signed-off-by: Virginia Adams <vadams@nvidia.com> * Increased num workers Signed-off-by: Virginia Adams <vadams@nvidia.com> * verbose k2 install, skip if failed (#4289) Signed-off-by: Aleksandr Laptev <alaptev@nvidia.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> * Changed total virtual prompt tokens (#4295) * Changed total virtual prompt tokens Signed-off-by: Virginia Adams <vadams@nvidia.com> * put number of workers back Signed-off-by: Virginia Adams <vadams@nvidia.com> * upper bound lightning Signed-off-by: ericharper <complex451@gmail.com> * update branch Signed-off-by: ericharper <complex451@gmail.com> * update config Signed-off-by: ericharper <complex451@gmail.com> * remove duplicate test Signed-off-by: ericharper <complex451@gmail.com> * fix tn test cases Signed-off-by: ericharper <complex451@gmail.com> * add another safe.directory Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: treacker <36159472+treacker@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Yi Dong <43824965+yidong72@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> 
Co-authored-by: ekmb <ebakhturina@nvidia.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: Ghasem <35242805+pasandi20@users.noreply.github.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Aleksandr Laptev <laptevsasha12@gmail.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * [TTS] Training Fastpitch on German text and phonemes and finetuning HiFi-GAN on predicted mels (#4266) * initial commit Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * cleared notebook outputs Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * formatting errors Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * formatting Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * addressed comments Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * addressed comments on tutorial Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * updated tutorial Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * updated grammar and fastpitch description Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * updated with feedback Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * updated with feedback Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * removed unused imports Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Speedup the speech commands dataset processing script (#4347) * Add multiprocessing support to the google speech commands dataset processing script Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * fix number of args error with __extract_all_files function Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * fix styling issues Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * fix bugs with silence set construction and update librosa output write to use soundfile write Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * add docstrings 
and return values in __construct_filepaths as dictionary Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * fix wrong requirement (#4349) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Refactored path to manifest (#4251) Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * IPA support for TTS (#4310) * IPA tokenizer and G2P untested draft Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Add IPA CMUdict and new heteronyms list Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Add draft FastPitch IPA config Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Minor bugfixes for IPA training Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Add phoneme_probability to IPA G2P Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Updates to IPA FastPitch training config Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Update IPA dict and heteronyms file Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Adjust default lr for IPA FastPitch to 1e-3 Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Rename IPA CMUdict to reflect date Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Add docstrings for IPA tokenizer and G2P, update CMUdict path for config Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Fix IPA vocab ordering, add options to uppercase graphemes and remove stress symbols Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Mark IPA classes as experimental Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Update apostrophe-S cases Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Tn install (#4055) * remove conda pynini requirement Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * remove remnants Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * merge with main 
Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * removing nlp collection dependency from text processing and thus breaking cyclyc imports Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix wrong requirement Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix bug in vi Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * update jenkins folders Signed-off-by: ekmb <ebakhturina@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * fix tutorial (#4352) Signed-off-by: stevehuang52 <heh@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * fix the post ln (#4350) Signed-off-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * [Fix] Hanging for Fully Randomized Bucketing (#4348) * Update container to 22.05 (#4329) * update container to 22.05 Signed-off-by: ericharper <complex451@gmail.com> * try adding safe directory Signed-off-by: ericharper <complex451@gmail.com> * try env var Signed-off-by: ericharper <complex451@gmail.com> * printenv Signed-off-by: ericharper <complex451@gmail.com> * try GIT_BRANCH Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> * remove dbug statements Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: stevehuang52 <heh@nvidia.com> * Merge r1.9.0 main (#4331) * update branch Signed-off-by: ericharper <complex451@gmail.com> * update package info Signed-off-by: ericharper <complex451@gmail.com> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Draft: Fix restoring from checkpoint for case when 
`model.common_dataset_parameters.label_vocab_dir` is provided (#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Do not create tmp directory Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix parameter name Signed-off-by: PeganovAnton <peganoff2@mail.ru> * finish cherry-pick op Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix labels errors Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove duplicate stage Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Change target branch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * fix doc (#4146) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Tacotron2 retrain (#4103) * fix yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Fix for new TTSDataset class Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * inference fix Signed-off-by: treacker <emshabalin@yandex.ru> * removed old code Signed-off-by: treacker <emshabalin@yandex.ru> * updated parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * reverted version update Signed-off-by: treacker <emshabalin@yandex.ru> * refactored parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * Updated Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Update Jenkinsfile 
Signed-off-by: treacker <emshabalin@yandex.ru> * Update tacotron.yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: treacker <emshabalin@yandex.ru> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Fixed jenkins Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * Multiprocess improvements (#4127) * initial commit Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * start fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated scp to filelist Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * notebooks' link, typo and import fix (#4158) * redo missing pr 4007 Signed-off-by: fayejf <fayejf07@gmail.com> * remove extremely unreliable links Signed-off-by: fayejf <fayejf07@gmail.com> * update speaker docs (#4164) * update speaker docs Signed-off-by: nithinraok 
<nithinrao.koluguri@gmail.com> * chunks -> segments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Khz -> kHz Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * small fix (#4180) Signed-off-by: fayejf <fayejf07@gmail.com> * fix the server key value problem (#4196) Signed-off-by: Yi Dong <yidong@nvidia.com> * Fix/punctuation/trainer required for setting test data (#4199) * Draft of fix Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add warnings and replace globa_step with current_epoch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Small improvements to warnings Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Error and warning messages improvements Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Replace self.trainer with self._trainer Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Update ContextNet version (#4207) Signed-off-by: smajumdar <smajumdar@nvidia.com> * fix bugs for dialogue tutorial (#4211) Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Dialogue tutorial fix (#4214) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Add docs for Thutmose Tagger (#4173) * Add docs for Thutmose Tagger Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add level in docs Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * delete folder to avoid error with running when folder exists from previous run Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> * Dialogue tutorial fix (#4218) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * restore previously deleted files Signed-off-by: 
Zhilin Wang <wangzhilin12061996@hotmail.com> * style fix Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Dialogue tutorial fix (#4221) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * restore previously deleted files Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * style fix Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * fix syntax error in ipynb-file (#4228) Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix json serialize (#4235) Signed-off-by: Yi Dong <yidong@nvidia.com> * Prompt Learning Typo Fixes (#4238) * Prompt tuning notebook typo fixes Signed-off-by: Virginia Adams <vadams@nvidia.com> * Update tutorials.rst * Update prompt_learning.rst * Update prompt_learning.rst * fixing bug 3642622 (#4250) * fixing bug 3642622 Signed-off-by: Ghasem Pasandi <gpasandi@nvidia.com> * fixing bug 3642622 Signed-off-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> * fix broken link in the tutorial (#4257) Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * Typo fix, branch change, better download messagae (#4262) Signed-off-by: Virginia Adams <vadams@nvidia.com> * Raise error if bicleaner is not installed in NMT Data preprocesing notebook (#4264) * Raise error if bicleaner is not installed Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Clear cells Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix missing validation dataset, whitelist certain keywords for datasets (#4269) * Fix missing validation dataset, whitelist certain keywords for datasets Signed-off-by: smajumdar 
<smajumdar@nvidia.com> * Fix missing validation dataset, whitelist certain keywords for datasets Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update asr configs with num_workers and pin_memory (#4270) Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix epoch end (#4265) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Eric Harper <complex451@gmail.com> * Set Save on train end to false (#4274) * Set Save on train end to false Signed-off-by: Virginia Adams <vadams@nvidia.com> * Update prompt_learning.rst * Update prompt_learning.rst * Update YAML (#4261) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Updated config to fix CI test OOM error (#4279) * Updated config to fix CI test issue Signed-off-by: Virginia Adams <vadams@nvidia.com> * Increased num workers Signed-off-by: Virginia Adams <vadams@nvidia.com> * verbose k2 install, skip if failed (#4289) Signed-off-by: Aleksandr Laptev <alaptev@nvidia.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> * Changed total virtual prompt tokens (#4295) * Changed total virtual prompt tokens Signed-off-by: Virginia Adams <vadams@nvidia.com> * put number of workers back Signed-off-by: Virginia Adams <vadams@nvidia.com> * upper bound lightning Signed-off-by: ericharper <complex451@gmail.com> * update branch Signed-off-by: ericharper <complex451@gmail.com> * update config Signed-off-by: ericharper <complex451@gmail.com> * remove duplicate test Signed-off-by: ericharper <complex451@gmail.com> * fix tn test cases Signed-off-by: ericharper <complex451@gmail.com> * add another safe.directory Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: treacker <36159472+treacker@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: 
Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Yi Dong <43824965+yidong72@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: Ghasem <35242805+pasandi20@users.noreply.github.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Aleksandr Laptev <laptevsasha12@gmail.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> Signed-off-by: stevehuang52 <heh@nvidia.com> * fix full_randn bucket hang Signed-off-by: stevehuang52 <heh@nvidia.com> * remove unused variables Signed-off-by: stevehuang52 <heh@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: treacker <36159472+treacker@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Yi Dong <43824965+yidong72@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: Ghasem <35242805+pasandi20@users.noreply.github.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Aleksandr Laptev 
<laptevsasha12@gmail.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Bits of RADTTS support (#4343) * Bits of RADTTS support Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Fixed args mismatch Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Style Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Addressed review comments Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * More review comments Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Prompt Learning Pipeline Parallel (#4291) * Added get_forward_output_and_loss_func and updated train/val steps Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added preprocess flag before prompt table/encoder access Signed-off-by: Virginia Adams <vadams@nvidia.com> * Made two optimizer groups, one for frozen, one for soft prompt Signed-off-by: Virginia Adams <vadams@nvidia.com> * Pipeline parallel working Signed-off-by: Virginia Adams <vadams@nvidia.com> * Still figuring out setting lr/sched for one param group Signed-off-by: Virginia Adams <vadams@nvidia.com> * Set betas to zero Signed-off-by: Virginia Adams <vadams@nvidia.com> * Only unfreeze one sublayer with lr 0.0 Signed-off-by: Virginia Adams <vadams@nvidia.com> * Pipeline parallel working w/ one optimizer Signed-off-by: Virginia Adams <vadams@nvidia.com> * Trying to fix Jenkins file Signed-off-by: Virginia Adams <vadams@nvidia.com> * Trying to fix Jenkins file Signed-off-by: Virginia Adams <vadams@nvidia.com> * Getting updated jenkins test to work Signed-off-by: Virginia Adams <vadams@nvidia.com> * Getting updated jenkins test to work Signed-off-by: Virginia Adams <vadams@nvidia.com> * added prompt learning tp and pp CI tests Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added amp_o2 model compatibility Signed-off-by: Virginia Adams <vadams@nvidia.com> * Made CI test 
smaller Signed-off-by: Virginia Adams <vadams@nvidia.com> * Still trying to get Jenkins to work Signed-off-by: Virginia Adams <vadams@nvidia.com> * Still trying to get Jenkins to work Signed-off-by: Virginia Adams <vadams@nvidia.com> * Temporarily moving prompt learning CI test to beginning Signed-off-by: Virginia Adams <vadams@nvidia.com> * Changing the layer being unfrozen Signed-off-by: Virginia Adams <vadams@nvidia.com> * debug jenkins Signed-off-by: Virginia Adams <vadams@nvidia.com> * Move pp unfreeze to init Signed-off-by: Virginia Adams <vadams@nvidia.com> * Try to make Jenkins test parallel Signed-off-by: Virginia Adams <vadams@nvidia.com> * Fix python formatting Signed-off-by: Virginia Adams <vadams@nvidia.com> * Moved prompt learning tests back to where they belong Signed-off-by: Virginia Adams <vadams@nvidia.com> * add back checkpoint convertion CI test Signed-off-by: Virginia Adams <vadams@nvidia.com> * Revert "add back checkpoint convertion CI test" This reverts commit 61e2ffcdefe964c8e74b74d8c10906ae29f32b6d. 
* Add back checkpoint conversion test Signed-off-by: Virginia Adams <vadams@nvidia.com> * Setting requires grad to True everywhere Signed-off-by: Virginia Adams <vadams@nvidia.com> * Updated config comments and simplified param group code Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added comment on frozen_model having lr=0.0 Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added configure optimizers methods Signed-off-by: Virginia Adams <vadams@nvidia.com> * Set amp_o2 to false Signed-off-by: Virginia Adams <vadams@nvidia.com> * removed o2 code Signed-off-by: Virginia Adams <vadams@nvidia.com> * Python formatting fix Signed-off-by: Virginia Adams <vadams@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * auto switch conformer encoder adapter in_features (#4354) Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Dataloader, collector, loss and metric for multiscale diarization decoder (#4187) * First commit Signed-off-by: Taejin Park <tango4j@gmail.com> * Checked funtionality and imports Signed-off-by: Taejin Park <tango4j@gmail.com> * fixed import issues Signed-off-by: Taejin Park <tango4j@gmail.com> * Removed the changed made by mistake Signed-off-by: Taejin Park <tango4j@gmail.com> * Style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Fixed LGTM errors 001 Signed-off-by: Taejin Park <tango4j@gmail.com> * Fixed LGTM and style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Changed docstrings Signed-off-by: Taejin Park <tango4j@gmail.com> * LGTM again Signed-off-by: Taejin Park <tango4j@gmail.com> * Removed unnecessary torch setting lines Signed-off-by: Taejin Park <tango4j@gmail.com> * Style fix and isort Signed-off-by: Taejin Park <tango4j@gmail.com> * jbalam-nv comments reflected Signed-off-by: Taejin Park <tango4j@gmail.com> * style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Reflected comments and created 
_diar_label.py Signed-off-by: Taejin Park <tango4j@gmail.com> * Typo fix and style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Fixed target_spks[0] index error Signed-off-by: Taejin Park <tango4j@gmail.com> * style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * LGTM unused import IterDataset Signed-off-by: Taejin Park <tango4j@gmail.com> * revert collection doc year Signed-off-by: Taejin Park <tango4j@gmail.com> * Code format error in collections.py Signed-off-by: Taejin Park <tango4j@gmail.com> * fix collections space format error Signed-off-by: Taejin Park <tango4j@gmail.com> * merged main correctly Signed-off-by: Taejin Park <tango4j@gmail.com> * style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Reflected all comments and tested Signed-off-by: Taejin Park <tango4j@gmail.com> * style fix and LGTM Signed-off-by: Taejin Park <tango4j@gmail.com> * rttm_filepath to rttm_file and removed self included funcs, tested Signed-off-by: Taejin Park <tango4j@gmail.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Add ASR CTC Decoding module (#4342) * Initial commit Signed-off-by: smajumdar <smajumdar@nvidia.com> * Full support for decoding strategy Signed-off-by: smajumdar <smajumdar@nvidia.com> * Temp Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix labels of y_sequence Signed-off-by: smajumdar <smajumdar@nvidia.com> * Set support for sentencepiece subword merging Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix char and word based token merge alignment Signed-off-by: smajumdar <smajumdar@nvidia.com> * Revert incorrect change Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update docstring Signed-off-by: smajumdar <smajumdar@nvidia.com> * Improve compatibility with greedy tokens and log probs Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update scripts to use decoding strategy Signed-off-by: smajumdar <smajumdar@nvidia.com> * Add tests and docs Signed-off-by: smajumdar 
<smajumdar@nvidia.com> * Add tests and docs Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix speaker decoder timestamps Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix speaker decoder timestamps Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix decoding of ctc models Signed-off-by: smajumdar <smajumdar@nvidia.com> * Address reviewer comments Signed-off-by: smajumdar <smajumdar@nvidia.com> * Address reviewer comments Signed-off-by: smajumdar <smajumdar@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Option to disable mp in VAD via num_workers=1 (#4317) * Option to disable mp in VAD via num_workers=1 In certain environments python multiprocessing can deadlock. This adds a convenient version to disable by setting num_workers to 1. Signed-off-by: Georg Kucsko <gkucsko@gmail.com> * add none handling Signed-off-by: Georg Kucsko <gkucsko@gmail.com> * additional none handling Signed-off-by: Georg Kucsko <gkucsko@gmail.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * remove redundant bias expand (#4382) * remove redundant bias expand Signed-off-by: Xiaowei Ren <xren@nvidia.com> * delete redundant code Signed-off-by: Xiaowei Ren <xren@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Add option for specifying wandb save_dir from config (#4379) * give option to user to specify wandb save dir via config Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * create save_dir directory for wandb logger if not exists Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * update save_dir get method with a default value Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Quick wav2vec fix. In-place operation adding convolutional positions to encoder was overwriting leaf history. Wasn't caught on previous torch versions. 
(#4383) Signed-off-by: tbartley94 <tbartley@nvidia.com> Co-authored-by: tbartley94 <tbartley@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * [Bugfix][TTS] wrong order of returned tuple for general_collate_fn. (#4388) Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Merge r1.10.0 main (#4398) * update branch Signed-off-by: ericharper <complex451@gmail.com> * Set headscale false (#4364) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Add wandb as dependency (#4365) Signed-off-by: smajumdar <smajumdar@nvidia.com> * Raise trainer error (#4356) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Micha Livne <michalivne@users.noreply.github.com> * Set headscale false (#4364) (#4366) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: smajumdar <smajumdar@nvidia.com> * Finetuning changes for BART (#4003) * Temp Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Checkpoint converter to nemo for bart Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Style Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Micha Livne <michalivne@users.noreply.github.com> * Make position embedding expansion specific to a batch to avoid checkpoint size mismatches (#4357) * Style Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix logging warning Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Micha Livne <michalivne@users.noreply.github.com> * Fix electronic bug, new time ITN rule (#4355) * fix electronic bug Signed-off-by: ekmb <ebakhturina@nvidia.com> * add new itn time rule Signed-off-by: ekmb <ebakhturina@nvidia.com> * revert domain changes Signed-off-by: ekmb <ebakhturina@nvidia.com> * remove repetition Signed-off-by: ekmb <ebakhturina@nvidia.com> * Correct support for dataclasses in default module 
dim (#4372) * Correct support for dataclasses in default module dim Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix path for save of results Signed-off-by: smajumdar <smajumdar@nvidia.com> * fix pad id bug (#4377) Signed-off-by: Yi Dong <yidong@nvidia.com> * Question answering bug fix (#4381) * refactor dialogue state tracking for modelling/dataset interoperability Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style changes Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style raised by lgtm Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style formatting Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update template to include description of intent Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * changes based on requests in review Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add compatibility with assistant dataset Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove dialogue_state_tracking Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update huggingface utils for dialogue Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * rename dialogue_state_tracking_hybrid to dialogue_state_tracking_sgdqa Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix nemo/collections/nlp/models/dialogue_state_tracking_sgdqa/__init__.py Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN 
Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add docstrings for assistant data processsor Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins for SGDGEN local checkpoint Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * use local vocab file for Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * patch for Jenkins CI using local file Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add slot filling prediction and metrics Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused code Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix …
* stateless RNNT working Signed-off-by: Hainan Xu <hainanx@nvidia.com> * batch decode working Signed-off-by: Hainan Xu <hainanx@nvidia.com> * working backup Signed-off-by: Hainan Xu <hainanx@nvidia.com> * good working version Signed-off-by: Hainan Xu <hainanx@nvidia.com> * temporarily make norm layer have affine Signed-off-by: Hainan Xu <hainanx@nvidia.com> * temp Signed-off-by: Hainan Xu <hainanx@nvidia.com> * temp Signed-off-by: Hainan Xu <hainanx@nvidia.com> * [TTS] add staticmethod decoration for BetaBinomialInterpolator (#4319) Signed-off-by: Xuesong Yang <xuesongyxs@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * [TTS] remove redundant lines and declare global variables and capture (#4320) exception of non-supported windows. Signed-off-by: Xuesong Yang <xuesongyxs@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Finetune T5 on the prefix-lm objective (#4328) * Add script and yaml config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix yaml config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Style Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Update yaml to remove hardcoded model path Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Fuse bias with geglu in ParallelMLP (#4213) * add code of fused_bias_geglu * call fused_bias_geglu in ParallelMLP * fix some bugs * change biad_gelu_activation to bias_activation_fusion * fix the setting of bias_actication_fusion for T5 * delete bias_gelu_fusion from T5 example config * push reformatted files * hto4h gemms fusion * remove hto4h gemms fusion * push reformatted files * disable bias_activation_fusion while activation is not geglu * add bias_activation_fusion in yaml config file * add bias_gelu_fusion in T5 config yaml file to pass CI test * change bias_gelu_fusion to bias_activation_fusion for 
T5 CI test * recover latest change Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Support larger datasets for question answering (#4205) * refactor dialogue state tracking for modelling/dataset interoperability Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style changes Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style raised by lgtm Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style formatting Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update template to include description of intent Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * changes based on requests in review Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add compatibility with assistant dataset Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove dialogue_state_tracking Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update huggingface utils for dialogue Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * rename dialogue_state_tracking_hybrid to dialogue_state_tracking_sgdqa Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix nemo/collections/nlp/models/dialogue_state_tracking_sgdqa/__init__.py Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo Signed-off-by: Zhilin Wang 
<zhilinw@nvidia.com> * add docstrings for assistant data processsor Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins for SGDGEN local checkpoint Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * use local vocab file for Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * patch for Jenkins CI using local file Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add slot filling prediction and metrics Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused code Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * refactor metrics code out of Dialogue GPT Model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate backward compatible support for IntentSlotClassificationModel (bert model) Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * save prediction file for IntentSlotClassification Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update dialogue gpt model training for megatron gpt Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove batch generate for HF GPT2, which causes lower performance Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add few shot capability to dialogue gpt model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile and remove unused import Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update code description and clarity Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address PR comments Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate compatibility with ZeroShotIntentModel Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * rename folder to dialogue due to increased scope and further refactor for clarity Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * added dialogue GPT for sequence generation task (e.g. 
answer extender) Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add CI test for DialogueGPTGenerationModel Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate DialogueS2SGenerationModel for generation task (e.g. answer extender) Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * modify huggingface utils to support HF t5/BART models Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused imports Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update bleu metric Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix bleu metric style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * debug bleu metric Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * debug bleu metric Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update based on PR #3893 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update 2 based on PR #3893 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update 3 based on PR #3893 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate sgd generation based on user user utterance and system slot-values to generate system utterance Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add validation model saving capabilities Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * cleaned up code for SGD Based Answer extender Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Dialogue Generation CI Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix Jenkins CI issue" Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add support for design dataset Signed-off-by: Zhilin Wang 
<zhilinw@nvidia.com> * remove unnecessary imports Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * support megatron for dialogue_s2s_generation_model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * reduce loaded samples in MSMarcoDataProcessor to 64 when cfg.model.dataset.debug_mode=True Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update CI Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update checkpoint and predictions filename to include epoch number Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate HF BART MNLI into zero shot intent model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate Dialogue Nearest Neighbour Model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * refactor Dialogue SGD Data Processor to make interface for models cleaner Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Dialogue S2S Generation model for DialogueSGDDataProcessor interface Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * support sgd and drive thru datasets by zero shot model and nearest neighbour model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add prediction saving code to nearest neighbour and zero shot intent models Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo in sgd data processor Signed-off-by: Zhilin Wang 
<zhilinw@nvidia.com> * integrate Dialogue Mellon QA Data Processor Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update mellon qa Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update dialogue.py to remove outdated info Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update dialogue_config.yaml Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update dialogue_config.yaml Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add dialogue docs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address review comments Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix for cfg Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * make dependency on apex optional Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * change NLPDDPluggin calling logic to make it possible to run without apex Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add first draft of tutorial Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * reduce ms marco size by removing lines without wellFormedAnswers Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address pr comments Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update colab tutorial link in dialogue docs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * include unit test and some refactor to facilitate unit test Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address pr issues Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove typos in dialogue tutorial Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * support larger files for question answering 
Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unnecessary artifacts to reduce memory use Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * put 0 tensor to device Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update link within dialogue tutorial Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * restore previously delete files Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error handling when loss = nan Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update nan handling Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update spanning loss func Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update spanning loss Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix type error raised in qa_dataset.py Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add error checking message Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * revert back to float32 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * revert back to float32 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update exp logging Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update loading of large file from pickle to json Signed-off-by: 
Zhilin Wang <zhilinw@nvidia.com> * update loading of large file from pickle to json Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * limit number of negative samples Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * revert post processing Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * revert post processing Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused methods and style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add more documentation Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused imports Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * changes base on PR review Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> Co-authored-by: Zhilin Wang <zhilinw@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Fix bugs in indexed dataset exam script (#4325) * fix the typo Signed-off-by: Yi Dong <yidong@nvidia.com> * add neighbors option Signed-off-by: Yi Dong <yidong@nvidia.com> * change the argument name Signed-off-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Micha Livne <michalivne@users.noreply.github.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Adding docs for ASR SSL (#4303) * Initial commit for SSL docs Signed-off-by: Krishna Puvvada <kpuvvada@nvidia.com> * ssl docs update-1 Signed-off-by: Krishna Puvvada <kpuvvada@nvidia.com> * ssl docs update-2 Signed-off-by: Krishna Puvvada <kpuvvada@nvidia.com> Co-authored-by: Krishna Puvvada <kpuvvada@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Fuse grad division into async grad allreduce (#4327) * O2 runs but O1 does not Signed-off-by: ericharper <complex451@gmail.com> * disable async for O1 Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> * 
update async flag in configure_optimizers Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> * update _require if using async Signed-off-by: ericharper <complex451@gmail.com> * clean comments Signed-off-by: ericharper <complex451@gmail.com> * always all_reduce Signed-off-by: ericharper <complex451@gmail.com> * add async grad allreduce and chunk optimization to T5 * push reformatted files after style check * set chunk size as 0 while async grad allreduce is off * more experiments show that 125MB is a better default chunk size for most cases * add grad_allreduce_chunk_size_mb for GPT-3 * at the end of each training step, wait until all async grad allreduce works are done * replace individual allreduce work.wait() with a single GPU device synchronization * add code of fused_bias_geglu * call fused_bias_geglu in ParallelMLP * record the status of each allreduce work seems too much for perf * add more comments * push a reformatted file * fix some bugs * change biad_gelu_activation to bias_activation_fusion * fix the setting of bias_actication_fusion for T5 * delete bias_gelu_fusion from T5 example config * push reformatted files * fuse grad scale with allreduce * push reformatted files * hto4h gemms fusion * remove hto4h gemms fusion * add grad_scale_ar_fusion into GPT-3 * push reformatted files * push reformatted files * rename grad_scale_ar_fusion to grad_div_ar_fusion * disable bias_activation_fusion while activation is not geglu * add bias_activation_fusion in yaml config file * add bias_gelu_fusion in T5 config yaml file to pass CI test * change bias_gelu_fusion to bias_activation_fusion for T5 CI test * recover latest change * add grad_div_ar_fusion in config yaml file * remove a redundant float() Co-authored-by: ericharper <complex451@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Hainan Xu 
<hainanx@nvidia.com> * Update container to 22.05 (#4329) * update container to 22.05 Signed-off-by: ericharper <complex451@gmail.com> * try adding safe directory Signed-off-by: ericharper <complex451@gmail.com> * try env var Signed-off-by: ericharper <complex451@gmail.com> * printenv Signed-off-by: ericharper <complex451@gmail.com> * try GIT_BRANCH Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> * remove dbug statements Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Torchaudio installation fix (#4330) * separate installer added Signed-off-by: Aleksandr Laptev <alaptev@nvidia.com> * apply suggestions, minor fixes Signed-off-by: Aleksandr Laptev <alaptev@nvidia.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * [TTS] enforced pin_memory = True (#4341) Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Merge r1.9.0 main (#4331) * update branch Signed-off-by: ericharper <complex451@gmail.com> * update package info Signed-off-by: ericharper <complex451@gmail.com> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Draft: Fix restoring from checkpoint for case when `model.common_dataset_parameters.label_vocab_dir` is provided (#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Do not create tmp directory Signed-off-by: 
PeganovAnton <peganoff2@mail.ru> * Fix parameter name Signed-off-by: PeganovAnton <peganoff2@mail.ru> * finish cherry-pick op Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix labels errors Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove duplicate stage Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Change target branch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * fix doc (#4146) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Tacotron2 retrain (#4103) * fix yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Fix for new TTSDataset class Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * inference fix Signed-off-by: treacker <emshabalin@yandex.ru> * removed old code Signed-off-by: treacker <emshabalin@yandex.ru> * updated parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * reverted version update Signed-off-by: treacker <emshabalin@yandex.ru> * refactored parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * Updated Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Update Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Update tacotron.yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang 
Zhang <yangzhang@nvidia.com> Signed-off-by: treacker <emshabalin@yandex.ru> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Fixed jenkins Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * Multiprocess improvements (#4127) * initial commit Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * start fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated scp to filelist Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * notebooks' link, typo and import fix (#4158) * redo missing pr 4007 Signed-off-by: fayejf <fayejf07@gmail.com> * remove extremely unreliable links Signed-off-by: fayejf <fayejf07@gmail.com> * update speaker docs (#4164) * update speaker docs Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * chunks -> segments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Khz -> kHz Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * small fix (#4180) Signed-off-by: fayejf <fayejf07@gmail.com> * fix the server key value problem (#4196) Signed-off-by: Yi Dong <yidong@nvidia.com> * Fix/punctuation/trainer required for setting test data (#4199) * Draft of fix Signed-off-by: PeganovAnton 
<peganoff2@mail.ru> * Add warnings and replace globa_step with current_epoch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Small improvements to warnings Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Error and warning messages improvements Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Replace self.trainer with self._trainer Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Update ContextNet version (#4207) Signed-off-by: smajumdar <smajumdar@nvidia.com> * fix bugs for dialogue tutorial (#4211) Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Dialogue tutorial fix (#4214) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Add docs for Thutmose Tagger (#4173) * Add docs for Thutmose Tagger Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add level in docs Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * delete folder to avoid error with running when folder exists from previous run Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> * Dialogue tutorial fix (#4218) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * restore previously deleted files Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * style fix Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Dialogue tutorial fix (#4221) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * restore previously deleted files Signed-off-by: Zhilin Wang 
<wangzhilin12061996@hotmail.com> * style fix Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * fix syntax error in ipynb-file (#4228) Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix json serialize (#4235) Signed-off-by: Yi Dong <yidong@nvidia.com> * Prompt Learning Typo Fixes (#4238) * Prompt tuning notebook typo fixes Signed-off-by: Virginia Adams <vadams@nvidia.com> * Update tutorials.rst * Update prompt_learning.rst * Update prompt_learning.rst * fixing bug 3642622 (#4250) * fixing bug 3642622 Signed-off-by: Ghasem Pasandi <gpasandi@nvidia.com> * fixing bug 3642622 Signed-off-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> * fix broken link in the tutorial (#4257) Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * Typo fix, branch change, better download messagae (#4262) Signed-off-by: Virginia Adams <vadams@nvidia.com> * Raise error if bicleaner is not installed in NMT Data preprocesing notebook (#4264) * Raise error if bicleaner is not installed Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Clear cells Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix missing validation dataset, whitelist certain keywords for datasets (#4269) * Fix missing validation dataset, whitelist certain keywords for datasets Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix missing validation dataset, whitelist certain keywords for datasets Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update asr configs with num_workers and pin_memory (#4270) Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix epoch end (#4265) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Eric Harper <complex451@gmail.com> * Set Save on train end to 
false (#4274) * Set Save on train end to false Signed-off-by: Virginia Adams <vadams@nvidia.com> * Update prompt_learning.rst * Update prompt_learning.rst * Update YAML (#4261) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Updated config to fix CI test OOM error (#4279) * Updated config to fix CI test issue Signed-off-by: Virginia Adams <vadams@nvidia.com> * Increased num workers Signed-off-by: Virginia Adams <vadams@nvidia.com> * verbose k2 install, skip if failed (#4289) Signed-off-by: Aleksandr Laptev <alaptev@nvidia.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> * Changed total virtual prompt tokens (#4295) * Changed total virtual prompt tokens Signed-off-by: Virginia Adams <vadams@nvidia.com> * put number of workers back Signed-off-by: Virginia Adams <vadams@nvidia.com> * upper bound lightning Signed-off-by: ericharper <complex451@gmail.com> * update branch Signed-off-by: ericharper <complex451@gmail.com> * update config Signed-off-by: ericharper <complex451@gmail.com> * remove duplicate test Signed-off-by: ericharper <complex451@gmail.com> * fix tn test cases Signed-off-by: ericharper <complex451@gmail.com> * add another safe.directory Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: treacker <36159472+treacker@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Yi Dong <43824965+yidong72@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> 
Co-authored-by: ekmb <ebakhturina@nvidia.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: Ghasem <35242805+pasandi20@users.noreply.github.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Aleksandr Laptev <laptevsasha12@gmail.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * [TTS] Training Fastpitch on German text and phonemes and finetuning HiFi-GAN on predicted mels (#4266) * initial commit Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * cleared notebook outputs Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * formatting errors Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * formatting Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * addressed comments Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * addressed comments on tutorial Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * updated tutorial Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * updated grammar and fastpitch description Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * updated with feedback Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * updated with feedback Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * removed unused imports Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Speedup the speech commands dataset processing script (#4347) * Add multiprocessing support to the google speech commands dataset processing script Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * fix number of args error with __extract_all_files function Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * fix styling issues Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * fix bugs with silence set construction and update librosa output write to use soundfile write Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * add docstrings 
and return values in __construct_filepaths as dictionary Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * fix wrong requirement (#4349) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Refactored path to manifest (#4251) Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * IPA support for TTS (#4310) * IPA tokenizer and G2P untested draft Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Add IPA CMUdict and new heteronyms list Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Add draft FastPitch IPA config Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Minor bugfixes for IPA training Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Add phoneme_probability to IPA G2P Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Updates to IPA FastPitch training config Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Update IPA dict and heteronyms file Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Adjust default lr for IPA FastPitch to 1e-3 Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Rename IPA CMUdict to reflect date Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Add docstrings for IPA tokenizer and G2P, update CMUdict path for config Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Fix IPA vocab ordering, add options to uppercase graphemes and remove stress symbols Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Mark IPA classes as experimental Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Update apostrophe-S cases Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Tn install (#4055) * remove conda pynini requirement Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * remove remnants Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * merge with main 
Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * removing nlp collection dependency from text processing and thus breaking cyclic imports Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix wrong requirement Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix bug in vi Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * update jenkins folders Signed-off-by: ekmb <ebakhturina@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * fix tutorial (#4352) Signed-off-by: stevehuang52 <heh@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * fix the post ln (#4350) Signed-off-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * [Fix] Hanging for Fully Randomized Bucketing (#4348) * Update container to 22.05 (#4329) * update container to 22.05 Signed-off-by: ericharper <complex451@gmail.com> * try adding safe directory Signed-off-by: ericharper <complex451@gmail.com> * try env var Signed-off-by: ericharper <complex451@gmail.com> * printenv Signed-off-by: ericharper <complex451@gmail.com> * try GIT_BRANCH Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> * remove debug statements Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: stevehuang52 <heh@nvidia.com> * Merge r1.9.0 main (#4331) * update branch Signed-off-by: ericharper <complex451@gmail.com> * update package info Signed-off-by: ericharper <complex451@gmail.com> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Draft: Fix restoring from checkpoint for case when 
`model.common_dataset_parameters.label_vocab_dir` is provided (#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Do not create tmp directory Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix parameter name Signed-off-by: PeganovAnton <peganoff2@mail.ru> * finish cherry-pick op Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix labels errors Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove duplicate stage Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Change target branch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * fix doc (#4146) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Tacotron2 retrain (#4103) * fix yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Fix for new TTSDataset class Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * inference fix Signed-off-by: treacker <emshabalin@yandex.ru> * removed old code Signed-off-by: treacker <emshabalin@yandex.ru> * updated parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * reverted version update Signed-off-by: treacker <emshabalin@yandex.ru> * refactored parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * Updated Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Update Jenkinsfile 
Signed-off-by: treacker <emshabalin@yandex.ru> * Update tacotron.yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: treacker <emshabalin@yandex.ru> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Fixed jenkins Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * Multiprocess improvements (#4127) * initial commit Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * start fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated scp to filelist Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * notebooks' link, typo and import fix (#4158) * redo missing pr 4007 Signed-off-by: fayejf <fayejf07@gmail.com> * remove extremely unreliable links Signed-off-by: fayejf <fayejf07@gmail.com> * update speaker docs (#4164) * update speaker docs Signed-off-by: nithinraok 
<nithinrao.koluguri@gmail.com> * chunks -> segments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Khz -> kHz Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * small fix (#4180) Signed-off-by: fayejf <fayejf07@gmail.com> * fix the server key value problem (#4196) Signed-off-by: Yi Dong <yidong@nvidia.com> * Fix/punctuation/trainer required for setting test data (#4199) * Draft of fix Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add warnings and replace globa_step with current_epoch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Small improvements to warnings Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Error and warning messages improvements Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Replace self.trainer with self._trainer Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Update ContextNet version (#4207) Signed-off-by: smajumdar <smajumdar@nvidia.com> * fix bugs for dialogue tutorial (#4211) Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Dialogue tutorial fix (#4214) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Add docs for Thutmose Tagger (#4173) * Add docs for Thutmose Tagger Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add level in docs Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * delete folder to avoid error with running when folder exists from previous run Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> * Dialogue tutorial fix (#4218) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * restore previously deleted files Signed-off-by: 
Zhilin Wang <wangzhilin12061996@hotmail.com> * style fix Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Dialogue tutorial fix (#4221) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * restore previously deleted files Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * style fix Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * fix syntax error in ipynb-file (#4228) Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix json serialize (#4235) Signed-off-by: Yi Dong <yidong@nvidia.com> * Prompt Learning Typo Fixes (#4238) * Prompt tuning notebook typo fixes Signed-off-by: Virginia Adams <vadams@nvidia.com> * Update tutorials.rst * Update prompt_learning.rst * Update prompt_learning.rst * fixing bug 3642622 (#4250) * fixing bug 3642622 Signed-off-by: Ghasem Pasandi <gpasandi@nvidia.com> * fixing bug 3642622 Signed-off-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> * fix broken link in the tutorial (#4257) Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * Typo fix, branch change, better download messagae (#4262) Signed-off-by: Virginia Adams <vadams@nvidia.com> * Raise error if bicleaner is not installed in NMT Data preprocesing notebook (#4264) * Raise error if bicleaner is not installed Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Clear cells Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix missing validation dataset, whitelist certain keywords for datasets (#4269) * Fix missing validation dataset, whitelist certain keywords for datasets Signed-off-by: smajumdar 
<smajumdar@nvidia.com> * Fix missing validation dataset, whitelist certain keywords for datasets Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update asr configs with num_workers and pin_memory (#4270) Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix epoch end (#4265) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Eric Harper <complex451@gmail.com> * Set Save on train end to false (#4274) * Set Save on train end to false Signed-off-by: Virginia Adams <vadams@nvidia.com> * Update prompt_learning.rst * Update prompt_learning.rst * Update YAML (#4261) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Updated config to fix CI test OOM error (#4279) * Updated config to fix CI test issue Signed-off-by: Virginia Adams <vadams@nvidia.com> * Increased num workers Signed-off-by: Virginia Adams <vadams@nvidia.com> * verbose k2 install, skip if failed (#4289) Signed-off-by: Aleksandr Laptev <alaptev@nvidia.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> * Changed total virtual prompt tokens (#4295) * Changed total virtual prompt tokens Signed-off-by: Virginia Adams <vadams@nvidia.com> * put number of workers back Signed-off-by: Virginia Adams <vadams@nvidia.com> * upper bound lightning Signed-off-by: ericharper <complex451@gmail.com> * update branch Signed-off-by: ericharper <complex451@gmail.com> * update config Signed-off-by: ericharper <complex451@gmail.com> * remove duplicate test Signed-off-by: ericharper <complex451@gmail.com> * fix tn test cases Signed-off-by: ericharper <complex451@gmail.com> * add another safe.directory Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: treacker <36159472+treacker@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: 
Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Yi Dong <43824965+yidong72@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: Ghasem <35242805+pasandi20@users.noreply.github.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Aleksandr Laptev <laptevsasha12@gmail.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> Signed-off-by: stevehuang52 <heh@nvidia.com> * fix full_randn bucket hang Signed-off-by: stevehuang52 <heh@nvidia.com> * remove unused variables Signed-off-by: stevehuang52 <heh@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: treacker <36159472+treacker@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Yi Dong <43824965+yidong72@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: Ghasem <35242805+pasandi20@users.noreply.github.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Aleksandr Laptev 
<laptevsasha12@gmail.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Bits of RADTTS support (#4343) * Bits of RADTTS support Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Fixed args mismatch Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Style Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Addressed review comments Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * More review comments Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Prompt Learning Pipeline Parallel (#4291) * Added get_forward_output_and_loss_func and updated train/val steps Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added preprocess flag before prompt table/encoder access Signed-off-by: Virginia Adams <vadams@nvidia.com> * Made two optimizer groups, one for frozen, one for soft prompt Signed-off-by: Virginia Adams <vadams@nvidia.com> * Pipeline parallel working Signed-off-by: Virginia Adams <vadams@nvidia.com> * Still figuring out setting lr/sched for one param group Signed-off-by: Virginia Adams <vadams@nvidia.com> * Set betas to zero Signed-off-by: Virginia Adams <vadams@nvidia.com> * Only unfreeze one sublayer with lr 0.0 Signed-off-by: Virginia Adams <vadams@nvidia.com> * Pipeline parallel working w/ one optimizer Signed-off-by: Virginia Adams <vadams@nvidia.com> * Trying to fix Jenkins file Signed-off-by: Virginia Adams <vadams@nvidia.com> * Trying to fix Jenkins file Signed-off-by: Virginia Adams <vadams@nvidia.com> * Getting updated jenkins test to work Signed-off-by: Virginia Adams <vadams@nvidia.com> * Getting updated jenkins test to work Signed-off-by: Virginia Adams <vadams@nvidia.com> * added prompt learning tp and pp CI tests Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added amp_o2 model compatibility Signed-off-by: Virginia Adams <vadams@nvidia.com> * Made CI test 
smaller Signed-off-by: Virginia Adams <vadams@nvidia.com> * Still trying to get Jenkins to work Signed-off-by: Virginia Adams <vadams@nvidia.com> * Still trying to get Jenkins to work Signed-off-by: Virginia Adams <vadams@nvidia.com> * Temporarily moving prompt learning CI test to beginning Signed-off-by: Virginia Adams <vadams@nvidia.com> * Changing the layer being unfrozen Signed-off-by: Virginia Adams <vadams@nvidia.com> * debug jenkins Signed-off-by: Virginia Adams <vadams@nvidia.com> * Move pp unfreeze to init Signed-off-by: Virginia Adams <vadams@nvidia.com> * Try to make Jenkins test parallel Signed-off-by: Virginia Adams <vadams@nvidia.com> * Fix python formatting Signed-off-by: Virginia Adams <vadams@nvidia.com> * Moved prompt learning tests back to where they belong Signed-off-by: Virginia Adams <vadams@nvidia.com> * add back checkpoint convertion CI test Signed-off-by: Virginia Adams <vadams@nvidia.com> * Revert "add back checkpoint convertion CI test" This reverts commit 61e2ffcdefe964c8e74b74d8c10906ae29f32b6d. 
* Add back checkpoint conversion test Signed-off-by: Virginia Adams <vadams@nvidia.com> * Setting requires grad to True everywhere Signed-off-by: Virginia Adams <vadams@nvidia.com> * Updated config comments and simplified param group code Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added comment on frozen_model having lr=0.0 Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added configure optimizers methods Signed-off-by: Virginia Adams <vadams@nvidia.com> * Set amp_o2 to false Signed-off-by: Virginia Adams <vadams@nvidia.com> * removed o2 code Signed-off-by: Virginia Adams <vadams@nvidia.com> * Python formatting fix Signed-off-by: Virginia Adams <vadams@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * auto switch conformer encoder adapter in_features (#4354) Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Dataloader, collector, loss and metric for multiscale diarization decoder (#4187) * First commit Signed-off-by: Taejin Park <tango4j@gmail.com> * Checked funtionality and imports Signed-off-by: Taejin Park <tango4j@gmail.com> * fixed import issues Signed-off-by: Taejin Park <tango4j@gmail.com> * Removed the changed made by mistake Signed-off-by: Taejin Park <tango4j@gmail.com> * Style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Fixed LGTM errors 001 Signed-off-by: Taejin Park <tango4j@gmail.com> * Fixed LGTM and style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Changed docstrings Signed-off-by: Taejin Park <tango4j@gmail.com> * LGTM again Signed-off-by: Taejin Park <tango4j@gmail.com> * Removed unnecessary torch setting lines Signed-off-by: Taejin Park <tango4j@gmail.com> * Style fix and isort Signed-off-by: Taejin Park <tango4j@gmail.com> * jbalam-nv comments reflected Signed-off-by: Taejin Park <tango4j@gmail.com> * style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Reflected comments and created 
_diar_label.py Signed-off-by: Taejin Park <tango4j@gmail.com> * Typo fix and style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Fixed target_spks[0] index error Signed-off-by: Taejin Park <tango4j@gmail.com> * style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * LGTM unused import IterDataset Signed-off-by: Taejin Park <tango4j@gmail.com> * revert collection doc year Signed-off-by: Taejin Park <tango4j@gmail.com> * Code format error in collections.py Signed-off-by: Taejin Park <tango4j@gmail.com> * fix collections space format error Signed-off-by: Taejin Park <tango4j@gmail.com> * merged main correctly Signed-off-by: Taejin Park <tango4j@gmail.com> * style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Reflected all comments and tested Signed-off-by: Taejin Park <tango4j@gmail.com> * style fix and LGTM Signed-off-by: Taejin Park <tango4j@gmail.com> * rttm_filepath to rttm_file and removed self included funcs, tested Signed-off-by: Taejin Park <tango4j@gmail.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Add ASR CTC Decoding module (#4342) * Initial commit Signed-off-by: smajumdar <smajumdar@nvidia.com> * Full support for decoding strategy Signed-off-by: smajumdar <smajumdar@nvidia.com> * Temp Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix labels of y_sequence Signed-off-by: smajumdar <smajumdar@nvidia.com> * Set support for sentencepiece subword merging Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix char and word based token merge alignment Signed-off-by: smajumdar <smajumdar@nvidia.com> * Revert incorrect change Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update docstring Signed-off-by: smajumdar <smajumdar@nvidia.com> * Improve compatibility with greedy tokens and log probs Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update scripts to use decoding strategy Signed-off-by: smajumdar <smajumdar@nvidia.com> * Add tests and docs Signed-off-by: smajumdar 
<smajumdar@nvidia.com> * Add tests and docs Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix speaker decoder timestamps Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix speaker decoder timestamps Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix decoding of ctc models Signed-off-by: smajumdar <smajumdar@nvidia.com> * Address reviewer comments Signed-off-by: smajumdar <smajumdar@nvidia.com> * Address reviewer comments Signed-off-by: smajumdar <smajumdar@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Option to disable mp in VAD via num_workers=1 (#4317) * Option to disable mp in VAD via num_workers=1 In certain environments python multiprocessing can deadlock. This adds a convenient version to disable by setting num_workers to 1. Signed-off-by: Georg Kucsko <gkucsko@gmail.com> * add none handling Signed-off-by: Georg Kucsko <gkucsko@gmail.com> * additional none handling Signed-off-by: Georg Kucsko <gkucsko@gmail.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * remove redundant bias expand (#4382) * remove redundant bias expand Signed-off-by: Xiaowei Ren <xren@nvidia.com> * delete redundant code Signed-off-by: Xiaowei Ren <xren@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Add option for specifying wandb save_dir from config (#4379) * give option to user to specify wandb save dir via config Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * create save_dir directory for wandb logger if not exists Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * update save_dir get method with a default value Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Quick wav2vec fix. In-place operation adding convolutional positions to encoder was overwriting leaf history. Wasn't caught on previous torch versions. 
(#4383) Signed-off-by: tbartley94 <tbartley@nvidia.com> Co-authored-by: tbartley94 <tbartley@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * [Bugfix][TTS] wrong order of returned tuple for general_collate_fn. (#4388) Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Merge r1.10.0 main (#4398) * update branch Signed-off-by: ericharper <complex451@gmail.com> * Set headscale false (#4364) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Add wandb as dependency (#4365) Signed-off-by: smajumdar <smajumdar@nvidia.com> * Raise trainer error (#4356) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Micha Livne <michalivne@users.noreply.github.com> * Set headscale false (#4364) (#4366) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: smajumdar <smajumdar@nvidia.com> * Finetuning changes for BART (#4003) * Temp Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Checkpoint converter to nemo for bart Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Style Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Micha Livne <michalivne@users.noreply.github.com> * Make position embedding expansion specific to a batch to avoid checkpoint size mismatches (#4357) * Style Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix logging warning Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Micha Livne <michalivne@users.noreply.github.com> * Fix electronic bug, new time ITN rule (#4355) * fix electronic bug Signed-off-by: ekmb <ebakhturina@nvidia.com> * add new itn time rule Signed-off-by: ekmb <ebakhturina@nvidia.com> * revert domain changes Signed-off-by: ekmb <ebakhturina@nvidia.com> * remove repetition Signed-off-by: ekmb <ebakhturina@nvidia.com> * Correct support for dataclasses in default module 
dim (#4372) * Correct support for dataclasses in default module dim Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix path for save of results Signed-off-by: smajumdar <smajumdar@nvidia.com> * fix pad id bug (#4377) Signed-off-by: Yi Dong <yidong@nvidia.com> * Question answering bug fix (#4381) * refactor dialogue state tracking for modelling/dataset interoperability Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style changes Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style raised by lgtm Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style formatting Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update template to include description of intent Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * changes based on requests in review Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add compatibility with assistant dataset Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove dialogue_state_tracking Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update huggingface utils for dialogue Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * rename dialogue_state_tracking_hybrid to dialogue_state_tracking_sgdqa Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix nemo/collections/nlp/models/dialogue_state_tracking_sgdqa/__init__.py Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN 
Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add docstrings for assistant data processsor Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins for SGDGEN local checkpoint Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * use local vocab file for Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * patch for Jenkins CI using local file Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add slot filling prediction and metrics Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused code Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix …
* Update container to 22.05 (NVIDIA-NeMo#4329) * update container to 22.05 Signed-off-by: ericharper <complex451@gmail.com> * try adding safe directory Signed-off-by: ericharper <complex451@gmail.com> * try env var Signed-off-by: ericharper <complex451@gmail.com> * printenv Signed-off-by: ericharper <complex451@gmail.com> * try GIT_BRANCH Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> * remove dbug statements Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: stevehuang52 <heh@nvidia.com> * Merge r1.9.0 main (NVIDIA-NeMo#4331) * update branch Signed-off-by: ericharper <complex451@gmail.com> * update package info Signed-off-by: ericharper <complex451@gmail.com> * cleaned up TN/ ITN doc (NVIDIA-NeMo#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Draft: Fix restoring from checkpoint for case when `model.common_dataset_parameters.label_vocab_dir` is provided (NVIDIA-NeMo#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Do not create tmp directory Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix parameter name Signed-off-by: PeganovAnton <peganoff2@mail.ru> * finish cherry-pick op Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix labels errors Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove duplicate stage Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Change target branch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * fix doc (NVIDIA-NeMo#4146) Signed-off-by: Yang Zhang 
<yangzhang@nvidia.com> * Tacotron2 retrain (NVIDIA-NeMo#4103) * fix yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Fix for new TTSDataset class Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * inference fix Signed-off-by: treacker <emshabalin@yandex.ru> * removed old code Signed-off-by: treacker <emshabalin@yandex.ru> * updated parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * reverted version update Signed-off-by: treacker <emshabalin@yandex.ru> * refactored parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * Updated Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Update Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Update tacotron.yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * cleaned up TN/ ITN doc (NVIDIA-NeMo#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: treacker <emshabalin@yandex.ru> * Check implicit grad acc in GLUE dataset building (NVIDIA-NeMo#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: treacker 
<emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Fixed jenkins Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * Multiprocess improvements (NVIDIA-NeMo#4127) * initial commit Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * start fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated scp to filelist Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * notebooks' link, typo and import fix (NVIDIA-NeMo#4158) * redo missing pr 4007 Signed-off-by: fayejf <fayejf07@gmail.com> * remove extremely unreliable links Signed-off-by: fayejf <fayejf07@gmail.com> * update speaker docs (NVIDIA-NeMo#4164) * update speaker docs Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * chunks -> segments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Khz -> kHz Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * small fix (NVIDIA-NeMo#4180) Signed-off-by: fayejf <fayejf07@gmail.com> * fix the server key value problem (NVIDIA-NeMo#4196) Signed-off-by: Yi Dong <yidong@nvidia.com> * Fix/punctuation/trainer required for setting test data (NVIDIA-NeMo#4199) * Draft of fix Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add warnings and replace globa_step with current_epoch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Small improvements to warnings Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Error and warning messages improvements Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Replace 
self.trainer with self._trainer Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Update ContextNet version (NVIDIA-NeMo#4207) Signed-off-by: smajumdar <smajumdar@nvidia.com> * fix bugs for dialogue tutorial (NVIDIA-NeMo#4211) Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Dialogue tutorial fix (NVIDIA-NeMo#4214) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Add docs for Thutmose Tagger (NVIDIA-NeMo#4173) * Add docs for Thutmose Tagger Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add level in docs Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * delete folder to avoid error with running when folder exists from previous run Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> * Dialogue tutorial fix (NVIDIA-NeMo#4218) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * restore previously deleted files Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * style fix Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Dialogue tutorial fix (NVIDIA-NeMo#4221) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * restore previously deleted files Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * style fix Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * fix syntax error in ipynb-file (NVIDIA-NeMo#4228) Signed-off-by: Alexandra 
Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix json serialize (NVIDIA-NeMo#4235) Signed-off-by: Yi Dong <yidong@nvidia.com> * Prompt Learning Typo Fixes (NVIDIA-NeMo#4238) * Prompt tuning notebook typo fixes Signed-off-by: Virginia Adams <vadams@nvidia.com> * Update tutorials.rst * Update prompt_learning.rst * Update prompt_learning.rst * fixing bug 3642622 (NVIDIA-NeMo#4250) * fixing bug 3642622 Signed-off-by: Ghasem Pasandi <gpasandi@nvidia.com> * fixing bug 3642622 Signed-off-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> * fix broken link in the tutorial (NVIDIA-NeMo#4257) Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * Typo fix, branch change, better download messagae (NVIDIA-NeMo#4262) Signed-off-by: Virginia Adams <vadams@nvidia.com> * Raise error if bicleaner is not installed in NMT Data preprocesing notebook (NVIDIA-NeMo#4264) * Raise error if bicleaner is not installed Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Clear cells Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix missing validation dataset, whitelist certain keywords for datasets (NVIDIA-NeMo#4269) * Fix missing validation dataset, whitelist certain keywords for datasets Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix missing validation dataset, whitelist certain keywords for datasets Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update asr configs with num_workers and pin_memory (NVIDIA-NeMo#4270) Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix epoch end (NVIDIA-NeMo#4265) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Eric Harper <complex451@gmail.com> * Set Save on train end to false (NVIDIA-NeMo#4274) * Set Save on train end to false Signed-off-by: Virginia Adams <vadams@nvidia.com> * Update prompt_learning.rst * Update 
prompt_learning.rst * Update YAML (NVIDIA-NeMo#4261) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Updated config to fix CI test OOM error (NVIDIA-NeMo#4279) * Updated config to fix CI test issue Signed-off-by: Virginia Adams <vadams@nvidia.com> * Increased num workers Signed-off-by: Virginia Adams <vadams@nvidia.com> * verbose k2 install, skip if failed (NVIDIA-NeMo#4289) Signed-off-by: Aleksandr Laptev <alaptev@nvidia.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> * Changed total virtual prompt tokens (NVIDIA-NeMo#4295) * Changed total virtual prompt tokens Signed-off-by: Virginia Adams <vadams@nvidia.com> * put number of workers back Signed-off-by: Virginia Adams <vadams@nvidia.com> * upper bound lightning Signed-off-by: ericharper <complex451@gmail.com> * update branch Signed-off-by: ericharper <complex451@gmail.com> * update config Signed-off-by: ericharper <complex451@gmail.com> * remove duplicate test Signed-off-by: ericharper <complex451@gmail.com> * fix tn test cases Signed-off-by: ericharper <complex451@gmail.com> * add another safe.directory Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: treacker <36159472+treacker@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Yi Dong <43824965+yidong72@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> Co-authored-by: Virginia Adams 
<78445382+vadam5@users.noreply.github.com> Co-authored-by: Ghasem <35242805+pasandi20@users.noreply.github.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Aleksandr Laptev <laptevsasha12@gmail.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> Signed-off-by: stevehuang52 <heh@nvidia.com> * fix full_randn bucket hang Signed-off-by: stevehuang52 <heh@nvidia.com> * remove unused variables Signed-off-by: stevehuang52 <heh@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: treacker <36159472+treacker@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Yi Dong <43824965+yidong72@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: Ghasem <35242805+pasandi20@users.noreply.github.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Aleksandr Laptev <laptevsasha12@gmail.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com>
* stateless RNNT working Signed-off-by: Hainan Xu <hainanx@nvidia.com> * batch decode working Signed-off-by: Hainan Xu <hainanx@nvidia.com> * working backup Signed-off-by: Hainan Xu <hainanx@nvidia.com> * good working version Signed-off-by: Hainan Xu <hainanx@nvidia.com> * temporarily make norm layer have affine Signed-off-by: Hainan Xu <hainanx@nvidia.com> * temp Signed-off-by: Hainan Xu <hainanx@nvidia.com> * temp Signed-off-by: Hainan Xu <hainanx@nvidia.com> * [TTS] add staticmethod decoration for BetaBinomialInterpolator (#4319) Signed-off-by: Xuesong Yang <xuesongyxs@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * [TTS] remove redundant lines and declare global variables and capture (#4320) exception of non-supported windows. Signed-off-by: Xuesong Yang <xuesongyxs@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Finetune T5 on the prefix-lm objective (#4328) * Add script and yaml config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix yaml config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Style Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Update yaml to remove hardcoded model path Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Fuse bias with geglu in ParallelMLP (#4213) * add code of fused_bias_geglu * call fused_bias_geglu in ParallelMLP * fix some bugs * change biad_gelu_activation to bias_activation_fusion * fix the setting of bias_actication_fusion for T5 * delete bias_gelu_fusion from T5 example config * push reformatted files * hto4h gemms fusion * remove hto4h gemms fusion * push reformatted files * disable bias_activation_fusion while activation is not geglu * add bias_activation_fusion in yaml config file * add bias_gelu_fusion in T5 config yaml file to pass CI test * change bias_gelu_fusion to bias_activation_fusion for 
T5 CI test * recover latest change Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Support larger datasets for question answering (#4205) * refactor dialogue state tracking for modelling/dataset interoperability Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style changes Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style raised by lgtm Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style formatting Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update template to include description of intent Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * changes based on requests in review Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add compatibility with assistant dataset Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove dialogue_state_tracking Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update huggingface utils for dialogue Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * rename dialogue_state_tracking_hybrid to dialogue_state_tracking_sgdqa Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix nemo/collections/nlp/models/dialogue_state_tracking_sgdqa/__init__.py Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo Signed-off-by: Zhilin Wang 
<zhilinw@nvidia.com> * add docstrings for assistant data processsor Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins for SGDGEN local checkpoint Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * use local vocab file for Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * patch for Jenkins CI using local file Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add slot filling prediction and metrics Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused code Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * refactor metrics code out of Dialogue GPT Model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate backward compatible support for IntentSlotClassificationModel (bert model) Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * save prediction file for IntentSlotClassification Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update dialogue gpt model training for megatron gpt Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove batch generate for HF GPT2, which causes lower performance Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add few shot capability to dialogue gpt model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile and remove unused import Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update code description and clarity Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address PR comments Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate compatibility with ZeroShotIntentModel Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * rename folder to dialogue due to increased scope and further refactor for clarity Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * added dialogue GPT for sequence generation task (e.g. 
answer extender) Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add CI test for DialogueGPTGenerationModel Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate DialogueS2SGenerationModel for generation task (e.g. answer extender) Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * modify huggingface utils to support HF t5/BART models Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused imports Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update bleu metric Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix bleu metric style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * debug bleu metric Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * debug bleu metric Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update based on PR #3893 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update 2 based on PR #3893 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update 3 based on PR #3893 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate sgd generation based on user user utterance and system slot-values to generate system utterance Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add validation model saving capabilities Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * cleaned up code for SGD Based Answer extender Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Dialogue Generation CI Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix Jenkins CI issue" Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add support for design dataset Signed-off-by: Zhilin Wang 
<zhilinw@nvidia.com> * remove unnecessary imports Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * support megatron for dialogue_s2s_generation_model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * reduce loaded samples in MSMarcoDataProcessor to 64 when cfg.model.dataset.debug_mode=True Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update CI Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update checkpoint and predictions filename to include epoch number Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate HF BART MNLI into zero shot intent model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * integrate Dialogue Nearest Neighbour Model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * refactor Dialogue SGD Data Processor to make interface for models cleaner Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Dialogue S2S Generation model for DialogueSGDDataProcessor interface Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * support sgd and drive thru datasets by zero shot model and nearest neighbour model Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add prediction saving code to nearest neighbour and zero shot intent models Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo in sgd data processor Signed-off-by: Zhilin Wang 
<zhilinw@nvidia.com> * integrate Dialogue Mellon QA Data Processor Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update mellon qa Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update dialogue.py to remove outdated info Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update dialogue_config.yaml Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update dialogue_config.yaml Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add dialogue docs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address review comments Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix for cfg Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * make dependency on apex optional Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * change NLPDDPluggin calling logic to make it possible to run without apex Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add first draft of tutorial Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * reduce ms marco size by removing lines without wellFormedAnswers Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address pr comments Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update colab tutorial link in dialogue docs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * include unit test and some refactor to facilitate unit test Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * address pr issues Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove typos in dialogue tutorial Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * support larger files for question answering 
Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unnecessary artifacts to reduce memory use Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * put 0 tensor to device Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update link within dialogue tutorial Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * restore previously delete files Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error handling when loss = nan Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update nan handling Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update spanning loss func Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update spanning loss Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix type error raised in qa_dataset.py Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add error checking message Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * revert back to float32 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * revert back to float32 Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update exp logging Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update error msgs Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update loading of large file from pickle to json Signed-off-by: 
Zhilin Wang <zhilinw@nvidia.com> * update loading of large file from pickle to json Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * limit number of negative samples Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * revert post processing Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * revert post processing Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused methods and style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add more documentation Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused imports Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * changes base on PR review Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> Co-authored-by: Zhilin Wang <zhilinw@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Fix bugs in indexed dataset exam script (#4325) * fix the typo Signed-off-by: Yi Dong <yidong@nvidia.com> * add neighbors option Signed-off-by: Yi Dong <yidong@nvidia.com> * change the argument name Signed-off-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Micha Livne <michalivne@users.noreply.github.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Adding docs for ASR SSL (#4303) * Initial commit for SSL docs Signed-off-by: Krishna Puvvada <kpuvvada@nvidia.com> * ssl docs update-1 Signed-off-by: Krishna Puvvada <kpuvvada@nvidia.com> * ssl docs update-2 Signed-off-by: Krishna Puvvada <kpuvvada@nvidia.com> Co-authored-by: Krishna Puvvada <kpuvvada@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Fuse grad division into async grad allreduce (#4327) * O2 runs but O1 does not Signed-off-by: ericharper <complex451@gmail.com> * disable async for O1 Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> * 
update async flag in configure_optimizers Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> * revert Signed-off-by: ericharper <complex451@gmail.com> * update _require if using async Signed-off-by: ericharper <complex451@gmail.com> * clean comments Signed-off-by: ericharper <complex451@gmail.com> * always all_reduce Signed-off-by: ericharper <complex451@gmail.com> * add async grad allreduce and chunk optimization to T5 * push reformatted files after style check * set chunk size as 0 while async grad allreduce is off * more experiments show that 125MB is a better default chunk size for most cases * add grad_allreduce_chunk_size_mb for GPT-3 * at the end of each training step, wait until all async grad allreduce works are done * replace individual allreduce work.wait() with a single dGPU evice synchroonization * add code of fused_bias_geglu * call fused_bias_geglu in ParallelMLP * record the status of each allreduce work seems too much for perf * add more comments * push a reformatted file * fix some bugs * change biad_gelu_activation to bias_activation_fusion * fix the setting of bias_actication_fusion for T5 * delete bias_gelu_fusion from T5 example config * push reformatted files * fuse grad scale with allreduce * push reformatted files * hto4h gemms fusion * remove hto4h gemms fusion * add grad_scale_ar_fusion into GPT-3 * push reformatted files * push reformatted files * rename grad_scale_ar_fusion to grad_div_ar_fusion * disable bias_activation_fusion while activation is not geglu * add bias_activation_fusion in yaml config file * add bias_gelu_fusion in T5 config yaml file to pass CI test * change bias_gelu_fusion to bias_activation_fusion for T5 CI test * recover latest change * add grad_div_ar_fusion in config yaml file * remove a redundant float() Co-authored-by: ericharper <complex451@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Hainan Xu 
<hainanx@nvidia.com> * Update container to 22.05 (#4329) * update container to 22.05 Signed-off-by: ericharper <complex451@gmail.com> * try adding safe directory Signed-off-by: ericharper <complex451@gmail.com> * try env var Signed-off-by: ericharper <complex451@gmail.com> * printenv Signed-off-by: ericharper <complex451@gmail.com> * try GIT_BRANCH Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> * remove dbug statements Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Torchaudio installation fix (#4330) * separate installer added Signed-off-by: Aleksandr Laptev <alaptev@nvidia.com> * apply suggestions, minor fixes Signed-off-by: Aleksandr Laptev <alaptev@nvidia.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * [TTS] enforced pin_memory = True (#4341) Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Merge r1.9.0 main (#4331) * update branch Signed-off-by: ericharper <complex451@gmail.com> * update package info Signed-off-by: ericharper <complex451@gmail.com> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Draft: Fix restoring from checkpoint for case when `model.common_dataset_parameters.label_vocab_dir` is provided (#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Do not create tmp directory Signed-off-by: 
PeganovAnton <peganoff2@mail.ru> * Fix parameter name Signed-off-by: PeganovAnton <peganoff2@mail.ru> * finish cherry-pick op Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix labels errors Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove duplicate stage Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Change target branch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * fix doc (#4146) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Tacotron2 retrain (#4103) * fix yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Fix for new TTSDataset class Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * inference fix Signed-off-by: treacker <emshabalin@yandex.ru> * removed old code Signed-off-by: treacker <emshabalin@yandex.ru> * updated parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * reverted version update Signed-off-by: treacker <emshabalin@yandex.ru> * refactored parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * Updated Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Update Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Update tacotron.yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang 
Zhang <yangzhang@nvidia.com> Signed-off-by: treacker <emshabalin@yandex.ru> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Fixed jenkins Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * Multiprocess improvements (#4127) * initial commit Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * start fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated scp to filelist Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * notebooks' link, typo and import fix (#4158) * redo missing pr 4007 Signed-off-by: fayejf <fayejf07@gmail.com> * remove extremely unreliable links Signed-off-by: fayejf <fayejf07@gmail.com> * update speaker docs (#4164) * update speaker docs Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * chunks -> segments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Khz -> kHz Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * small fix (#4180) Signed-off-by: fayejf <fayejf07@gmail.com> * fix the server key value problem (#4196) Signed-off-by: Yi Dong <yidong@nvidia.com> * Fix/punctuation/trainer required for setting test data (#4199) * Draft of fix Signed-off-by: PeganovAnton 
<peganoff2@mail.ru> * Add warnings and replace globa_step with current_epoch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Small improvements to warnings Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Error and warning messages improvements Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Replace self.trainer with self._trainer Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Update ContextNet version (#4207) Signed-off-by: smajumdar <smajumdar@nvidia.com> * fix bugs for dialogue tutorial (#4211) Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Dialogue tutorial fix (#4214) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Add docs for Thutmose Tagger (#4173) * Add docs for Thutmose Tagger Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add level in docs Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * delete folder to avoid error with running when folder exists from previous run Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> * Dialogue tutorial fix (#4218) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * restore previously deleted files Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * style fix Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Dialogue tutorial fix (#4221) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * restore previously deleted files Signed-off-by: Zhilin Wang 
<wangzhilin12061996@hotmail.com> * style fix Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * fix syntax error in ipynb-file (#4228) Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix json serialize (#4235) Signed-off-by: Yi Dong <yidong@nvidia.com> * Prompt Learning Typo Fixes (#4238) * Prompt tuning notebook typo fixes Signed-off-by: Virginia Adams <vadams@nvidia.com> * Update tutorials.rst * Update prompt_learning.rst * Update prompt_learning.rst * fixing bug 3642622 (#4250) * fixing bug 3642622 Signed-off-by: Ghasem Pasandi <gpasandi@nvidia.com> * fixing bug 3642622 Signed-off-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> * fix broken link in the tutorial (#4257) Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * Typo fix, branch change, better download messagae (#4262) Signed-off-by: Virginia Adams <vadams@nvidia.com> * Raise error if bicleaner is not installed in NMT Data preprocesing notebook (#4264) * Raise error if bicleaner is not installed Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Clear cells Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix missing validation dataset, whitelist certain keywords for datasets (#4269) * Fix missing validation dataset, whitelist certain keywords for datasets Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix missing validation dataset, whitelist certain keywords for datasets Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update asr configs with num_workers and pin_memory (#4270) Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix epoch end (#4265) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Eric Harper <complex451@gmail.com> * Set Save on train end to 
false (#4274) * Set Save on train end to false Signed-off-by: Virginia Adams <vadams@nvidia.com> * Update prompt_learning.rst * Update prompt_learning.rst * Update YAML (#4261) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Updated config to fix CI test OOM error (#4279) * Updated config to fix CI test issue Signed-off-by: Virginia Adams <vadams@nvidia.com> * Increased num workers Signed-off-by: Virginia Adams <vadams@nvidia.com> * verbose k2 install, skip if failed (#4289) Signed-off-by: Aleksandr Laptev <alaptev@nvidia.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> * Changed total virtual prompt tokens (#4295) * Changed total virtual prompt tokens Signed-off-by: Virginia Adams <vadams@nvidia.com> * put number of workers back Signed-off-by: Virginia Adams <vadams@nvidia.com> * upper bound lightning Signed-off-by: ericharper <complex451@gmail.com> * update branch Signed-off-by: ericharper <complex451@gmail.com> * update config Signed-off-by: ericharper <complex451@gmail.com> * remove duplicate test Signed-off-by: ericharper <complex451@gmail.com> * fix tn test cases Signed-off-by: ericharper <complex451@gmail.com> * add another safe.directory Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: treacker <36159472+treacker@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Yi Dong <43824965+yidong72@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> 
Co-authored-by: ekmb <ebakhturina@nvidia.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: Ghasem <35242805+pasandi20@users.noreply.github.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Aleksandr Laptev <laptevsasha12@gmail.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * [TTS] Training Fastpitch on German text and phonemes and finetuning HiFi-GAN on predicted mels (#4266) * initial commit Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * cleared notebook outputs Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * formatting errors Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * formatting Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * addressed comments Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * addressed comments on tutorial Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * updated tutorial Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * updated grammar and fastpitch description Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * updated with feedback Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * updated with feedback Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> * removed unused imports Signed-off-by: Akshit Arora <akshit.arora@colorado.edu> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Speedup the speech commands dataset processing script (#4347) * Add multiprocessing support to the google speech commands dataset processing script Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * fix number of args error with __extract_all_files function Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * fix styling issues Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * fix bugs with silence set construction and update librosa output write to use soundfile write Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * add docstrings 
and return values in __construct_filepaths as dictionary Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * fix wrong requirement (#4349) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Refactored path to manifest (#4251) Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * IPA support for TTS (#4310) * IPA tokenizer and G2P untested draft Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Add IPA CMUdict and new heteronyms list Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Add draft FastPitch IPA config Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Minor bugfixes for IPA training Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Add phoneme_probability to IPA G2P Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Updates to IPA FastPitch training config Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Update IPA dict and heteronyms file Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Adjust default lr for IPA FastPitch to 1e-3 Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Rename IPA CMUdict to reflect date Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Add docstrings for IPA tokenizer and G2P, update CMUdict path for config Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Fix IPA vocab ordering, add options to uppercase graphemes and remove stress symbols Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Mark IPA classes as experimental Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Update apostrophe-S cases Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Tn install (#4055) * remove conda pynini requirement Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * remove remnants Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * merge with main 
Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * removing nlp collection dependency from text processing and thus breaking cyclyc imports Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix wrong requirement Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix bug in vi Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * update jenkins folders Signed-off-by: ekmb <ebakhturina@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * fix tutorial (#4352) Signed-off-by: stevehuang52 <heh@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * fix the post ln (#4350) Signed-off-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * [Fix] Hanging for Fully Randomized Bucketing (#4348) * Update container to 22.05 (#4329) * update container to 22.05 Signed-off-by: ericharper <complex451@gmail.com> * try adding safe directory Signed-off-by: ericharper <complex451@gmail.com> * try env var Signed-off-by: ericharper <complex451@gmail.com> * printenv Signed-off-by: ericharper <complex451@gmail.com> * try GIT_BRANCH Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> * remove dbug statements Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: stevehuang52 <heh@nvidia.com> * Merge r1.9.0 main (#4331) * update branch Signed-off-by: ericharper <complex451@gmail.com> * update package info Signed-off-by: ericharper <complex451@gmail.com> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Draft: Fix restoring from checkpoint for case when 
`model.common_dataset_parameters.label_vocab_dir` is provided (#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Do not create tmp directory Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix parameter name Signed-off-by: PeganovAnton <peganoff2@mail.ru> * finish cherry-pick op Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix labels errors Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove duplicate stage Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Change target branch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * fix doc (#4146) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Tacotron2 retrain (#4103) * fix yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Fix for new TTSDataset class Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * inference fix Signed-off-by: treacker <emshabalin@yandex.ru> * removed old code Signed-off-by: treacker <emshabalin@yandex.ru> * updated parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * reverted version update Signed-off-by: treacker <emshabalin@yandex.ru> * refactored parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * Updated Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Update Jenkinsfile 
Signed-off-by: treacker <emshabalin@yandex.ru> * Update tacotron.yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: treacker <emshabalin@yandex.ru> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Fixed jenkins Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * Multiprocess improvements (#4127) * initial commit Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * start fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated scp to filelist Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * notebooks' link, typo and import fix (#4158) * redo missing pr 4007 Signed-off-by: fayejf <fayejf07@gmail.com> * remove extremely unreliable links Signed-off-by: fayejf <fayejf07@gmail.com> * update speaker docs (#4164) * update speaker docs Signed-off-by: nithinraok 
<nithinrao.koluguri@gmail.com> * chunks -> segments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Khz -> kHz Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * small fix (#4180) Signed-off-by: fayejf <fayejf07@gmail.com> * fix the server key value problem (#4196) Signed-off-by: Yi Dong <yidong@nvidia.com> * Fix/punctuation/trainer required for setting test data (#4199) * Draft of fix Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add warnings and replace globa_step with current_epoch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Small improvements to warnings Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Error and warning messages improvements Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Replace self.trainer with self._trainer Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Update ContextNet version (#4207) Signed-off-by: smajumdar <smajumdar@nvidia.com> * fix bugs for dialogue tutorial (#4211) Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Dialogue tutorial fix (#4214) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Add docs for Thutmose Tagger (#4173) * Add docs for Thutmose Tagger Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add level in docs Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * delete folder to avoid error with running when folder exists from previous run Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> * Dialogue tutorial fix (#4218) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * restore previously deleted files Signed-off-by: 
Zhilin Wang <wangzhilin12061996@hotmail.com> * style fix Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * Dialogue tutorial fix (#4221) * fix bugs for dialogue tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update path for convert_datasets.py due to conflict PR Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * restore previously deleted files Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * style fix Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * update tutorial Signed-off-by: Zhilin Wang <wangzhilin12061996@hotmail.com> * fix syntax error in ipynb-file (#4228) Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix json serialize (#4235) Signed-off-by: Yi Dong <yidong@nvidia.com> * Prompt Learning Typo Fixes (#4238) * Prompt tuning notebook typo fixes Signed-off-by: Virginia Adams <vadams@nvidia.com> * Update tutorials.rst * Update prompt_learning.rst * Update prompt_learning.rst * fixing bug 3642622 (#4250) * fixing bug 3642622 Signed-off-by: Ghasem Pasandi <gpasandi@nvidia.com> * fixing bug 3642622 Signed-off-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> * fix broken link in the tutorial (#4257) Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * Typo fix, branch change, better download messagae (#4262) Signed-off-by: Virginia Adams <vadams@nvidia.com> * Raise error if bicleaner is not installed in NMT Data preprocesing notebook (#4264) * Raise error if bicleaner is not installed Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Clear cells Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix missing validation dataset, whitelist certain keywords for datasets (#4269) * Fix missing validation dataset, whitelist certain keywords for datasets Signed-off-by: smajumdar 
<smajumdar@nvidia.com> * Fix missing validation dataset, whitelist certain keywords for datasets Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update asr configs with num_workers and pin_memory (#4270) Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix epoch end (#4265) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Eric Harper <complex451@gmail.com> * Set Save on train end to false (#4274) * Set Save on train end to false Signed-off-by: Virginia Adams <vadams@nvidia.com> * Update prompt_learning.rst * Update prompt_learning.rst * Update YAML (#4261) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Updated config to fix CI test OOM error (#4279) * Updated config to fix CI test issue Signed-off-by: Virginia Adams <vadams@nvidia.com> * Increased num workers Signed-off-by: Virginia Adams <vadams@nvidia.com> * verbose k2 install, skip if failed (#4289) Signed-off-by: Aleksandr Laptev <alaptev@nvidia.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> * Changed total virtual prompt tokens (#4295) * Changed total virtual prompt tokens Signed-off-by: Virginia Adams <vadams@nvidia.com> * put number of workers back Signed-off-by: Virginia Adams <vadams@nvidia.com> * upper bound lightning Signed-off-by: ericharper <complex451@gmail.com> * update branch Signed-off-by: ericharper <complex451@gmail.com> * update config Signed-off-by: ericharper <complex451@gmail.com> * remove duplicate test Signed-off-by: ericharper <complex451@gmail.com> * fix tn test cases Signed-off-by: ericharper <complex451@gmail.com> * add another safe.directory Signed-off-by: ericharper <complex451@gmail.com> * typo Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: treacker <36159472+treacker@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: 
Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Yi Dong <43824965+yidong72@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: Ghasem <35242805+pasandi20@users.noreply.github.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Aleksandr Laptev <laptevsasha12@gmail.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> Signed-off-by: stevehuang52 <heh@nvidia.com> * fix full_randn bucket hang Signed-off-by: stevehuang52 <heh@nvidia.com> * remove unused variables Signed-off-by: stevehuang52 <heh@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: treacker <36159472+treacker@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Yi Dong <43824965+yidong72@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: ekmb <ebakhturina@nvidia.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: Ghasem <35242805+pasandi20@users.noreply.github.com> Co-authored-by: Ghasem Pasandi <gpasandi@nvidia.com> Co-authored-by: Aleksandr Laptev 
<laptevsasha12@gmail.com> Co-authored-by: Aleksandr Laptev <alaptev@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Bits of RADTTS support (#4343) * Bits of RADTTS support Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Fixed args mismatch Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Style Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Addressed review comments Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * More review comments Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Prompt Learning Pipeline Parallel (#4291) * Added get_forward_output_and_loss_func and updated train/val steps Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added preprocess flag before prompt table/encoder access Signed-off-by: Virginia Adams <vadams@nvidia.com> * Made two optimizer groups, one for frozen, one for soft prompt Signed-off-by: Virginia Adams <vadams@nvidia.com> * Pipeline parallel working Signed-off-by: Virginia Adams <vadams@nvidia.com> * Still figuring out setting lr/sched for one param group Signed-off-by: Virginia Adams <vadams@nvidia.com> * Set betas to zero Signed-off-by: Virginia Adams <vadams@nvidia.com> * Only unfreeze one sublayer with lr 0.0 Signed-off-by: Virginia Adams <vadams@nvidia.com> * Pipeline parallel working w/ one optimizer Signed-off-by: Virginia Adams <vadams@nvidia.com> * Trying to fix Jenkins file Signed-off-by: Virginia Adams <vadams@nvidia.com> * Trying to fix Jenkins file Signed-off-by: Virginia Adams <vadams@nvidia.com> * Getting updated jenkins test to work Signed-off-by: Virginia Adams <vadams@nvidia.com> * Getting updated jenkins test to work Signed-off-by: Virginia Adams <vadams@nvidia.com> * added prompt learning tp and pp CI tests Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added amp_o2 model compatibility Signed-off-by: Virginia Adams <vadams@nvidia.com> * Made CI test 
smaller Signed-off-by: Virginia Adams <vadams@nvidia.com> * Still trying to get Jenkins to work Signed-off-by: Virginia Adams <vadams@nvidia.com> * Still trying to get Jenkins to work Signed-off-by: Virginia Adams <vadams@nvidia.com> * Temporarily moving prompt learning CI test to beginning Signed-off-by: Virginia Adams <vadams@nvidia.com> * Changing the layer being unfrozen Signed-off-by: Virginia Adams <vadams@nvidia.com> * debug jenkins Signed-off-by: Virginia Adams <vadams@nvidia.com> * Move pp unfreeze to init Signed-off-by: Virginia Adams <vadams@nvidia.com> * Try to make Jenkins test parallel Signed-off-by: Virginia Adams <vadams@nvidia.com> * Fix python formatting Signed-off-by: Virginia Adams <vadams@nvidia.com> * Moved prompt learning tests back to where they belong Signed-off-by: Virginia Adams <vadams@nvidia.com> * add back checkpoint convertion CI test Signed-off-by: Virginia Adams <vadams@nvidia.com> * Revert "add back checkpoint convertion CI test" This reverts commit 61e2ffcdefe964c8e74b74d8c10906ae29f32b6d. 
* Add back checkpoint conversion test Signed-off-by: Virginia Adams <vadams@nvidia.com> * Setting requires grad to True everywhere Signed-off-by: Virginia Adams <vadams@nvidia.com> * Updated config comments and simplified param group code Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added comment on frozen_model having lr=0.0 Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added configure optimizers methods Signed-off-by: Virginia Adams <vadams@nvidia.com> * Set amp_o2 to false Signed-off-by: Virginia Adams <vadams@nvidia.com> * removed o2 code Signed-off-by: Virginia Adams <vadams@nvidia.com> * Python formatting fix Signed-off-by: Virginia Adams <vadams@nvidia.com> Co-authored-by: Eric Harper <complex451@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * auto switch conformer encoder adapter in_features (#4354) Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Dataloader, collector, loss and metric for multiscale diarization decoder (#4187) * First commit Signed-off-by: Taejin Park <tango4j@gmail.com> * Checked funtionality and imports Signed-off-by: Taejin Park <tango4j@gmail.com> * fixed import issues Signed-off-by: Taejin Park <tango4j@gmail.com> * Removed the changed made by mistake Signed-off-by: Taejin Park <tango4j@gmail.com> * Style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Fixed LGTM errors 001 Signed-off-by: Taejin Park <tango4j@gmail.com> * Fixed LGTM and style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Changed docstrings Signed-off-by: Taejin Park <tango4j@gmail.com> * LGTM again Signed-off-by: Taejin Park <tango4j@gmail.com> * Removed unnecessary torch setting lines Signed-off-by: Taejin Park <tango4j@gmail.com> * Style fix and isort Signed-off-by: Taejin Park <tango4j@gmail.com> * jbalam-nv comments reflected Signed-off-by: Taejin Park <tango4j@gmail.com> * style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Reflected comments and created 
_diar_label.py Signed-off-by: Taejin Park <tango4j@gmail.com> * Typo fix and style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Fixed target_spks[0] index error Signed-off-by: Taejin Park <tango4j@gmail.com> * style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * LGTM unused import IterDataset Signed-off-by: Taejin Park <tango4j@gmail.com> * revert collection doc year Signed-off-by: Taejin Park <tango4j@gmail.com> * Code format error in collections.py Signed-off-by: Taejin Park <tango4j@gmail.com> * fix collections space format error Signed-off-by: Taejin Park <tango4j@gmail.com> * merged main correctly Signed-off-by: Taejin Park <tango4j@gmail.com> * style fix Signed-off-by: Taejin Park <tango4j@gmail.com> * Reflected all comments and tested Signed-off-by: Taejin Park <tango4j@gmail.com> * style fix and LGTM Signed-off-by: Taejin Park <tango4j@gmail.com> * rttm_filepath to rttm_file and removed self included funcs, tested Signed-off-by: Taejin Park <tango4j@gmail.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Add ASR CTC Decoding module (#4342) * Initial commit Signed-off-by: smajumdar <smajumdar@nvidia.com> * Full support for decoding strategy Signed-off-by: smajumdar <smajumdar@nvidia.com> * Temp Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix labels of y_sequence Signed-off-by: smajumdar <smajumdar@nvidia.com> * Set support for sentencepiece subword merging Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix char and word based token merge alignment Signed-off-by: smajumdar <smajumdar@nvidia.com> * Revert incorrect change Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update docstring Signed-off-by: smajumdar <smajumdar@nvidia.com> * Improve compatibility with greedy tokens and log probs Signed-off-by: smajumdar <smajumdar@nvidia.com> * Update scripts to use decoding strategy Signed-off-by: smajumdar <smajumdar@nvidia.com> * Add tests and docs Signed-off-by: smajumdar 
<smajumdar@nvidia.com> * Add tests and docs Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix speaker decoder timestamps Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix speaker decoder timestamps Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix decoding of ctc models Signed-off-by: smajumdar <smajumdar@nvidia.com> * Address reviewer comments Signed-off-by: smajumdar <smajumdar@nvidia.com> * Address reviewer comments Signed-off-by: smajumdar <smajumdar@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Option to disable mp in VAD via num_workers=1 (#4317) * Option to disable mp in VAD via num_workers=1 In certain environments python multiprocessing can deadlock. This adds a convenient version to disable by setting num_workers to 1. Signed-off-by: Georg Kucsko <gkucsko@gmail.com> * add none handling Signed-off-by: Georg Kucsko <gkucsko@gmail.com> * additional none handling Signed-off-by: Georg Kucsko <gkucsko@gmail.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * remove redundant bias expand (#4382) * remove redundant bias expand Signed-off-by: Xiaowei Ren <xren@nvidia.com> * delete redundant code Signed-off-by: Xiaowei Ren <xren@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Add option for specifying wandb save_dir from config (#4379) * give option to user to specify wandb save dir via config Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * create save_dir directory for wandb logger if not exists Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> * update save_dir get method with a default value Signed-off-by: Shantanu Acharya <shantanua@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Quick wav2vec fix. In-place operation adding convolutional positions to encoder was overwriting leaf history. Wasn't caught on previous torch versions. 
(#4383) Signed-off-by: tbartley94 <tbartley@nvidia.com> Co-authored-by: tbartley94 <tbartley@nvidia.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * [Bugfix][TTS] wrong order of returned tuple for general_collate_fn. (#4388) Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Hainan Xu <hainanx@nvidia.com> * Merge r1.10.0 main (#4398) * update branch Signed-off-by: ericharper <complex451@gmail.com> * Set headscale false (#4364) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Add wandb as dependency (#4365) Signed-off-by: smajumdar <smajumdar@nvidia.com> * Raise trainer error (#4356) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Micha Livne <michalivne@users.noreply.github.com> * Set headscale false (#4364) (#4366) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: smajumdar <smajumdar@nvidia.com> * Finetuning changes for BART (#4003) * Temp Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Checkpoint converter to nemo for bart Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Style Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Micha Livne <michalivne@users.noreply.github.com> * Make position embedding expansion specific to a batch to avoid checkpoint size mismatches (#4357) * Style Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix logging warning Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Micha Livne <michalivne@users.noreply.github.com> * Fix electronic bug, new time ITN rule (#4355) * fix electronic bug Signed-off-by: ekmb <ebakhturina@nvidia.com> * add new itn time rule Signed-off-by: ekmb <ebakhturina@nvidia.com> * revert domain changes Signed-off-by: ekmb <ebakhturina@nvidia.com> * remove repetition Signed-off-by: ekmb <ebakhturina@nvidia.com> * Correct support for dataclasses in default module 
dim (#4372) * Correct support for dataclasses in default module dim Signed-off-by: smajumdar <smajumdar@nvidia.com> * Fix path for save of results Signed-off-by: smajumdar <smajumdar@nvidia.com> * fix pad id bug (#4377) Signed-off-by: Yi Dong <yidong@nvidia.com> * Question answering bug fix (#4381) * refactor dialogue state tracking for modelling/dataset interoperability Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style changes Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style raised by lgtm Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style formatting Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update template to include description of intent Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * changes based on requests in review Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add compatibility with assistant dataset Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove dialogue_state_tracking Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update huggingface utils for dialogue Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * rename dialogue_state_tracking_hybrid to dialogue_state_tracking_sgdqa Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix nemo/collections/nlp/models/dialogue_state_tracking_sgdqa/__init__.py Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkinsfile for SGDGEN 
Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * fix typo Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add docstrings for assistant data processor Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update Jenkins for SGDGEN local checkpoint Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * update style Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * use local vocab file for Jenkinsfile Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * patch for Jenkins CI using local file Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * add slot filling prediction and metrics Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * remove unused code Signed-off-by: Zhilin Wang <zhilinw@nvidia.com> * style fix …
* Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount * new structure for tts datasets in script folder Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * remove cmudict downloading Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * rename mixertts dataset, add vocoder dataset Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add libritts processing Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * update tts dataset and libritts get data Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix bugs in vocoder ds Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add ds * changed vits yaml * rm yaml * fix yaml and model * Added scaler * refactored yaml * managed to run in fp16 * refactoring Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix small bugs and add new todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix optimizers Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * Port Variational Inference with Adversarial Learning (VITS) to NeMo TTS (#6) * Add vits files Add vits_losses.py, vits_modules.py and vits.py. * Move non-vits models to modules * Add vits.yaml * Add _loader to vits.py * Add basic template for vits * Update vits.yaml with vits parameters * Remove extra space * Add top level training script * Add some variables to vits yaml * Add forward and training methods * Fix imports * Added validation step * Log training losses * Update loss calls to use class attributes * Add VITS to models list * Fix all imports * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. 
validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Fix imports for VITS * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Add parameters from original VITS config * Fix config file * Fix imports and generate spec from audio * Fix incorrect dimensions * Progress update * Fix loss * Fix cuda thing * Fix monotonic align import * Fix typos in vits.py * Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Jason <jasoli@nvidia.com> Signed-off-by: Jason <jasoli@nvidia.com> * make new commit Signed-off-by: Jason <jasoli@nvidia.com> * add copyright headers Signed-off-by: Jason <jasoli@nvidia.com> * style Signed-off-by: Jason <jasoli@nvidia.com> * rename README Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix style without vits_modules Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add numba code, fix style and add todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * small fix * fix some todos * added numba mas * added DDP sampler * specified versions * fixed for new librosa version * added feature loss * added IPA phonemizer * refactored IPA g2p * added vits losses * some ref * fix * added checkpointing * cp * cfg * merged some 
1.8.0 fixes * plt fix * fix logging * fix checkpoint loading * refactored inference * fp32 run * update branch Signed-off-by: ericharper <complex451@gmail.com> * update package info Signed-off-by: ericharper <complex451@gmail.com> * new exp * update branch Signed-off-by: ericharper <complex451@gmail.com> * Restored tests previously disabled for 22.03 base (#4109) Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * add augmentation to label models (#4113) * add augmentation to label models Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * duration fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Call register_bert_model after assigning self.bert_model variable (#4116) Signed-off-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> * Tutorial on ITN with Thutmose tagger and small fixes (#4117) * 1. Add tutorial. 2. Move a function to fix import in tutorial. 3. Merge multiple spaces into one space in the final output Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fixes for code review Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * Add tutorial to tutorials.rst Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update the default (#4135) Signed-off-by: ekmb <ebakhturina@nvidia.com> * Draft: Fix restoring from checkpoint for 
case when `model.common_dataset_parameters.label_vocab_dir` is provided (#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Do not create tmp directory Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix parameter name Signed-off-by: PeganovAnton <peganoff2@mail.ru> * finish cherry-pick op Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix labels errors Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove duplicate stage Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Change target branch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * fix typo (#4140) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Fix/punctuation avoid overwritting tmp files (#4144) * Add draft of fixing tmp files overwritting Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Use built-in tempfile library Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix code style Signed-off-by: PeganovAnton <peganoff2@mail.ru> * bug_fix_diarization_manifest_creation (#4125) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> * fix doc (#4146) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Tacotron2 retrain (#4103) * fix yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Fix for new TTSDataset class Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker 
<emshabalin@yandex.ru> * inference fix Signed-off-by: treacker <emshabalin@yandex.ru> * removed old code Signed-off-by: treacker <emshabalin@yandex.ru> * updated parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * reverted version update Signed-off-by: treacker <emshabalin@yandex.ru> * refactored parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * Updated Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Update Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Update tacotron.yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: treacker <emshabalin@yandex.ru> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Fixed jenkins Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep 
Subramanian <sandeep.subramanian.1@umontreal.ca> * Multiprocess improvements (#4127) * initial commit Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * start fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated scp to filelist Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * WaveGlow input type fixes (#4151) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * notebooks' link, typo and import fix (#4158) * redo missing pr 4007 Signed-off-by: fayejf <fayejf07@gmail.com> * remove extremely unreliable links Signed-off-by: fayejf <fayejf07@gmail.com> * Thutmose tagger bug fixes (#4162) * add pretrained ngc model, small fixes Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * 1. fix typos. 2. 
write magic functions without space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add example of inference with pretrained model Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * changed model location to nemo Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * style fix Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * update speaker docs (#4164) * update speaker docs Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * chunks -> segments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Khz -> kHz Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * changed to vits g2p * refactoring * added cosineLR * Updated whitelist path * added vanilla torch grad scaler * Fixed lightning version * added warmup and wd * switched to cosineLR * refactored data classes for vits * some fixes * fixed import * changeg train loop * fixed scheduler bug * refactoring for exps * Refactored loss logic * Ref for exps * added coqui stuff * exps * bugfix * added side file * bugfix * reverted * fixed sampler behaviour * updated for ptl 1.7.2 * refactored dataloader func * some cleaning * reverted to vanilla loss * modified for pickling * added dataset class * fixed torch version * added autocast for fp training * removed coqui files * 'Fixed tokenizer' * Fix tokenizer * update branch Signed-off-by: ericharper <complex451@gmail.com> * Fix link to inference notebook (#5247) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Update ASR scores table (#5254) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Fix links to speaker identification notebook (#5260) Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> * Minor typo fixes in TTS tutorial (#5266) 
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Pcla tutorial fixes (#5271) * Fixed typos Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed cell type and tatoeba reference Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed typo Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed branch variable Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix bug into Dialogue tutorial (#5277) * Typo fix (#5288) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix dialogue tutorial bug (#5297) * set add_pooling_layer=False for huggingface bert model * remove add_pooling_layer=False and set find_unused_parameters=True * set num_prompt_tokens to 0 for huggingface * small bugfix for r1.13.0 (#5310) * typo fix Signed-off-by: fayejf <fayejf07@gmail.com> * udpate transcribe Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: fayejf <fayejf07@gmail.com> * Add italian model checkpoints (#5316) Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> * [STT] Add Ru ASR Conformer-CTC and Conformer-Transducer (#5340) * [STT] Add stt_ru_conformer_ctc_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [STT] Add stt_ru_conformer_transducer_large Add stt_ru_conformer_transducer_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Pcla tutorial fixes (#5313) * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * moved 
`create_text_and_labels` to token_classification_utils.py Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * a lot of refactoring * strict ptl version * strict ptl version * reverted plt version * Added base text2audio class * Fix issue with HF Model upload tutorial (#5359) * Add Gradio App to ASR Docs (#5270) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> (cherry picked from commit e4b6a38) * Fix issue with normalized config for dataset name Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * tutorial fixes (#5354) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Add SDP documentation (#5274) * Add details to SDP README.md Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to WriteManifest processor Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to CreateInitialManifestMLS Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ModifyManifestTextProcessor docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ASRInference docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add base_processor docstrings Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add minimal SDP docs page Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Update tools/speech_dataset_processor/README.md Co-authored-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> * Write simple README for SDP and move complex explanations to docs Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove incorrect type hints Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Make config example less confusing Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Fix typo Signed-off-by: Elena 
Rastorgueva <erastorgueva@nvidia.com> * Clarify that YAML file is config file in README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove unused imports Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove SDP docs for now Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove links to docs in SDP README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> * [Bugfix] Added rm -f / wget- nc command in multispeaker sim notebook to r1.13.0 (#5375) * Fix minor error in notebook Signed-off-by: Taejin Park <tango4j@gmail.com> * changed branch name in tutorial notebook Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Taejin Park <tango4j@gmail.com> * Rename Speech Dataset Processor to Speech Data Processor (#5378) Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * fix for num worker 0 causing issues in losses after 1 epoch (#5379) * Fixed bug in notebook (#5382) Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> * Force MHA QKV onto fp32 (#5391) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Added scheduling variety * ref * Fix for prompt table restore error (#5393) * Fix for prompt table restore error Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added more saftey checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Added more condition checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Virginia Adams <vadams@nvidia.com> 
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Fix args (#5410) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * bugfix * import tests * Add temporary fix for CUDA issue in Dockerfile (#5421) Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> * Megatron Export Update (#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added comment Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * updated del onnx_model place Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> * disable pc test (#5426) Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: ekmb <ebakhturina@nvidia.com> * Fix GPT generation when using sentencepiece tokenizer (#5413) * Fix Signed-off-by: MaximumEntropy 
<sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Disable sync_batch_comm in validation_step for GPT (#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <complex451@gmail.com> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Empty Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Comment out test Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> * Revert "Add temporary fix for CUDA issue in Dockerfile (#5421)" (#5431) This reverts commit 0718b17. 
* Revert workaround for T5 that sets number of workers to 0 & sync_batch_comm=False (#5420) * Revert workers workaround Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix in config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Fixed discrepancies * updated Jenkinsfile * updated Jenkinsfile * Cleaning * fixed the onnx bug in conformer for non-streaming models. (#5242) (#5446) Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> * Set sync_batch_comm in other places (#5448) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Radtts 1.13 (#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * [TTS] add CI test for RADTTS training recipe.
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Radtts 1.13 plus (#5457) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * Fixing RADTTS training - removing view buffer and fixing accuracy issue * Fixes for Torchscript/Triton * Added autocast to radtts UT * using cuda() for training example Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Add num layers check (#5470) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Change to kwargs (#5475) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) (#5478) * Initial refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for eval Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com 
hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove comments Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Minor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove old comment Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (#5480) * updated export_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * minor bug fix (#5521) Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad 
<dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added set_start_method + function param bugfix (#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <complex451@gmail.com> * remove notebook (#5548) Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: ericharper <complex451@gmail.com> * Remove broadcast (#5558) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * cleaning * Fix all gather while writing to a file during T5 finetuning (#5561) * Gather from data parallel only instead of all ranks Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update readme Signed-off-by: ericharper <complex451@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added copyright * fixed imports * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * cleaning * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fixed filesize check * last cleaning Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated 
cmudict path * fixed merge bug Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * warnings fix * fix warnings Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * storing * updated version Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * update Jenkinsfile versions Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed issues Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed more issues * more fixes Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * added experimental tag * Clarification updates Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * docstring fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * imports fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix typo Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * excessive comtutations fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * typecheck fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Small refactoring * Small refactoring Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * reversed exp_manager params Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Fixed call for new function signature Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes 
from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> Signed-off-by: Jason <jasoli@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: PeganovAnton <peganoff2@mail.ru> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> Co-authored-by: jasonjjl1999 <jasonjjl1999@gmail.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Oktai Tatanov <oktai.tatanov@gmail.com> Co-authored-by: jasonjjl1999 <43978361+jasonjjl1999@users.noreply.github.com> Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: Jason 
<jasoli@nvidia.com> Co-authored-by: ericharper <complex451@gmail.com> Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: Ramanathan Arunachalam <ramanathan.arun@rutgers.edu> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Evelina <10428420+ekmb@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: Jocelyn <jocelynh@nvidia.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Sean Naren <snarenthiran@nvidia.com> Co-authored-by: Matvei Novikov <mattyson.so@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> Co-authored-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Taejin Park <tango4j@gmail.com> Co-authored-by: Adi Renduchintala <108822655+arendu@users.noreply.github.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: yaoyu-33 <54727607+yaoyu-33@users.noreply.github.com> Co-authored-by: David <amosalla@asu.edu> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Oleksii Kuchaiev 
<okuchaiev@nvidia.com> Co-authored-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount * new structure for tts datasets in script folder Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * remove cmudict downloading Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * rename mixertts dataset, add vocoder dataset Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add libritts processing Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * update tts dataset and libritts get data Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix bugs in vocoder ds Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add ds * changed vits yaml * rm yaml * fix yaml and model * Added scaler * refactored yaml * managed to run in fp16 * refactoring Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix small bugs and add new todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix optimizers Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * Port Variational Inference with Adversarial Learning (VITS) to NeMo TTS (NVIDIA-NeMo#6) * Add vits files Add vits_losses.py, vits_modules.py and vits.py. * Move non-vits models to modules * Add vits.yaml * Add _loader to vits.py * Add basic template for vits * Update vits.yaml with vits parameters * Remove extra space * Add top level training script * Add some variables to vits yaml * Add forward and training methods * Fix imports * Added validation step * Log training losses * Update loss calls to use class attributes * Add VITS to models list * Fix all imports * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. 
reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Fix imports for VITS * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Add parameters from original VITS config * Fix config file * Fix imports and generate spec from audio * Fix incorrect dimensions * Progress update * Fix loss * Fix cuda thing * Fix monotonic align import * Fix typos in vits.py * Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Jason <jasoli@nvidia.com> Signed-off-by: Jason <jasoli@nvidia.com> * make new commit Signed-off-by: Jason <jasoli@nvidia.com> * add copyright headers Signed-off-by: Jason <jasoli@nvidia.com> * style Signed-off-by: Jason <jasoli@nvidia.com> * rename README Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix style without vits_modules Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add numba code, fix style and add todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * small fix * fix some todos * added numba mas * added DDP sampler * specified versions * fixed for new librosa version * added feature loss * added IPA phonemizer * refactored IPA g2p * added vits losses * some ref * fix * added checkpointing 
* cp * cfg * merged some 1.8.0 fixes * plt fix * fix logging * fix checkpoint loading * refactored inference * fp32 run * update branch Signed-off-by: ericharper <complex451@gmail.com> * update package info Signed-off-by: ericharper <complex451@gmail.com> * new exp * update branch Signed-off-by: ericharper <complex451@gmail.com> * Restored tests previously disabled for 22.03 base (NVIDIA-NeMo#4109) Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * add augmentation to label models (NVIDIA-NeMo#4113) * add augmentation to label models Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * duration fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Call register_bert_model after assigning self.bert_model variable (NVIDIA-NeMo#4116) Signed-off-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> * Tutorial on ITN with Thutmose tagger and small fixes (NVIDIA-NeMo#4117) * 1. Add tutorial. 2. Move a function to fix import in tutorial. 3. 
Merge multiple spaces into one space in the final output Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fixes for code review Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * Add tutorial to tutorials.rst Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * cleaned up TN/ ITN doc (NVIDIA-NeMo#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Check implicit grad acc in GLUE dataset building (NVIDIA-NeMo#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update the default (NVIDIA-NeMo#4135) Signed-off-by: ekmb <ebakhturina@nvidia.com> * Draft: Fix restoring from checkpoint for case when `model.common_dataset_parameters.label_vocab_dir` is provided (NVIDIA-NeMo#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Do not create tmp directory Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix parameter name Signed-off-by: PeganovAnton <peganoff2@mail.ru> * finish cherry-pick op Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix labels errors Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove duplicate stage Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Change target branch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * fix typo (NVIDIA-NeMo#4140) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * 
Fix/punctuation avoid overwritting tmp files (NVIDIA-NeMo#4144) * Add draft of fixing tmp files overwritting Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Use built-in tempfile library Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix code style Signed-off-by: PeganovAnton <peganoff2@mail.ru> * bug_fix_diarization_manifest_creation (NVIDIA-NeMo#4125) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> * fix doc (NVIDIA-NeMo#4146) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Tacotron2 retrain (NVIDIA-NeMo#4103) * fix yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Fix for new TTSDataset class Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * inference fix Signed-off-by: treacker <emshabalin@yandex.ru> * removed old code Signed-off-by: treacker <emshabalin@yandex.ru> * updated parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * reverted version update Signed-off-by: treacker <emshabalin@yandex.ru> * refactored parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * Updated Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Update Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Update tacotron.yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * 
cleaned up TN/ ITN doc (NVIDIA-NeMo#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: treacker <emshabalin@yandex.ru> * Check implicit grad acc in GLUE dataset building (NVIDIA-NeMo#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Fixed jenkins Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * Multiprocess improvements (NVIDIA-NeMo#4127) * initial commit Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * start fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated scp to filelist Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * WaveGlow input type fixes (NVIDIA-NeMo#4151) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * notebooks' link, typo and import fix (NVIDIA-NeMo#4158) * redo missing pr 4007 Signed-off-by: fayejf <fayejf07@gmail.com> * remove extremely unreliable links Signed-off-by: fayejf <fayejf07@gmail.com> * Thutmose tagger bug fixes (NVIDIA-NeMo#4162) * add pretrained ngc model, small fixes Signed-off-by: Alexandra Antonova 
<aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * 1. fix typos. 2. write magic functions without space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add example of inference with pretrained model Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * changed model location to nemo Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * style fix Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * update speaker docs (NVIDIA-NeMo#4164) * update speaker docs Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * chunks -> segments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Khz -> kHz Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * changed to vits g2p * refactoring * added cosineLR * Updated whitelist path * added vanilla torch grad scaler * Fixed lightning version * added warmup and wd * switched to cosineLR * refactored data classes for vits * some fixes * fixed import * changeg train loop * fixed scheduler bug * refactoring for exps * Refactored loss logic * Ref for exps * added coqui stuff * exps * bugfix * added side file * bugfix * reverted * fixed sampler behaviour * updated for ptl 1.7.2 * refactored dataloader func * some cleaning * reverted to vanilla loss * modified for pickling * added dataset class * fixed torch version * added autocast for fp training * removed coqui files * 'Fixed tokenizer' * Fix tokenizer * update branch Signed-off-by: ericharper <complex451@gmail.com> * Fix link to inference notebook (NVIDIA-NeMo#5247) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Update ASR scores table (NVIDIA-NeMo#5254) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: 
smajumdar <titu1994@gmail.com> * Fix links to speaker identification notebook (NVIDIA-NeMo#5260) Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> * Minor typo fixes in TTS tutorial (NVIDIA-NeMo#5266) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Pcla tutorial fixes (NVIDIA-NeMo#5271) * Fixed typos Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed cell type and tatoeba reference Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed typo Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed branch variable Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix bug into Dialogue tutorial (NVIDIA-NeMo#5277) * Typo fix (NVIDIA-NeMo#5288) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix dialogue tutorial bug (NVIDIA-NeMo#5297) * set add_pooling_layer=False for huggingface bert model * remove add_pooling_layer=False and set find_unused_parameters=True * set num_prompt_tokens to 0 for huggingface * small bugfix for r1.13.0 (NVIDIA-NeMo#5310) * typo fix Signed-off-by: fayejf <fayejf07@gmail.com> * udpate transcribe Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: fayejf <fayejf07@gmail.com> * Add italian model checkpoints (NVIDIA-NeMo#5316) Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> * [STT] Add Ru ASR Conformer-CTC and Conformer-Transducer (NVIDIA-NeMo#5340) * [STT] Add stt_ru_conformer_ctc_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [STT] Add stt_ru_conformer_transducer_large Add stt_ru_conformer_transducer_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Sasha 
Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Pcla tutorial fixes (NVIDIA-NeMo#5313) * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * moved `create_text_and_labels` to token_classification_utils.py Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * a lot of refactoring * strict ptl version * strict ptl version * reverted plt version * Added base text2audio class * Fix issue with HF Model upload tutorial (NVIDIA-NeMo#5359) * Add Gradio App to ASR Docs (NVIDIA-NeMo#5270) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> (cherry picked from commit e4b6a38) * Fix issue with normalized config for dataset name Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * tutorial fixes (NVIDIA-NeMo#5354) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Add SDP documentation (NVIDIA-NeMo#5274) * Add details to SDP README.md Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to WriteManifest processor Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to CreateInitialManifestMLS Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ModifyManifestTextProcessor docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ASRInference docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add base_processor docstrings Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add minimal SDP docs page Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Update tools/speech_dataset_processor/README.md Co-authored-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Elena Rastorgueva 
<80532067+erastorgueva-nv@users.noreply.github.com> * Write simple README for SDP and move complex explanations to docs Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove incorrect type hints Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Make config example less confusing Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Fix typo Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Clarify that YAML file is config file in README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove unused imports Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove SDP docs for now Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove links to docs in SDP README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> * [Bugfix] Added rm -f / wget- nc command in multispeaker sim notebook to r1.13.0 (NVIDIA-NeMo#5375) * Fix minor error in notebook Signed-off-by: Taejin Park <tango4j@gmail.com> * changed branch name in tutorial notebook Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Taejin Park <tango4j@gmail.com> * Rename Speech Dataset Processor to Speech Data Processor (NVIDIA-NeMo#5378) Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * fix for num worker 0 causing issues in losses after 1 epoch (NVIDIA-NeMo#5379) * Fixed bug in notebook (NVIDIA-NeMo#5382) Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> * Force MHA QKV onto fp32 (NVIDIA-NeMo#5391) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Added scheduling variety * ref * Fix for prompt table restore error (NVIDIA-NeMo#5393) * Fix for prompt table restore error 
Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added more saftey checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Added more condition checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Virginia Adams <vadams@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Fix args (NVIDIA-NeMo#5410) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * bugfix * import tests * Add temporary fix for CUDA issue in Dockerfile (NVIDIA-NeMo#5421) Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> * Megatron Export Update (NVIDIA-NeMo#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added comment Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * updated del onnx_model place Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad 
<dmosallanezh@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> * disable pc test (NVIDIA-NeMo#5426) Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: ekmb <ebakhturina@nvidia.com> * Fix GPT generation when using sentencepiece tokenizer (NVIDIA-NeMo#5413) * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Disable sync_batch_comm in validation_step for GPT (NVIDIA-NeMo#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <complex451@gmail.com> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Empty Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Comment out test Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> * Revert "Add temporary fix for CUDA issue in Dockerfile (NVIDIA-NeMo#5421)" (NVIDIA-NeMo#5431) This reverts 
commit 0718b17. * Revert workaround for T5 that sets number of workers to 0 & sync_batch_comm=False (NVIDIA-NeMo#5420) * Revert workers workaround Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix in config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Fixed discrepancies * updated Jenkisfile * updated Jenkisfile * Cleaning * fixed the onnx bug in conformer for non-streaming models. (NVIDIA-NeMo#5242) (NVIDIA-NeMo#5446) Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> * Set sync_batch_comm in other places (NVIDIA-NeMo#5448) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Radtts 1.13 (NVIDIA-NeMo#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (NVIDIA-NeMo#5358) * [TTS] add CI test for RADTTS training recipe. 
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Radtts 1.13 plus (NVIDIA-NeMo#5457) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (NVIDIA-NeMo#5358) * Fixing RADTTS training - removing view buffer and fixing accuracy issue * Fixes for Torchscript/Triton * Added autocast to radtts UT * using cuda() for training example Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Add num layers check (NVIDIA-NeMo#5470) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Change to kwargs (NVIDIA-NeMo#5475) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (NVIDIA-NeMo#5339) (NVIDIA-NeMo#5478) * Initial refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for eval Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Refactor Signed-off-by: MaximumEntropy 
<sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove comments Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Minor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove old comment Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (NVIDIA-NeMo#5480) * updated export_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (NVIDIA-NeMo#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * minor bug fix 
(NVIDIA-NeMo#5521) Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added set_start_method + function param bugfix (NVIDIA-NeMo#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <complex451@gmail.com> * remove notebook (NVIDIA-NeMo#5548) Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: ericharper <complex451@gmail.com> * Remove broadcast (NVIDIA-NeMo#5558) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * cleaning * Fix all gather while writing to a file during T5 finetuning (NVIDIA-NeMo#5561) * Gather from data parallel only instead of all ranks Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update readme Signed-off-by: ericharper <complex451@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added copyright * fixed imports * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * cleaning * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fixed filesize check * last cleaning 
Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated cmudict path * fixed merge bug Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * warnings fix * fix warnings Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * storing * updated version Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * update Jenkinsfile versions Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed issues Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed more issues * more fixes Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * added experimental tag * Clarification updates Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * docstring fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * imports fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix typo Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * excessive comtutations fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * typecheck fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Small refactoring * Small refactoring Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * reversed exp_manager params 
Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Fixed call for new function signature Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> Signed-off-by: Jason <jasoli@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: PeganovAnton <peganoff2@mail.ru> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> Co-authored-by: jasonjjl1999 <jasonjjl1999@gmail.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Oktai Tatanov <oktai.tatanov@gmail.com> Co-authored-by: jasonjjl1999 
<43978361+jasonjjl1999@users.noreply.github.com> Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: Jason <jasoli@nvidia.com> Co-authored-by: ericharper <complex451@gmail.com> Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: Ramanathan Arunachalam <ramanathan.arun@rutgers.edu> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Evelina <10428420+ekmb@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: Jocelyn <jocelynh@nvidia.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Sean Naren <snarenthiran@nvidia.com> Co-authored-by: Matvei Novikov <mattyson.so@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> Co-authored-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Taejin Park <tango4j@gmail.com> Co-authored-by: Adi Renduchintala <108822655+arendu@users.noreply.github.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: yaoyu-33 <54727607+yaoyu-33@users.noreply.github.com> Co-authored-by: David <amosalla@asu.edu> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: Yi Dong 
<yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount * new structure for tts datasets in script folder Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * remove cmudict downloading Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * rename mixertts dataset, add vocoder dataset Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add libritts processing Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * update tts dataset and libritts get data Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix bugs in vocoder ds Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add ds * changed vits yaml * rm yaml * fix yaml and model * Added scaler * refactored yaml * managed to run in fp16 * refactoring Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix small bugs and add new todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix optimizers Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * Port Variational Inference with Adversarial Learning (VITS) to NeMo TTS (#6) * Add vits files Add vits_losses.py, vits_modules.py and vits.py. * Move non-vits models to modules * Add vits.yaml * Add _loader to vits.py * Add basic template for vits * Update vits.yaml with vits parameters * Remove extra space * Add top level training script * Add some variables to vits yaml * Add forward and training methods * Fix imports * Added validation step * Log training losses * Update loss calls to use class attributes * Add VITS to models list * Fix all imports * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. 
validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Fix imports for VITS * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Add parameters from original VITS config * Fix config file * Fix imports and generate spec from audio * Fix incorrect dimensions * Progress update * Fix loss * Fix cuda thing * Fix monotonic align import * Fix typos in vits.py * Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Jason <jasoli@nvidia.com> Signed-off-by: Jason <jasoli@nvidia.com> * make new commit Signed-off-by: Jason <jasoli@nvidia.com> * add copyright headers Signed-off-by: Jason <jasoli@nvidia.com> * style Signed-off-by: Jason <jasoli@nvidia.com> * rename README Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix style without vits_modules Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add numba code, fix style and add todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * small fix * fix some todos * added numba mas * added DDP sampler * specified versions * fixed for new librosa version * added feature loss * added IPA phonemizer * refactored IPA g2p * added vits losses * some ref * fix * added checkpointing * cp * cfg * merged some 
1.8.0 fixes * plt fix * fix logging * fix checkpoint loading * refactored inference * fp32 run * update branch Signed-off-by: ericharper <complex451@gmail.com> * update package info Signed-off-by: ericharper <complex451@gmail.com> * new exp * update branch Signed-off-by: ericharper <complex451@gmail.com> * Restored tests previously disabled for 22.03 base (#4109) Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * add augmentation to label models (#4113) * add augmentation to label models Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * duration fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Call register_bert_model after assigning self.bert_model variable (#4116) Signed-off-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> * Tutorial on ITN with Thutmose tagger and small fixes (#4117) * 1. Add tutorial. 2. Move a function to fix import in tutorial. 3. Merge multiple spaces into one space in the final output Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fixes for code review Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * Add tutorial to tutorials.rst Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update the default (#4135) Signed-off-by: ekmb <ebakhturina@nvidia.com> * Draft: Fix restoring from checkpoint for 
case when `model.common_dataset_parameters.label_vocab_dir` is provided (#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Do not create tmp directory Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix parameter name Signed-off-by: PeganovAnton <peganoff2@mail.ru> * finish cherry-pick op Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix labels errors Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove duplicate stage Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Change target branch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * fix typo (#4140) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Fix/punctuation avoid overwritting tmp files (#4144) * Add draft of fixing tmp files overwritting Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Use built-in tempfile library Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix code style Signed-off-by: PeganovAnton <peganoff2@mail.ru> * bug_fix_diarization_manifest_creation (#4125) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> * fix doc (#4146) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Tacotron2 retrain (#4103) * fix yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Fix for new TTSDataset class Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker 
<emshabalin@yandex.ru> * inference fix Signed-off-by: treacker <emshabalin@yandex.ru> * removed old code Signed-off-by: treacker <emshabalin@yandex.ru> * updated parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * reverted version update Signed-off-by: treacker <emshabalin@yandex.ru> * refactored parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * Updated Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Update Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Update tacotron.yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: treacker <emshabalin@yandex.ru> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Fixed jenkins Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep 
Subramanian <sandeep.subramanian.1@umontreal.ca> * Multiprocess improvements (#4127) * initial commit Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * start fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated scp to filelist Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * WaveGlow input type fixes (#4151) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * notebooks' link, typo and import fix (#4158) * redo missing pr 4007 Signed-off-by: fayejf <fayejf07@gmail.com> * remove extremely unreliable links Signed-off-by: fayejf <fayejf07@gmail.com> * Thutmose tagger bug fixes (#4162) * add pretrained ngc model, small fixes Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * 1. fix typos. 2. 
write magic functions without space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add example of inference with pretrained model Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * changed model location to nemo Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * style fix Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * update speaker docs (#4164) * update speaker docs Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * chunks -> segments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Khz -> kHz Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * changed to vits g2p * refactoring * added cosineLR * Updated whitelist path * added vanilla torch grad scaler * Fixed lightning version * added warmup and wd * switched to cosineLR * refactored data classes for vits * some fixes * fixed import * changeg train loop * fixed scheduler bug * refactoring for exps * Refactored loss logic * Ref for exps * added coqui stuff * exps * bugfix * added side file * bugfix * reverted * fixed sampler behaviour * updated for ptl 1.7.2 * refactored dataloader func * some cleaning * reverted to vanilla loss * modified for pickling * added dataset class * fixed torch version * added autocast for fp training * removed coqui files * 'Fixed tokenizer' * Fix tokenizer * update branch Signed-off-by: ericharper <complex451@gmail.com> * Fix link to inference notebook (#5247) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Update ASR scores table (#5254) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Fix links to speaker identification notebook (#5260) Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> * Minor typo fixes in TTS tutorial (#5266) 
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Pcla tutorial fixes (#5271) * Fixed typos Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed cell type and tatoeba reference Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed typo Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed branch variable Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix bug into Dialogue tutorial (#5277) * Typo fix (#5288) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix dialogue tutorial bug (#5297) * set add_pooling_layer=False for huggingface bert model * remove add_pooling_layer=False and set find_unused_parameters=True * set num_prompt_tokens to 0 for huggingface * small bugfix for r1.13.0 (#5310) * typo fix Signed-off-by: fayejf <fayejf07@gmail.com> * udpate transcribe Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: fayejf <fayejf07@gmail.com> * Add italian model checkpoints (#5316) Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> * [STT] Add Ru ASR Conformer-CTC and Conformer-Transducer (#5340) * [STT] Add stt_ru_conformer_ctc_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [STT] Add stt_ru_conformer_transducer_large Add stt_ru_conformer_transducer_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Pcla tutorial fixes (#5313) * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * moved 
`create_text_and_labels` to token_classification_utils.py Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * a lot of refactoring * strict ptl version * strict ptl version * reverted plt version * Added base text2audio class * Fix issue with HF Model upload tutorial (#5359) * Add Gradio App to ASR Docs (#5270) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> (cherry picked from commit e4b6a38) * Fix issue with normalized config for dataset name Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * tutorial fixes (#5354) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Add SDP documentation (#5274) * Add details to SDP README.md Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to WriteManifest processor Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to CreateInitialManifestMLS Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ModifyManifestTextProcessor docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ASRInference docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add base_processor docstrings Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add minimal SDP docs page Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Update tools/speech_dataset_processor/README.md Co-authored-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> * Write simple README for SDP and move complex explanations to docs Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove incorrect type hints Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Make config example less confusing Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Fix typo Signed-off-by: Elena 
Rastorgueva <erastorgueva@nvidia.com> * Clarify that YAML file is config file in README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove unused imports Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove SDP docs for now Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove links to docs in SDP README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> * [Bugfix] Added rm -f / wget- nc command in multispeaker sim notebook to r1.13.0 (#5375) * Fix minor error in notebook Signed-off-by: Taejin Park <tango4j@gmail.com> * changed branch name in tutorial notebook Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Taejin Park <tango4j@gmail.com> * Rename Speech Dataset Processor to Speech Data Processor (#5378) Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * fix for num worker 0 causing issues in losses after 1 epoch (#5379) * Fixed bug in notebook (#5382) Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> * Force MHA QKV onto fp32 (#5391) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Added scheduling variety * ref * Fix for prompt table restore error (#5393) * Fix for prompt table restore error Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added more saftey checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Added more condition checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Virginia Adams <vadams@nvidia.com> 
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Fix args (#5410) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * bugfix * import tests * Add temporary fix for CUDA issue in Dockerfile (#5421) Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> * Megatron Export Update (#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added comment Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * updated del onnx_model place Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> * disable pc test (#5426) Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: ekmb <ebakhturina@nvidia.com> * Fix GPT generation when using sentencepiece tokenizer (#5413) * Fix Signed-off-by: MaximumEntropy 
<sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Disable sync_batch_comm in validation_step for GPT (#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <complex451@gmail.com> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Empty Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Comment out test Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> * Revert "Add temporary fix for CUDA issue in Dockerfile (#5421)" (#5431) This reverts commit 0718b17. 
* Revert workaround for T5 that sets number of workers to 0 & sync_batch_comm=False (#5420) * Revert workers workaround Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix in config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Fixed discrepancies * updated Jenkinsfile * updated Jenkinsfile * Cleaning * fixed the onnx bug in conformer for non-streaming models. (#5242) (#5446) Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> * Set sync_batch_comm in other places (#5448) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Radtts 1.13 (#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * [TTS] add CI test for RADTTS training recipe. 
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Radtts 1.13 plus (#5457) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * Fixing RADTTS training - removing view buffer and fixing accuracy issue * Fixes for Torchscript/Triton * Added autocast to radtts UT * using cuda() for training example Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Add num layers check (#5470) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Change to kwargs (#5475) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) (#5478) * Initial refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for eval Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com 
hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove comments Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Minor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove old comment Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (#5480) * updated export_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * minor bug fix (#5521) Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad 
<dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added set_start_method + function param bugfix (#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <complex451@gmail.com> * remove notebook (#5548) Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: ericharper <complex451@gmail.com> * Remove broadcast (#5558) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * cleaning * Fix all gather while writing to a file during T5 finetuning (#5561) * Gather from data parallel only instead of all ranks Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update readme Signed-off-by: ericharper <complex451@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added copyright * fixed imports * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * cleaning * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fixed filesize check * last cleaning Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated 
cmudict path * fixed merge bug Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * warnings fix * fix warnings Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * storing * updated version Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * update Jenkinsfile versions Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed issues Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed more issues * more fixes Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * added experimental tag * Clarification updates Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * docstring fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * imports fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix typo Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * excessive comtutations fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * typecheck fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Small refactoring * Small refactoring Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * reversed exp_manager params Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Fixed call for new function signature Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes 
from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> Signed-off-by: Jason <jasoli@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: PeganovAnton <peganoff2@mail.ru> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> Co-authored-by: jasonjjl1999 <jasonjjl1999@gmail.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Oktai Tatanov <oktai.tatanov@gmail.com> Co-authored-by: jasonjjl1999 <43978361+jasonjjl1999@users.noreply.github.com> Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: Jason 
<jasoli@nvidia.com> Co-authored-by: ericharper <complex451@gmail.com> Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: Ramanathan Arunachalam <ramanathan.arun@rutgers.edu> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Evelina <10428420+ekmb@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: Jocelyn <jocelynh@nvidia.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Sean Naren <snarenthiran@nvidia.com> Co-authored-by: Matvei Novikov <mattyson.so@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> Co-authored-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Taejin Park <tango4j@gmail.com> Co-authored-by: Adi Renduchintala <108822655+arendu@users.noreply.github.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: yaoyu-33 <54727607+yaoyu-33@users.noreply.github.com> Co-authored-by: David <amosalla@asu.edu> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Oleksii Kuchaiev 
<okuchaiev@nvidia.com> Co-authored-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount * new structure for tts datasets in script folder Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * remove cmudict downloading Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * rename mixertts dataset, add vocoder dataset Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add libritts processing Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * update tts dataset and libritts get data Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix bugs in vocoder ds Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add ds * changed vits yaml * rm yaml * fix yaml and model * Added scaler * refactored yaml * managed to run in fp16 * refactoring Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix small bugs and add new todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix optimizers Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * Port Variational Inference with Adversarial Learning (VITS) to NeMo TTS (#6) * Add vits files Add vits_losses.py, vits_modules.py and vits.py. * Move non-vits models to modules * Add vits.yaml * Add _loader to vits.py * Add basic template for vits * Update vits.yaml with vits parameters * Remove extra space * Add top level training script * Add some variables to vits yaml * Add forward and training methods * Fix imports * Added validation step * Log training losses * Update loss calls to use class attributes * Add VITS to models list * Fix all imports * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. 
validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Fix imports for VITS * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Add parameters from original VITS config * Fix config file * Fix imports and generate spec from audio * Fix incorrect dimensions * Progress update * Fix loss * Fix cuda thing * Fix monotonic align import * Fix typos in vits.py * Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Jason <jasoli@nvidia.com> Signed-off-by: Jason <jasoli@nvidia.com> * make new commit Signed-off-by: Jason <jasoli@nvidia.com> * add copyright headers Signed-off-by: Jason <jasoli@nvidia.com> * style Signed-off-by: Jason <jasoli@nvidia.com> * rename README Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix style without vits_modules Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add numba code, fix style and add todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * small fix * fix some todos * added numba mas * added DDP sampler * specified versions * fixed for new librosa version * added feature loss * added IPA phonemizer * refactored IPA g2p * added vits losses * some ref * fix * added checkpointing * cp * cfg * merged some 
1.8.0 fixes * plt fix * fix logging * fix checkpoint loading * refactored inference * fp32 run * update branch Signed-off-by: ericharper <complex451@gmail.com> * update package info Signed-off-by: ericharper <complex451@gmail.com> * new exp * update branch Signed-off-by: ericharper <complex451@gmail.com> * Restored tests previously disabled for 22.03 base (#4109) Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * add augmentation to label models (#4113) * add augmentation to label models Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * duration fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Call register_bert_model after assigning self.bert_model variable (#4116) Signed-off-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> * Tutorial on ITN with Thutmose tagger and small fixes (#4117) * 1. Add tutorial. 2. Move a function to fix import in tutorial. 3. Merge multiple spaces into one space in the final output Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fixes for code review Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * Add tutorial to tutorials.rst Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update the default (#4135) Signed-off-by: ekmb <ebakhturina@nvidia.com> * Draft: Fix restoring from checkpoint for 
case when `model.common_dataset_parameters.label_vocab_dir` is provided (#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Do not create tmp directory Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix parameter name Signed-off-by: PeganovAnton <peganoff2@mail.ru> * finish cherry-pick op Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix labels errors Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove duplicate stage Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Change target branch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * fix typo (#4140) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Fix/punctuation avoid overwritting tmp files (#4144) * Add draft of fixing tmp files overwritting Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Use built-in tempfile library Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix code style Signed-off-by: PeganovAnton <peganoff2@mail.ru> * bug_fix_diarization_manifest_creation (#4125) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> * fix doc (#4146) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Tacotron2 retrain (#4103) * fix yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Fix for new TTSDataset class Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker 
<emshabalin@yandex.ru> * inference fix Signed-off-by: treacker <emshabalin@yandex.ru> * removed old code Signed-off-by: treacker <emshabalin@yandex.ru> * updated parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * reverted version update Signed-off-by: treacker <emshabalin@yandex.ru> * refactored parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * Updated Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Update Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Update tacotron.yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * cleaned up TN/ ITN doc (#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: treacker <emshabalin@yandex.ru> * Check implicit grad acc in GLUE dataset building (#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Fixed jenkins Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep 
Subramanian <sandeep.subramanian.1@umontreal.ca> * Multiprocess improvements (#4127) * initial commit Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * start fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated scp to filelist Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * WaveGlow input type fixes (#4151) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * notebooks' link, typo and import fix (#4158) * redo missing pr 4007 Signed-off-by: fayejf <fayejf07@gmail.com> * remove extremely unreliable links Signed-off-by: fayejf <fayejf07@gmail.com> * Thutmose tagger bug fixes (#4162) * add pretrained ngc model, small fixes Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * 1. fix typos. 2. 
write magic functions without space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add example of inference with pretrained model Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * changed model location to nemo Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * style fix Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * update speaker docs (#4164) * update speaker docs Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * chunks -> segments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Khz -> kHz Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * changed to vits g2p * refactoring * added cosineLR * Updated whitelist path * added vanilla torch grad scaler * Fixed lightning version * added warmup and wd * switched to cosineLR * refactored data classes for vits * some fixes * fixed import * changeg train loop * fixed scheduler bug * refactoring for exps * Refactored loss logic * Ref for exps * added coqui stuff * exps * bugfix * added side file * bugfix * reverted * fixed sampler behaviour * updated for ptl 1.7.2 * refactored dataloader func * some cleaning * reverted to vanilla loss * modified for pickling * added dataset class * fixed torch version * added autocast for fp training * removed coqui files * 'Fixed tokenizer' * Fix tokenizer * update branch Signed-off-by: ericharper <complex451@gmail.com> * Fix link to inference notebook (#5247) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Update ASR scores table (#5254) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Fix links to speaker identification notebook (#5260) Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> * Minor typo fixes in TTS tutorial (#5266) 
Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Pcla tutorial fixes (#5271) * Fixed typos Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed cell type and tatoeba reference Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed typo Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed branch variable Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix bug into Dialogue tutorial (#5277) * Typo fix (#5288) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix dialogue tutorial bug (#5297) * set add_pooling_layer=False for huggingface bert model * remove add_pooling_layer=False and set find_unused_parameters=True * set num_prompt_tokens to 0 for huggingface * small bugfix for r1.13.0 (#5310) * typo fix Signed-off-by: fayejf <fayejf07@gmail.com> * udpate transcribe Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: fayejf <fayejf07@gmail.com> * Add italian model checkpoints (#5316) Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> * [STT] Add Ru ASR Conformer-CTC and Conformer-Transducer (#5340) * [STT] Add stt_ru_conformer_ctc_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [STT] Add stt_ru_conformer_transducer_large Add stt_ru_conformer_transducer_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Pcla tutorial fixes (#5313) * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * moved 
`create_text_and_labels` to token_classification_utils.py Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * a lot of refactoring * strict ptl version * strict ptl version * reverted plt version * Added base text2audio class * Fix issue with HF Model upload tutorial (#5359) * Add Gradio App to ASR Docs (#5270) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> (cherry picked from commit e4b6a38) * Fix issue with normalized config for dataset name Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * tutorial fixes (#5354) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Add SDP documentation (#5274) * Add details to SDP README.md Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to WriteManifest processor Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to CreateInitialManifestMLS Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ModifyManifestTextProcessor docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ASRInference docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add base_processor docstrings Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add minimal SDP docs page Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Update tools/speech_dataset_processor/README.md Co-authored-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> * Write simple README for SDP and move complex explanations to docs Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove incorrect type hints Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Make config example less confusing Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Fix typo Signed-off-by: Elena 
Rastorgueva <erastorgueva@nvidia.com> * Clarify that YAML file is config file in README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove unused imports Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove SDP docs for now Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove links to docs in SDP README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> * [Bugfix] Added rm -f / wget- nc command in multispeaker sim notebook to r1.13.0 (#5375) * Fix minor error in notebook Signed-off-by: Taejin Park <tango4j@gmail.com> * changed branch name in tutorial notebook Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Taejin Park <tango4j@gmail.com> * Rename Speech Dataset Processor to Speech Data Processor (#5378) Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * fix for num worker 0 causing issues in losses after 1 epoch (#5379) * Fixed bug in notebook (#5382) Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> * Force MHA QKV onto fp32 (#5391) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Added scheduling variety * ref * Fix for prompt table restore error (#5393) * Fix for prompt table restore error Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added more saftey checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Added more condition checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Virginia Adams <vadams@nvidia.com> 
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Fix args (#5410) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * bugfix * import tests * Add temporary fix for CUDA issue in Dockerfile (#5421) Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> * Megatron Export Update (#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added comment Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * updated del onnx_model place Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> * disable pc test (#5426) Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: ekmb <ebakhturina@nvidia.com> * Fix GPT generation when using sentencepiece tokenizer (#5413) * Fix Signed-off-by: MaximumEntropy 
<sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Disable sync_batch_comm in validation_step for GPT (#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <complex451@gmail.com> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Empty Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Comment out test Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> * Revert "Add temporary fix for CUDA issue in Dockerfile (#5421)" (#5431) This reverts commit 0718b17. 
* Revert workaround for T5 that sets number of workers to 0 & sync_batch_comm=False (#5420) * Revert workers workaround Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix in config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Fixed discrepancies * updated Jenkisfile * updated Jenkisfile * Cleaning * fixed the onnx bug in conformer for non-streaming models. (#5242) (#5446) Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> * Set sync_batch_comm in other places (#5448) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Radtts 1.13 (#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * [TTS] add CI test for RADTTS training recipe. 
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Radtts 1.13 plus (#5457) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (#5358) * Fixing RADTTS training - removing view buffer and fixing accuracy issue * Fixes for Torchscript/Triton * Added autocast to radtts UT * using cuda() for training example Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Add num layers check (#5470) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Change to kwargs (#5475) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (#5339) (#5478) * Initial refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for eval Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com 
hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove comments Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Minor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove old comment Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (#5480) * updated export_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * minor bug fix (#5521) Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad 
<dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added set_start_method + function param bugfix (#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <complex451@gmail.com> * remove notebook (#5548) Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: ericharper <complex451@gmail.com> * Remove broadcast (#5558) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * cleaning * Fix all gather while writing to a file during T5 finetuning (#5561) * Gather from data parallel only instead of all ranks Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update readme Signed-off-by: ericharper <complex451@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added copyright * fixed imports * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * cleaning * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fixed filesize check * last cleaning Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated 
cmudict path * fixed merge bug Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * warnings fix * fix warnings Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * storing * updated version Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * update Jenkinsfile versions Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed issues Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed more issues * more fixes Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * added experimental tag * Clarification updates Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * docstring fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * imports fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix typo Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * excessive comtutations fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * typecheck fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Small refactoring * Small refactoring Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * reversed exp_manager params Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Fixed call for new function signature Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes 
from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> Signed-off-by: Jason <jasoli@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: PeganovAnton <peganoff2@mail.ru> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> Co-authored-by: jasonjjl1999 <jasonjjl1999@gmail.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Oktai Tatanov <oktai.tatanov@gmail.com> Co-authored-by: jasonjjl1999 <43978361+jasonjjl1999@users.noreply.github.com> Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: Jason 
<jasoli@nvidia.com> Co-authored-by: ericharper <complex451@gmail.com> Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: Ramanathan Arunachalam <ramanathan.arun@rutgers.edu> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Evelina <10428420+ekmb@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: Jocelyn <jocelynh@nvidia.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Sean Naren <snarenthiran@nvidia.com> Co-authored-by: Matvei Novikov <mattyson.so@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> Co-authored-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Taejin Park <tango4j@gmail.com> Co-authored-by: Adi Renduchintala <108822655+arendu@users.noreply.github.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: yaoyu-33 <54727607+yaoyu-33@users.noreply.github.com> Co-authored-by: David <amosalla@asu.edu> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Oleksii Kuchaiev 
<okuchaiev@nvidia.com> Co-authored-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount * new structure for tts datasets in script folder Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * remove cmudict downloading Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * rename mixertts dataset, add vocoder dataset Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add libritts processing Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * update tts dataset and libritts get data Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix bugs in vocoder ds Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add ds * changed vits yaml * rm yaml * fix yaml and model * Added scaler * refactored yaml * managed to run in fp16 * refactoring Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix small bugs and add new todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix optimizers Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * Port Variational Inference with Adversarial Learning (VITS) to NeMo TTS (NVIDIA-NeMo#6) * Add vits files Add vits_losses.py, vits_modules.py and vits.py. * Move non-vits models to modules * Add vits.yaml * Add _loader to vits.py * Add basic template for vits * Update vits.yaml with vits parameters * Remove extra space * Add top level training script * Add some variables to vits yaml * Add forward and training methods * Fix imports * Added validation step * Log training losses * Update loss calls to use class attributes * Add VITS to models list * Fix all imports * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. 
reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Fix imports for VITS * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Add parameters from original VITS config * Fix config file * Fix imports and generate spec from audio * Fix incorrect dimensions * Progress update * Fix loss * Fix cuda thing * Fix monotonic align import * Fix typos in vits.py * Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Jason <jasoli@nvidia.com> Signed-off-by: Jason <jasoli@nvidia.com> * make new commit Signed-off-by: Jason <jasoli@nvidia.com> * add copyright headers Signed-off-by: Jason <jasoli@nvidia.com> * style Signed-off-by: Jason <jasoli@nvidia.com> * rename README Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix style without vits_modules Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add numba code, fix style and add todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * small fix * fix some todos * added numba mas * added DDP sampler * specified versions * fixed for new librosa version * added feature loss * added IPA phonemizer * refactored IPA g2p * added vits losses * some ref * fix * added checkpointing 
* cp * cfg * merged some 1.8.0 fixes * plt fix * fix logging * fix checkpoint loading * refactored inference * fp32 run * update branch Signed-off-by: ericharper <complex451@gmail.com> * update package info Signed-off-by: ericharper <complex451@gmail.com> * new exp * update branch Signed-off-by: ericharper <complex451@gmail.com> * Restored tests previously disabled for 22.03 base (NVIDIA-NeMo#4109) Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * add augmentation to label models (NVIDIA-NeMo#4113) * add augmentation to label models Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * duration fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Call register_bert_model after assigning self.bert_model variable (NVIDIA-NeMo#4116) Signed-off-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> * Tutorial on ITN with Thutmose tagger and small fixes (NVIDIA-NeMo#4117) * 1. Add tutorial. 2. Move a function to fix import in tutorial. 3. 
Merge multiple spaces into one space in the final output Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fixes for code review Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * Add tutorial to tutorials.rst Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * cleaned up TN/ ITN doc (NVIDIA-NeMo#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Check implicit grad acc in GLUE dataset building (NVIDIA-NeMo#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update the default (NVIDIA-NeMo#4135) Signed-off-by: ekmb <ebakhturina@nvidia.com> * Draft: Fix restoring from checkpoint for case when `model.common_dataset_parameters.label_vocab_dir` is provided (NVIDIA-NeMo#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Do not create tmp directory Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix parameter name Signed-off-by: PeganovAnton <peganoff2@mail.ru> * finish cherry-pick op Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix labels errors Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove duplicate stage Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Change target branch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * fix typo (NVIDIA-NeMo#4140) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * 
Fix/punctuation avoid overwritting tmp files (NVIDIA-NeMo#4144) * Add draft of fixing tmp files overwritting Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Use built-in tempfile library Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix code style Signed-off-by: PeganovAnton <peganoff2@mail.ru> * bug_fix_diarization_manifest_creation (NVIDIA-NeMo#4125) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> * fix doc (NVIDIA-NeMo#4146) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Tacotron2 retrain (NVIDIA-NeMo#4103) * fix yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Fix for new TTSDataset class Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * inference fix Signed-off-by: treacker <emshabalin@yandex.ru> * removed old code Signed-off-by: treacker <emshabalin@yandex.ru> * updated parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * reverted version update Signed-off-by: treacker <emshabalin@yandex.ru> * refactored parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * Updated Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Update Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Update tacotron.yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * 
cleaned up TN/ ITN doc (NVIDIA-NeMo#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: treacker <emshabalin@yandex.ru> * Check implicit grad acc in GLUE dataset building (NVIDIA-NeMo#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Fixed jenkins Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * Multiprocess improvements (NVIDIA-NeMo#4127) * initial commit Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * start fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated scp to filelist Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * WaveGlow input type fixes (NVIDIA-NeMo#4151) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * notebooks' link, typo and import fix (NVIDIA-NeMo#4158) * redo missing pr 4007 Signed-off-by: fayejf <fayejf07@gmail.com> * remove extremely unreliable links Signed-off-by: fayejf <fayejf07@gmail.com> * Thutmose tagger bug fixes (NVIDIA-NeMo#4162) * add pretrained ngc model, small fixes Signed-off-by: Alexandra Antonova 
<aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * 1. fix typos. 2. write magic functions without space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add example of inference with pretrained model Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * changed model location to nemo Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * style fix Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * update speaker docs (NVIDIA-NeMo#4164) * update speaker docs Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * chunks -> segments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Khz -> kHz Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * changed to vits g2p * refactoring * added cosineLR * Updated whitelist path * added vanilla torch grad scaler * Fixed lightning version * added warmup and wd * switched to cosineLR * refactored data classes for vits * some fixes * fixed import * changeg train loop * fixed scheduler bug * refactoring for exps * Refactored loss logic * Ref for exps * added coqui stuff * exps * bugfix * added side file * bugfix * reverted * fixed sampler behaviour * updated for ptl 1.7.2 * refactored dataloader func * some cleaning * reverted to vanilla loss * modified for pickling * added dataset class * fixed torch version * added autocast for fp training * removed coqui files * 'Fixed tokenizer' * Fix tokenizer * update branch Signed-off-by: ericharper <complex451@gmail.com> * Fix link to inference notebook (NVIDIA-NeMo#5247) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Update ASR scores table (NVIDIA-NeMo#5254) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: 
smajumdar <titu1994@gmail.com> * Fix links to speaker identification notebook (NVIDIA-NeMo#5260) Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> * Minor typo fixes in TTS tutorial (NVIDIA-NeMo#5266) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Pcla tutorial fixes (NVIDIA-NeMo#5271) * Fixed typos Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed cell type and tatoeba reference Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed typo Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed branch variable Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix bug into Dialogue tutorial (NVIDIA-NeMo#5277) * Typo fix (NVIDIA-NeMo#5288) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix dialogue tutorial bug (NVIDIA-NeMo#5297) * set add_pooling_layer=False for huggingface bert model * remove add_pooling_layer=False and set find_unused_parameters=True * set num_prompt_tokens to 0 for huggingface * small bugfix for r1.13.0 (NVIDIA-NeMo#5310) * typo fix Signed-off-by: fayejf <fayejf07@gmail.com> * udpate transcribe Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: fayejf <fayejf07@gmail.com> * Add italian model checkpoints (NVIDIA-NeMo#5316) Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> * [STT] Add Ru ASR Conformer-CTC and Conformer-Transducer (NVIDIA-NeMo#5340) * [STT] Add stt_ru_conformer_ctc_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [STT] Add stt_ru_conformer_transducer_large Add stt_ru_conformer_transducer_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Sasha 
Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Pcla tutorial fixes (NVIDIA-NeMo#5313) * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * moved `create_text_and_labels` to token_classification_utils.py Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * a lot of refactoring * strict ptl version * strict ptl version * reverted plt version * Added base text2audio class * Fix issue with HF Model upload tutorial (NVIDIA-NeMo#5359) * Add Gradio App to ASR Docs (NVIDIA-NeMo#5270) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> (cherry picked from commit e4b6a38) * Fix issue with normalized config for dataset name Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * tutorial fixes (NVIDIA-NeMo#5354) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Add SDP documentation (NVIDIA-NeMo#5274) * Add details to SDP README.md Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to WriteManifest processor Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to CreateInitialManifestMLS Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ModifyManifestTextProcessor docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ASRInference docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add base_processor docstrings Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add minimal SDP docs page Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Update tools/speech_dataset_processor/README.md Co-authored-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Elena Rastorgueva 
<80532067+erastorgueva-nv@users.noreply.github.com> * Write simple README for SDP and move complex explanations to docs Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove incorrect type hints Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Make config example less confusing Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Fix typo Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Clarify that YAML file is config file in README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove unused imports Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove SDP docs for now Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove links to docs in SDP README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> * [Bugfix] Added rm -f / wget- nc command in multispeaker sim notebook to r1.13.0 (NVIDIA-NeMo#5375) * Fix minor error in notebook Signed-off-by: Taejin Park <tango4j@gmail.com> * changed branch name in tutorial notebook Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Taejin Park <tango4j@gmail.com> * Rename Speech Dataset Processor to Speech Data Processor (NVIDIA-NeMo#5378) Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * fix for num worker 0 causing issues in losses after 1 epoch (NVIDIA-NeMo#5379) * Fixed bug in notebook (NVIDIA-NeMo#5382) Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> * Force MHA QKV onto fp32 (NVIDIA-NeMo#5391) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Added scheduling variety * ref * Fix for prompt table restore error (NVIDIA-NeMo#5393) * Fix for prompt table restore error 
Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added more saftey checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Added more condition checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Virginia Adams <vadams@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Fix args (NVIDIA-NeMo#5410) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * bugfix * import tests * Add temporary fix for CUDA issue in Dockerfile (NVIDIA-NeMo#5421) Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> * Megatron Export Update (NVIDIA-NeMo#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added comment Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * updated del onnx_model place Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad 
<dmosallanezh@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> * disable pc test (NVIDIA-NeMo#5426) Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: ekmb <ebakhturina@nvidia.com> * Fix GPT generation when using sentencepiece tokenizer (NVIDIA-NeMo#5413) * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Disable sync_batch_comm in validation_step for GPT (NVIDIA-NeMo#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <complex451@gmail.com> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Empty Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Comment out test Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> * Revert "Add temporary fix for CUDA issue in Dockerfile (NVIDIA-NeMo#5421)" (NVIDIA-NeMo#5431) This reverts 
commit 0718b17. * Revert workaround for T5 that sets number of workers to 0 & sync_batch_comm=False (NVIDIA-NeMo#5420) * Revert workers workaround Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix in config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Fixed discrepancies * updated Jenkisfile * updated Jenkisfile * Cleaning * fixed the onnx bug in conformer for non-streaming models. (NVIDIA-NeMo#5242) (NVIDIA-NeMo#5446) Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> * Set sync_batch_comm in other places (NVIDIA-NeMo#5448) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Radtts 1.13 (NVIDIA-NeMo#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (NVIDIA-NeMo#5358) * [TTS] add CI test for RADTTS training recipe. 
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Radtts 1.13 plus (NVIDIA-NeMo#5457) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (NVIDIA-NeMo#5358) * Fixing RADTTS training - removing view buffer and fixing accuracy issue * Fixes for Torchscript/Triton * Added autocast to radtts UT * using cuda() for training example Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Add num layers check (NVIDIA-NeMo#5470) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Change to kwargs (NVIDIA-NeMo#5475) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (NVIDIA-NeMo#5339) (NVIDIA-NeMo#5478) * Initial refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for eval Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Refactor Signed-off-by: MaximumEntropy 
<sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove comments Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Minor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove old comment Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (NVIDIA-NeMo#5480) * updated export_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (NVIDIA-NeMo#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * minor bug fix 
(NVIDIA-NeMo#5521) Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added set_start_method + function param bugfix (NVIDIA-NeMo#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <complex451@gmail.com> * remove notebook (NVIDIA-NeMo#5548) Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: ericharper <complex451@gmail.com> * Remove broadcast (NVIDIA-NeMo#5558) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * cleaning * Fix all gather while writing to a file during T5 finetuning (NVIDIA-NeMo#5561) * Gather from data parallel only instead of all ranks Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update readme Signed-off-by: ericharper <complex451@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added copyright * fixed imports * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * cleaning * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fixed filesize check * last cleaning 
Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated cmudict path * fixed merge bug Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * warnings fix * fix warnings Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * storing * updated version Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * update Jenkinsfile versions Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed issues Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed more issues * more fixes Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * added experimental tag * Clarification updates Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * docstring fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * imports fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix typo Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * excessive comtutations fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * typecheck fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Small refactoring * Small refactoring Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * reversed exp_manager params 
Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Fixed call for new function signature Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> Signed-off-by: Jason <jasoli@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: PeganovAnton <peganoff2@mail.ru> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> Co-authored-by: jasonjjl1999 <jasonjjl1999@gmail.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Oktai Tatanov <oktai.tatanov@gmail.com> Co-authored-by: jasonjjl1999 
<43978361+jasonjjl1999@users.noreply.github.com> Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: Jason <jasoli@nvidia.com> Co-authored-by: ericharper <complex451@gmail.com> Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: Ramanathan Arunachalam <ramanathan.arun@rutgers.edu> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Evelina <10428420+ekmb@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: Jocelyn <jocelynh@nvidia.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Sean Naren <snarenthiran@nvidia.com> Co-authored-by: Matvei Novikov <mattyson.so@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> Co-authored-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Taejin Park <tango4j@gmail.com> Co-authored-by: Adi Renduchintala <108822655+arendu@users.noreply.github.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: yaoyu-33 <54727607+yaoyu-33@users.noreply.github.com> Co-authored-by: David <amosalla@asu.edu> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: Yi Dong 
<yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
* Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount * new structure for tts datasets in script folder Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * remove cmudict downloading Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * rename mixertts dataset, add vocoder dataset Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add libritts processing Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * update tts dataset and libritts get data Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix bugs in vocoder ds Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add ds * changed vits yaml * rm yaml * fix yaml and model * Added scaler * refactored yaml * managed to run in fp16 * refactoring Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix small bugs and add new todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix optimizers Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * Port Variational Inference with Adversarial Learning (VITS) to NeMo TTS (NVIDIA-NeMo#6) * Add vits files Add vits_losses.py, vits_modules.py and vits.py. * Move non-vits models to modules * Add vits.yaml * Add _loader to vits.py * Add basic template for vits * Update vits.yaml with vits parameters * Remove extra space * Add top level training script * Add some variables to vits yaml * Add forward and training methods * Fix imports * Added validation step * Log training losses * Update loss calls to use class attributes * Add VITS to models list * Fix all imports * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. 
reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Fix imports for VITS * Remove old module calls * Fix typo in monotonic align import * Modified validation step 1. reverted to tensorboard 2. validation_step logs audio, mel-spec for batch 0 3. validation_step_alt logs audio, mel-spec for batch 0 and loss_mel * Add parameters from original VITS config * Fix config file * Fix imports and generate spec from audio * Fix incorrect dimensions * Progress update * Fix loss * Fix cuda thing * Fix monotonic align import * Fix typos in vits.py * Disable loss typecheck * Fix spectrogram lengths * Remove Precision 16 requirement * Address lgtm alerts * clean up unused code * Address lgtm alerts * Refactor audio_to_mel_torch method * Use NeMo FilterBank to get melspec Todo: set self.fb * Fix filterbank max frequency to match with original VITS * Fix filterbank features correct length * Address lgtm issues * Remove print statements * Remove stft_pad_amount Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Jason <jasoli@nvidia.com> Signed-off-by: Jason <jasoli@nvidia.com> * make new commit Signed-off-by: Jason <jasoli@nvidia.com> * add copyright headers Signed-off-by: Jason <jasoli@nvidia.com> * style Signed-off-by: Jason <jasoli@nvidia.com> * rename README Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * fix style without vits_modules Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * add numba code, fix style and add todos Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> * small fix * fix some todos * added numba mas * added DDP sampler * specified versions * fixed for new librosa version * added feature loss * added IPA phonemizer * refactored IPA g2p * added vits losses * some ref * fix * added checkpointing 
* cp * cfg * merged some 1.8.0 fixes * plt fix * fix logging * fix checkpoint loading * refactored inference * fp32 run * update branch Signed-off-by: ericharper <complex451@gmail.com> * update package info Signed-off-by: ericharper <complex451@gmail.com> * new exp * update branch Signed-off-by: ericharper <complex451@gmail.com> * Restored tests previously disabled for 22.03 base (NVIDIA-NeMo#4109) Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * add augmentation to label models (NVIDIA-NeMo#4113) * add augmentation to label models Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * duration fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Call register_bert_model after assigning self.bert_model variable (NVIDIA-NeMo#4116) Signed-off-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> * Tutorial on ITN with Thutmose tagger and small fixes (NVIDIA-NeMo#4117) * 1. Add tutorial. 2. Move a function to fix import in tutorial. 3. 
Merge multiple spaces into one space in the final output Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fixes for code review Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * Add tutorial to tutorials.rst Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * cleaned up TN/ ITN doc (NVIDIA-NeMo#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Check implicit grad acc in GLUE dataset building (NVIDIA-NeMo#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update the default (NVIDIA-NeMo#4135) Signed-off-by: ekmb <ebakhturina@nvidia.com> * Draft: Fix restoring from checkpoint for case when `model.common_dataset_parameters.label_vocab_dir` is provided (NVIDIA-NeMo#4136) * Fix restoring from checkpoint with label vocab dir Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Add tests for various ways to pass label ids to model Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix typo Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Do not create tmp directory Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix parameter name Signed-off-by: PeganovAnton <peganoff2@mail.ru> * finish cherry-pick op Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix labels errors Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove duplicate stage Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Change target branch Signed-off-by: PeganovAnton <peganoff2@mail.ru> * fix typo (NVIDIA-NeMo#4140) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * 
Fix/punctuation avoid overwritting tmp files (NVIDIA-NeMo#4144) * Add draft of fixing tmp files overwritting Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Remove accidental changes Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Use built-in tempfile library Signed-off-by: PeganovAnton <peganoff2@mail.ru> * Fix code style Signed-off-by: PeganovAnton <peganoff2@mail.ru> * bug_fix_diarization_manifest_creation (NVIDIA-NeMo#4125) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> * fix doc (NVIDIA-NeMo#4146) Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * Tacotron2 retrain (NVIDIA-NeMo#4103) * fix yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Fix for new TTSDataset class Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * added wandb logging Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * fix numpy version Signed-off-by: treacker <emshabalin@yandex.ru> * inference fix Signed-off-by: treacker <emshabalin@yandex.ru> * removed old code Signed-off-by: treacker <emshabalin@yandex.ru> * updated parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * reverted version update Signed-off-by: treacker <emshabalin@yandex.ru> * refactored parser logic Signed-off-by: treacker <emshabalin@yandex.ru> * Updated Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Refactored tutorial for Tacotron2 Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Made backward compatibility Signed-off-by: treacker <emshabalin@yandex.ru> * Update Jenkinsfile Signed-off-by: treacker <emshabalin@yandex.ru> * Update tacotron.yaml Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * 
cleaned up TN/ ITN doc (NVIDIA-NeMo#4119) * cleaned up TN/ ITN doc Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix typo Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> * fix image Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: treacker <emshabalin@yandex.ru> * Check implicit grad acc in GLUE dataset building (NVIDIA-NeMo#4123) * Check implicit grad acc in GLUE dataset building Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix jenkins test for GLUE/XNLI Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Fixed jenkins Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> * Refactoring Signed-off-by: treacker <emshabalin@yandex.ru> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> * Multiprocess improvements (NVIDIA-NeMo#4127) * initial commit Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * start fix Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * improve multiprocessing speed while creating speaker dataset Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * updated scp to filelist Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * WaveGlow input type fixes (NVIDIA-NeMo#4151) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * notebooks' link, typo and import fix (NVIDIA-NeMo#4158) * redo missing pr 4007 Signed-off-by: fayejf <fayejf07@gmail.com> * remove extremely unreliable links Signed-off-by: fayejf <fayejf07@gmail.com> * Thutmose tagger bug fixes (NVIDIA-NeMo#4162) * add pretrained ngc model, small fixes Signed-off-by: Alexandra Antonova 
<aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix model location Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * 1. fix typos. 2. write magic functions without space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * add example of inference with pretrained model Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * changed model location to nemo Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * style fix Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> * fix space Signed-off-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> * update speaker docs (NVIDIA-NeMo#4164) * update speaker docs Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * chunks -> segments Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * Khz -> kHz Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> * changed to vits g2p * refactoring * added cosineLR * Updated whitelist path * added vanilla torch grad scaler * Fixed lightning version * added warmup and wd * switched to cosineLR * refactored data classes for vits * some fixes * fixed import * changeg train loop * fixed scheduler bug * refactoring for exps * Refactored loss logic * Ref for exps * added coqui stuff * exps * bugfix * added side file * bugfix * reverted * fixed sampler behaviour * updated for ptl 1.7.2 * refactored dataloader func * some cleaning * reverted to vanilla loss * modified for pickling * added dataset class * fixed torch version * added autocast for fp training * removed coqui files * 'Fixed tokenizer' * Fix tokenizer * update branch Signed-off-by: ericharper <complex451@gmail.com> * Fix link to inference notebook (NVIDIA-NeMo#5247) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Update ASR scores table (NVIDIA-NeMo#5254) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: 
smajumdar <titu1994@gmail.com> * Fix links to speaker identification notebook (NVIDIA-NeMo#5260) Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> * Minor typo fixes in TTS tutorial (NVIDIA-NeMo#5266) Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> * Pcla tutorial fixes (NVIDIA-NeMo#5271) * Fixed typos Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed cell type and tatoeba reference Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed typo Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fixed branch variable Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix bug into Dialogue tutorial (NVIDIA-NeMo#5277) * Typo fix (NVIDIA-NeMo#5288) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Fix dialogue tutorial bug (NVIDIA-NeMo#5297) * set add_pooling_layer=False for huggingface bert model * remove add_pooling_layer=False and set find_unused_parameters=True * set num_prompt_tokens to 0 for huggingface * small bugfix for r1.13.0 (NVIDIA-NeMo#5310) * typo fix Signed-off-by: fayejf <fayejf07@gmail.com> * udpate transcribe Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: fayejf <fayejf07@gmail.com> * Add italian model checkpoints (NVIDIA-NeMo#5316) Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> * [STT] Add Ru ASR Conformer-CTC and Conformer-Transducer (NVIDIA-NeMo#5340) * [STT] Add stt_ru_conformer_ctc_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [STT] Add stt_ru_conformer_transducer_large Add stt_ru_conformer_transducer_large Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Sasha 
Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Pcla tutorial fixes (NVIDIA-NeMo#5313) * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * fixes Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * moved `create_text_and_labels` to token_classification_utils.py Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * a lot of refactoring * strict ptl version * strict ptl version * reverted plt version * Added base text2audio class * Fix issue with HF Model upload tutorial (NVIDIA-NeMo#5359) * Add Gradio App to ASR Docs (NVIDIA-NeMo#5270) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> (cherry picked from commit e4b6a38) * Fix issue with normalized config for dataset name Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * tutorial fixes (NVIDIA-NeMo#5354) Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> * Add SDP documentation (NVIDIA-NeMo#5274) * Add details to SDP README.md Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to WriteManifest processor Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add docstring to CreateInitialManifestMLS Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ModifyManifestTextProcessor docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add ASRInference docstring Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add base_processor docstrings Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Add minimal SDP docs page Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Update tools/speech_dataset_processor/README.md Co-authored-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Elena Rastorgueva 
<80532067+erastorgueva-nv@users.noreply.github.com> * Write simple README for SDP and move complex explanations to docs Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove incorrect type hints Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Make config example less confusing Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Fix typo Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Clarify that YAML file is config file in README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove unused imports Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove SDP docs for now Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * Remove links to docs in SDP README Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> * [Bugfix] Added rm -f / wget- nc command in multispeaker sim notebook to r1.13.0 (NVIDIA-NeMo#5375) * Fix minor error in notebook Signed-off-by: Taejin Park <tango4j@gmail.com> * changed branch name in tutorial notebook Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Taejin Park <tango4j@gmail.com> * Rename Speech Dataset Processor to Speech Data Processor (NVIDIA-NeMo#5378) Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> * fix for num worker 0 causing issues in losses after 1 epoch (NVIDIA-NeMo#5379) * Fixed bug in notebook (NVIDIA-NeMo#5382) Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> * Force MHA QKV onto fp32 (NVIDIA-NeMo#5391) Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> * Added scheduling variety * ref * Fix for prompt table restore error (NVIDIA-NeMo#5393) * Fix for prompt table restore error 
Signed-off-by: Virginia Adams <vadams@nvidia.com> * Added more saftey checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Added more condition checks Signed-off-by: Virginia Adams <vadams@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Virginia Adams <vadams@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Fix args (NVIDIA-NeMo#5410) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * bugfix * import tests * Add temporary fix for CUDA issue in Dockerfile (NVIDIA-NeMo#5421) Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> * Megatron Export Update (NVIDIA-NeMo#5343) * export update for Megatron + change ORT optimization Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated export_utils to use autocast instead of manually casting >:/ Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * removed dtype from LayerNorm Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added comment Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * reverting changes on FloatCast Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * Cherry-picked changes from megatron-norm Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * updated asr_model import to cast_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * updated del onnx_model place Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * changed ort optimization to basic -> temp fix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad 
<dmosallanezh@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> * disable pc test (NVIDIA-NeMo#5426) Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: ekmb <ebakhturina@nvidia.com> * Fix GPT generation when using sentencepiece tokenizer (NVIDIA-NeMo#5413) * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Yi Dong <yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Disable sync_batch_comm in validation_step for GPT (NVIDIA-NeMo#5397) * disable sync_batch_comm in validation_step Signed-off-by: ericharper <complex451@gmail.com> * Read sync_batch_comm from config or default to False Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Update megatron_gpt_config to default sync_batch_comm to False to avoid CUDA error Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> * Empty Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Comment out test Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> * Revert "Add temporary fix for CUDA issue in Dockerfile (NVIDIA-NeMo#5421)" (NVIDIA-NeMo#5431) This reverts 
commit 0718b17. * Revert workaround for T5 that sets number of workers to 0 & sync_batch_comm=False (NVIDIA-NeMo#5420) * Revert workers workaround Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix in config Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Fixed discrepancies * updated Jenkisfile * updated Jenkisfile * Cleaning * fixed the onnx bug in conformer for non-streaming models. (NVIDIA-NeMo#5242) (NVIDIA-NeMo#5446) Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> * Set sync_batch_comm in other places (NVIDIA-NeMo#5448) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Radtts 1.13 (NVIDIA-NeMo#5451) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (NVIDIA-NeMo#5358) * [TTS] add CI test for RADTTS training recipe. 
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Radtts 1.13 plus (NVIDIA-NeMo#5457) * [TTS] Fixing RADTTS training - removing view buffer and fixing accuracy issue (NVIDIA-NeMo#5358) * Fixing RADTTS training - removing view buffer and fixing accuracy issue * Fixes for Torchscript/Triton * Added autocast to radtts UT * using cuda() for training example Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> * Add num layers check (NVIDIA-NeMo#5470) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Change to kwargs (NVIDIA-NeMo#5475) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Support for finetuning and finetuning inference with .ckpt files & batch size refactoring (NVIDIA-NeMo#5339) (NVIDIA-NeMo#5478) * Initial refactor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Resolve config before passing to load_from_checkpoint Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for model parallel and nemo restore Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fixes for eval Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Revert config changes Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Refactor Signed-off-by: MaximumEntropy 
<sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix typo Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove comments Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Minor Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix validation reconfiguration Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Remove old comment Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fixes for test_ds Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * export_utils bugfix (NVIDIA-NeMo#5480) * updated export_utils Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * Export fixes for Riva (NVIDIA-NeMo#5496) * Export fixes for Riva Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * Cleaning up training_utils Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> * minor bug fix 
(NVIDIA-NeMo#5521) Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> * added set_start_method + function param bugfix (NVIDIA-NeMo#5539) * added set_start_method + function param bugfix Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * upper bound torchmetrics Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: ericharper <complex451@gmail.com> * remove notebook (NVIDIA-NeMo#5548) Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: ericharper <complex451@gmail.com> * Remove broadcast (NVIDIA-NeMo#5558) Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * cleaning * Fix all gather while writing to a file during T5 finetuning (NVIDIA-NeMo#5561) * Gather from data parallel only instead of all ranks Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * Fix Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> * update readme Signed-off-by: ericharper <complex451@gmail.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added copyright * fixed imports * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * cleaning * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fixed filesize check * last cleaning 
Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * updated cmudict path * fixed merge bug Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * warnings fix * fix warnings Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * storing * updated version Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * update Jenkinsfile versions Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed issues Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * fixed more issues * more fixes Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * added experimental tag * Clarification updates Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * remove old cython code Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * docstring fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Enhancements Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * imports fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix typo Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * excessive comtutations fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * typecheck fix Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Small refactoring * Small refactoring Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * reversed exp_manager params 
Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * Fixed call for new function signature Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Oktai Tatanov <oktai.tatanov@gmail.com> Signed-off-by: Jason <jasoli@nvidia.com> Signed-off-by: ericharper <complex451@gmail.com> Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com> Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com> Signed-off-by: Yang Zhang <yangzhang@nvidia.com> Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca> Signed-off-by: ekmb <ebakhturina@nvidia.com> Signed-off-by: PeganovAnton <peganoff2@mail.ru> Signed-off-by: Jocelyn Huang <jocelynh@nvidia.com> Signed-off-by: fayejf <fayejf07@gmail.com> Signed-off-by: smajumdar <titu1994@gmail.com> Signed-off-by: SeanNaren <snarenthiran@nvidia.com> Signed-off-by: Matvei Novikov <mattyson.so@gmail.com> Signed-off-by: Igor Gitman <igitman@nvidia.com> Signed-off-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Signed-off-by: Elena Rastorgueva <erastorgueva@nvidia.com> Signed-off-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Signed-off-by: Taejin Park <tango4j@gmail.com> Signed-off-by: Virginia Adams <vadams@nvidia.com> Signed-off-by: Yu Yao <yuya@nvidia.com> Signed-off-by: David Mosallanezhad <dmosallanezh@nvidia.com> Signed-off-by: Markel Sanz Ausin <markelsanz14@gmail.com> Signed-off-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Signed-off-by: Vahid <vnoroozi@nvidia.com> Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> Signed-off-by: Evgeniy Shabalin <baah1999@yandex.ru> Co-authored-by: jasonjjl1999 <jasonjjl1999@gmail.com> Co-authored-by: richa.ren@mail.utoronto.ca <richa.ren@mail.utoronto.ca> Co-authored-by: Oktai Tatanov <oktai.tatanov@gmail.com> Co-authored-by: jasonjjl1999 
<43978361+jasonjjl1999@users.noreply.github.com> Co-authored-by: martynwei <martyn.wei@gmail.com> Co-authored-by: Ryan Hong <66425733+rhong99@users.noreply.github.com> Co-authored-by: Jason <jasoli@nvidia.com> Co-authored-by: ericharper <complex451@gmail.com> Co-authored-by: Boris Fomitchev <borisfom@users.noreply.github.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com> Co-authored-by: Ramanathan Arunachalam <ramanathan.arun@rutgers.edu> Co-authored-by: Ramanathan Arunachalam <rarunachalam@nvidia.com> Co-authored-by: bene-ges <61418381+bene-ges@users.noreply.github.com> Co-authored-by: Alexandra Antonova <aleksandraa@nvidia.com> Co-authored-by: Yang Zhang <yzhang123@users.noreply.github.com> Co-authored-by: Sandeep Subramanian <sandeep.subramanian.1@umontreal.ca> Co-authored-by: Evelina <10428420+ekmb@users.noreply.github.com> Co-authored-by: PeganovAnton <peganoff2@mail.ru> Co-authored-by: Jocelyn <jocelynh@nvidia.com> Co-authored-by: fayejf <36722593+fayejf@users.noreply.github.com> Co-authored-by: Somshubra Majumdar <titu1994@gmail.com> Co-authored-by: Sean Naren <snarenthiran@nvidia.com> Co-authored-by: Matvei Novikov <mattyson.so@gmail.com> Co-authored-by: Zhilin Wang <wangzhilin12061996@hotmail.com> Co-authored-by: Igor Gitman <igitman@nvidia.com> Co-authored-by: Sasha Meister <117230141+ssh-meister@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Elena Rastorgueva <80532067+erastorgueva-nv@users.noreply.github.com> Co-authored-by: Taejin Park <tango4j@gmail.com> Co-authored-by: Adi Renduchintala <108822655+arendu@users.noreply.github.com> Co-authored-by: Virginia Adams <78445382+vadam5@users.noreply.github.com> Co-authored-by: yaoyu-33 <54727607+yaoyu-33@users.noreply.github.com> Co-authored-by: David <amosalla@asu.edu> Co-authored-by: David Mosallanezhad <dmosallanezh@nvidia.com> Co-authored-by: Boris Fomitchev <bfomitchev@nvidia.com> Co-authored-by: Yi Dong 
<yidong@nvidia.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@users.noreply.github.com> Co-authored-by: Markel Sanz Ausin <markelsanz14@gmail.com> Co-authored-by: Oleksii Kuchaiev <okuchaiev@nvidia.com> Co-authored-by: Vladimir Bataev <vbataev@nvidia.com> Co-authored-by: Vahid Noroozi <VahidooX@users.noreply.github.com> Co-authored-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
What does this PR do ?
Improves multiprocessing speed by reading each audio file only once, and removes the `@typecheck()` decorator from the `forward_for_export` function in label_models.py.
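To illustrate the "read once" idea, here is a minimal sketch; it is not the code from this PR. It assumes uncompressed WAV input and uses the standard-library `wave` module in place of whatever audio reader the dataset script actually uses. The names `read_wav`, `split_into_segments`, and `segment_files` are hypothetical. Each worker loads a file into memory a single time and slices every segment from that buffer, instead of reopening the file once per segment.

```python
import wave
from multiprocessing import Pool


def read_wav(path):
    """Read the whole file in one pass; return (raw_bytes, rate, sample_width, channels)."""
    with wave.open(path, "rb") as wf:
        frames = wf.readframes(wf.getnframes())
        return frames, wf.getframerate(), wf.getsampwidth(), wf.getnchannels()


def split_into_segments(path, segment_seconds=1.0):
    """Slice one file into fixed-length segments from a single in-memory read."""
    frames, rate, width, channels = read_wav(path)  # the only disk read for this file
    step = int(segment_seconds * rate) * width * channels  # bytes per segment
    if step <= 0:
        raise ValueError("segment_seconds is too small for this sample rate")
    # The last segment may be shorter than the others.
    return [frames[i:i + step] for i in range(0, len(frames), step)]


def segment_files(paths, workers=4):
    """Process files in parallel, one file per task, so no file is opened twice."""
    with Pool(workers) as pool:
        return pool.map(split_into_segments, paths)
```

On platforms that start workers with `spawn`, call `segment_files` from under an `if __name__ == "__main__":` guard. Parallelizing per file rather than per segment is the design choice that keeps the read count at one per file.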
Collection: ASR
PR Type: