Magpietts 2508 merge main #81
Merged
blisc merged 94 commits into blisc:magpietts_2508_test_merge from Oct 10, 2025
* Support QwenVL for inference engine * Apply isort and black reformatting Signed-off-by: meatybobby <meatybobby@users.noreply.github.com> * Remove comment out * Reformat * Skip pylint check * Add unit tests * Apply isort and black reformatting Signed-off-by: meatybobby <meatybobby@users.noreply.github.com> --------- Signed-off-by: meatybobby <meatybobby@users.noreply.github.com> Co-authored-by: meatybobby <meatybobby@users.noreply.github.com>
* Fix sequence packing loss calculation Signed-off-by: Rayan Dasoriya <dasoriyarayan@gmail.com> * Fix nemo2 path Signed-off-by: Rayan Dasoriya <dasoriyarayan@gmail.com> * Skip pylint Signed-off-by: Rayan Dasoriya <dasoriyarayan@gmail.com> --------- Signed-off-by: Rayan Dasoriya <dasoriyarayan@gmail.com> Co-authored-by: Dmytro Pykhtar <37850217+dimapihtar@users.noreply.github.com>
* [Audio]: added streaming mode to SpectrogramToAudio Signed-off-by: Rauf <rnasretdinov@nvidia.com> * added time buffer Signed-off-by: Rauf <rnasretdinov@nvidia.com> * renamed Nf -> num_frames Signed-off-by: Rauf <rnasretdinov@nvidia.com> * added AudioToSpectrogram and scale and magnitude power Signed-off-by: Rauf <rnasretdinov@nvidia.com> * added multiple chunking support Signed-off-by: Rauf <rnasretdinov@nvidia.com> * added properties _stream_initialized, _eps, got rid of _prev_spec_frame Signed-off-by: Rauf <rnasretdinov@nvidia.com> * added hanning window Signed-off-by: Rauf <rnasretdinov@nvidia.com> * Apply isort and black reformatting Signed-off-by: nasretdinovr <nasretdinovr@users.noreply.github.com> * added a docstring regarding streaming istft mode Signed-off-by: Rauf <rnasretdinov@nvidia.com> --------- Signed-off-by: Rauf <rnasretdinov@nvidia.com> Signed-off-by: nasretdinovr <nasretdinovr@users.noreply.github.com> Co-authored-by: nasretdinovr <nasretdinovr@users.noreply.github.com>
…rs (#14514) * Update evo2 defaults so converted checkpoints have the right parameters Signed-off-by: John St John <jstjohn@nvidia.com> * Fix line too long issue Signed-off-by: John St John <jstjohn@nvidia.com> * Fix expected changes to configs that are locked into our tests Signed-off-by: John St John <jstjohn@nvidia.com> --------- Signed-off-by: John St John <jstjohn@nvidia.com>
Signed-off-by: dimapihtar <dpihtar@gmail.com>
Signed-off-by: Malay Nagda <malayn@nvidia.com>
…t_store flags (#14522) * Add use te activation func and save act input in fp8 flags Signed-off-by: Guyue Huang <guyueh@nvidia.com> * Fix field name Signed-off-by: Guyue Huang <guyueh@nvidia.com> * Update scripts/performance/vlm/finetune_qwen25vl_32b.py Co-authored-by: malay-nagda <malayn@nvidia.com> Signed-off-by: Guyue Huang <140554423+guyueh1@users.noreply.github.com> --------- Signed-off-by: Guyue Huang <guyueh@nvidia.com> Signed-off-by: Guyue Huang <140554423+guyueh1@users.noreply.github.com> Co-authored-by: malay-nagda <malayn@nvidia.com>
* Bump TE and Mcore Signed-off-by: Charlie Truong <chtruong@nvidia.com> * Use Mcore 69b65 Signed-off-by: Charlie Truong <chtruong@nvidia.com> --------- Signed-off-by: Charlie Truong <chtruong@nvidia.com>
* remove sync in logging Signed-off-by: qiyuw <qiyuw@nvidia.com> * Apply isort and black reformatting Signed-off-by: WanZzzzzz <WanZzzzzz@users.noreply.github.com> * add class and func docstrings in data_sampler.py for pylint Signed-off-by: qiyuw <qiyuw@nvidia.com> * Apply isort and black reformatting Signed-off-by: WanZzzzzz <WanZzzzzz@users.noreply.github.com> --------- Signed-off-by: qiyuw <qiyuw@nvidia.com> Signed-off-by: WanZzzzzz <WanZzzzzz@users.noreply.github.com> Co-authored-by: qiyuw <qiyuw@nvidia.com> Co-authored-by: WanZzzzzz <WanZzzzzz@users.noreply.github.com>
* add 1b arclongcontextconfig Signed-off-by: Farhad Ramezanghorbani <farhadr@nvidia.com> * fix device mess Signed-off-by: Farhad Ramezanghorbani <farhadr@nvidia.com> * add implicit_filter support Signed-off-by: Farhad Ramezanghorbani <farhadr@nvidia.com> * use padded input Signed-off-by: Farhad Ramezanghorbani <farhadr@nvidia.com> * Apply isort and black reformatting Signed-off-by: farhadrgh <farhadrgh@users.noreply.github.com> * Revert "add 1b arclongcontextconfig" This reverts commit 029969b. --------- Signed-off-by: Farhad Ramezanghorbani <farhadr@nvidia.com> Signed-off-by: farhadrgh <farhadrgh@users.noreply.github.com>
* fix gemma2 27b kv dimension Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com> * fix gemma2 27b kv dimension Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com> --------- Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com>
* feat: print expert groups on megatron init (#13874) Signed-off-by: Alexander Zhipa <azzhipa@amazon.com> Co-authored-by: Alexander Zhipa <azzhipa@amazon.com> Signed-off-by: CarlosGomes98 <carlosmiguel.gomes@live.com.pt> * set a different seed for each dp rank Signed-off-by: CarlosGomes98 <carlosmiguel.gomes@live.com.pt> * calculate loss inside autocast Signed-off-by: CarlosGomes98 <carlosmiguel.gomes@live.com.pt> * disable per token loss, grad acc fusion Signed-off-by: CarlosGomes98 <carlosmiguel.gomes@live.com.pt> * add missing self.seed Signed-off-by: CarlosGomes98 <carlosmiguel.gomes@live.com.pt> * black formatting Signed-off-by: CarlosGomes98 <carlosmiguel.gomes@live.com.pt> * Apply isort and black reformatting Signed-off-by: gautham-kollu <gautham-kollu@users.noreply.github.com> --------- Signed-off-by: Alexander Zhipa <azzhipa@amazon.com> Signed-off-by: CarlosGomes98 <carlosmiguel.gomes@live.com.pt> Signed-off-by: gautham-kollu <gautham-kollu@users.noreply.github.com> Co-authored-by: Alexander Zhipa <alex.zhipa@proton.me> Co-authored-by: Alexander Zhipa <azzhipa@amazon.com> Co-authored-by: gautham-kollu <gkollu@nvidia.com> Co-authored-by: gautham-kollu <gautham-kollu@users.noreply.github.com>
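A note on "set a different seed for each dp rank": the commit message does not say how the seed is derived, so the following is only a minimal sketch of one common approach, offsetting a base seed by the data-parallel rank. The names `seed_for_dp_rank` and `make_rank_rng` are hypothetical and are not taken from the NeMo code.

```python
import random


def seed_for_dp_rank(base_seed: int, dp_rank: int) -> int:
    """Derive a per-rank seed (assumed scheme: base seed plus rank index)."""
    if dp_rank < 0:
        raise ValueError("dp_rank must be non-negative")
    return base_seed + dp_rank


def make_rank_rng(base_seed: int, dp_rank: int) -> random.Random:
    """Build an independent RNG for one data-parallel rank."""
    return random.Random(seed_for_dp_rank(base_seed, dp_rank))


if __name__ == "__main__":
    # Each rank gets its own stream; re-running a rank reproduces its stream.
    for rank in range(4):
        rng = make_rank_rng(1234, rank)
        print(rank, seed_for_dp_rank(1234, rank), rng.random())
```

With this scheme, ranks draw different random streams (for example for dropout or data augmentation), while any single rank stays reproducible across runs.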
* [Flux] Add MXFP8 support. Signed-off-by: Wil Kong <alpha0422@gmail.com> * [Flux] Add current and block scaling. Signed-off-by: Wil Kong <alpha0422@gmail.com> --------- Signed-off-by: Wil Kong <alpha0422@gmail.com>
Signed-off-by: Ao Tang <aot@nvidia.com>
…li triplet dataset with NeMo Framework (#14584) * Create E2E-Embedding-Finetuning Signed-off-by: Hemant Giri <30834697+girihemant19@users.noreply.github.com> * Update E2E-Embedding-Finetuning Signed-off-by: Hemant Giri <30834697+girihemant19@users.noreply.github.com> * Delete tutorials/llm/embedding/E2E-Embedding-Finetuning Signed-off-by: Hemant Giri <30834697+girihemant19@users.noreply.github.com> * Create README.md Signed-off-by: Hemant Giri <30834697+girihemant19@users.noreply.github.com> * Add files via upload Signed-off-by: Hemant Giri <30834697+girihemant19@users.noreply.github.com> * Add files via upload This is a notebook for E2E finetuning a embedding model Signed-off-by: Hemant Giri <30834697+girihemant19@users.noreply.github.com> * Update README.md Signed-off-by: Hemant Giri <30834697+girihemant19@users.noreply.github.com> * Update README.md Signed-off-by: Hemant Giri <30834697+girihemant19@users.noreply.github.com> * Delete tutorials/llm/embedding/E2E-Embedding-Finetuning/download_dataset.py Signed-off-by: Hemant Giri <30834697+girihemant19@users.noreply.github.com> * Delete tutorials/llm/embedding/E2E-Embedding-Finetuning/finetune_e5.py Signed-off-by: Hemant Giri <30834697+girihemant19@users.noreply.github.com> * Delete tutorials/llm/embedding/E2E-Embedding-Finetuning/finetune_llama1b.py Signed-off-by: Hemant Giri <30834697+girihemant19@users.noreply.github.com> * Delete tutorials/llm/embedding/E2E-Embedding-Finetuning/import_e5_large.py Signed-off-by: Hemant Giri <30834697+girihemant19@users.noreply.github.com> * Delete tutorials/llm/embedding/E2E-Embedding-Finetuning/import_llama1b.py Signed-off-by: Hemant Giri <30834697+girihemant19@users.noreply.github.com> --------- Signed-off-by: Hemant Giri <30834697+girihemant19@users.noreply.github.com> Co-authored-by: Ao Tang <aot@nvidia.com>
Signed-off-by: Guyue Huang <guyueh@nvidia.com>
Signed-off-by: dimapihtar <dpihtar@gmail.com>
Signed-off-by: jenchen13 <jennifchen@nvidia.com>
Signed-off-by: Chen Cui <chcui@nvidia.com>
Signed-off-by: Rauf <rnasretdinov@nvidia.com>
Signed-off-by: Chen Cui <chcui@nvidia.com>
Signed-off-by: Charlie Truong <chtruong@nvidia.com>
* fix flux seed as optional Signed-off-by: Ao Tang <aot@nvidia.com> * fix fluxcontrolnet Signed-off-by: Ao Tang <aot@nvidia.com> * Fix code checkout during test Signed-off-by: Charlie Truong <chtruong@nvidia.com> --------- Signed-off-by: Ao Tang <aot@nvidia.com> Signed-off-by: Charlie Truong <chtruong@nvidia.com> Co-authored-by: Charlie Truong <chtruong@nvidia.com>
Signed-off-by: Jason <jasoli@nvidia.com>
* Remove PEFT scheme condition from recipe Signed-off-by: Ali Taghibakhshi <71892896+JRD971000@users.noreply.github.com> * remove unnecessary peft conditioning 12b --------- Signed-off-by: Ali Taghibakhshi <71892896+JRD971000@users.noreply.github.com>
* add gpt-oss lora exporter Signed-off-by: Chen Cui <chcui@nvidia.com> * Apply isort and black reformatting Signed-off-by: cuichenx <cuichenx@users.noreply.github.com> * update lora exporter for experts Signed-off-by: Chen Cui <chcui@nvidia.com> * disallow exporting expert lora since nemo implementation is not equivalent to hf Signed-off-by: Chen Cui <chcui@nvidia.com> * linting Signed-off-by: Chen Cui <chcui@nvidia.com> * Apply isort and black reformatting Signed-off-by: cuichenx <cuichenx@users.noreply.github.com> * address comment Signed-off-by: Chen Cui <chcui@nvidia.com> --------- Signed-off-by: Chen Cui <chcui@nvidia.com> Signed-off-by: cuichenx <cuichenx@users.noreply.github.com> Co-authored-by: cuichenx <cuichenx@users.noreply.github.com> Co-authored-by: Charlie Truong <chtruong@nvidia.com>
* update streaming ASR Signed-off-by: stevehuang52 <heh@nvidia.com> * add voice agent Signed-off-by: stevehuang52 <heh@nvidia.com> * update readme Signed-off-by: stevehuang52 <heh@nvidia.com> * update websocket Signed-off-by: stevehuang52 <heh@nvidia.com> * update Signed-off-by: stevehuang52 <heh@nvidia.com> * update Signed-off-by: stevehuang52 <heh@nvidia.com> * update readme Signed-off-by: stevehuang52 <heh@nvidia.com> * update Signed-off-by: stevehuang52 <heh@nvidia.com> * clean up Signed-off-by: stevehuang52 <heh@nvidia.com> * clean up Signed-off-by: stevehuang52 <heh@nvidia.com> * fix typo Signed-off-by: stevehuang52 <heh@nvidia.com> * fix codeQL Signed-off-by: stevehuang52 <heh@nvidia.com> * update cfg Signed-off-by: stevehuang52 <heh@nvidia.com> * remove unused Signed-off-by: stevehuang52 <heh@nvidia.com> * update readme Signed-off-by: stevehuang52 <heh@nvidia.com> * change default models Signed-off-by: stevehuang52 <heh@nvidia.com> * fix diar diable Signed-off-by: stevehuang52 <heh@nvidia.com> * fix diar diable Signed-off-by: stevehuang52 <heh@nvidia.com> * update ux Signed-off-by: stevehuang52 <heh@nvidia.com> * update tts Signed-off-by: stevehuang52 <heh@nvidia.com> * update readme Signed-off-by: stevehuang52 <heh@nvidia.com> * fix and update Signed-off-by: stevehuang52 <heh@nvidia.com> * fix asr Signed-off-by: stevehuang52 <heh@nvidia.com> * update readmme Signed-off-by: stevehuang52 <heh@nvidia.com> * update doc and llm dtype Signed-off-by: stevehuang52 <heh@nvidia.com> * refactor and add example prompts Signed-off-by: stevehuang52 <heh@nvidia.com> * update readme Signed-off-by: stevehuang52 <heh@nvidia.com> * update readme Signed-off-by: stevehuang52 <heh@nvidia.com> * clean up Signed-off-by: stevehuang52 <heh@nvidia.com> * clean up Signed-off-by: stevehuang52 <heh@nvidia.com> * update info on streaming sortformer Signed-off-by: stevehuang52 <heh@nvidia.com> * move code to 'nemo/agents/voice_agent' Signed-off-by: stevehuang52 <heh@nvidia.com> * update 
doc Signed-off-by: stevehuang52 <heh@nvidia.com> * clean up Signed-off-by: stevehuang52 <heh@nvidia.com> * refactor Signed-off-by: stevehuang52 <heh@nvidia.com> * update doc Signed-off-by: stevehuang52 <heh@nvidia.com> * remove the unnecessary streaming state conversion and import it from sortformer_modules, remove PostProcessingParams Signed-off-by: Weiqing Wang <weiqingw@nvidia.com> * Apply isort and black reformatting Signed-off-by: weiqingw4ng <weiqingw4ng@users.noreply.github.com> * update doc Signed-off-by: stevehuang52 <heh@nvidia.com> * clean up Signed-off-by: stevehuang52 <heh@nvidia.com> * fix for llama-nemotron template, and refactor Signed-off-by: stevehuang52 <heh@nvidia.com> * fix tts separator Signed-off-by: stevehuang52 <heh@nvidia.com> * fix for llama-nemotron Signed-off-by: stevehuang52 <heh@nvidia.com> * update cfg Signed-off-by: stevehuang52 <heh@nvidia.com> * refactor and update doc Signed-off-by: stevehuang52 <heh@nvidia.com> * change default llm to qwen Signed-off-by: stevehuang52 <heh@nvidia.com> * update doc Signed-off-by: stevehuang52 <heh@nvidia.com> --------- Signed-off-by: stevehuang52 <heh@nvidia.com> Signed-off-by: Weiqing Wang <weiqingw@nvidia.com> Signed-off-by: weiqingw4ng <weiqingw4ng@users.noreply.github.com> Co-authored-by: Taejin Park <tango4j@gmail.com> Co-authored-by: Kunal Dhawan <kunaldhawan97@gmail.com> Co-authored-by: Weiqing Wang <weiqingw@nvidia.com> Co-authored-by: weiqingw4ng <weiqingw4ng@users.noreply.github.com>
* replace texterrors with kaldialign for f-score computation Signed-off-by: andrusenkoau <andrusenkoau@gmail.com> * replace texterrors with kaldialign for asr confidence Signed-off-by: andrusenkoau <andrusenkoau@gmail.com> * replace texterrors with kaldialign for ASR_Confidence_Estimation.ipynb Signed-off-by: andrusenkoau <andrusenkoau@gmail.com> * replace texterrors with kaldialign for ASR_Context_Biasing.ipynb Signed-off-by: andrusenkoau <andrusenkoau@gmail.com> * Apply isort and black reformatting Signed-off-by: andrusenkoau <andrusenkoau@users.noreply.github.com> * decrease kaldialign version Signed-off-by: andrusenkoau <andrusenkoau@gmail.com> --------- Signed-off-by: andrusenkoau <andrusenkoau@gmail.com> Signed-off-by: andrusenkoau <andrusenkoau@users.noreply.github.com> Co-authored-by: andrusenkoau <andrusenkoau@users.noreply.github.com>
* Update prune-distill notebooks to Qwen3 + simplify Signed-off-by: Keval Morabia <28916987+kevalmorabia97@users.noreply.github.com> * address comments Signed-off-by: Keval Morabia <28916987+kevalmorabia97@users.noreply.github.com> * Add readme.rst Signed-off-by: Keval Morabia <28916987+kevalmorabia97@users.noreply.github.com> --------- Signed-off-by: Keval Morabia <28916987+kevalmorabia97@users.noreply.github.com>
* add deprecation notice Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * add deprecation notice Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * add deprecation warning Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * remove import Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * move code Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * add more notices Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Apply isort and black reformatting Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> * Remove automodel cicd Signed-off-by: Dong Hyuk Chang <donghyukc@nvidia.com> * Add deprecation notice for Automodel Signed-off-by: Dong Hyuk Chang <donghyukc@nvidia.com> --------- Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> Signed-off-by: Dong Hyuk Chang <donghyukc@nvidia.com> Co-authored-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> Co-authored-by: akoumpa <akoumpa@users.noreply.github.com>
Signed-off-by: Charlie Truong <chtruong@nvidia.com>
Signed-off-by: Chen Cui <chcui@nvidia.com>
Signed-off-by: Aditya Vavre <avavre@nvidia.com>
…rs (#14639) * Add fallback for file copy to handle metadata errors Signed-off-by: vipnydav <vipinydv@google.com> * Add robust_copy for resilient file copy Signed-off-by: vipnydav <vipinydv@google.com> * Apply isort and black reformatting Signed-off-by: vipnydav <vipnydav@users.noreply.github.com> * remove imported Path from test_file.py Signed-off-by: vipnydav <vipinydv@google.com> * Move robust_copy method to util file Signed-off-by: vipnydav <vipinydv@google.com> * Apply isort and black reformatting Signed-off-by: vipnydav <vipnydav@users.noreply.github.com> * Fix lint Signed-off-by: vipnydav <vipinydv@google.com> --------- Signed-off-by: vipnydav <vipinydv@google.com> Signed-off-by: vipnydav <vipnydav@users.noreply.github.com> Co-authored-by: vipnydav <vipnydav@users.noreply.github.com>
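A note on the file-copy fallback above: the commit messages only name a `robust_copy` helper and say it handles metadata errors. The sketch below shows one plausible shape for such a helper using the standard library; the function name `robust_copy_sketch` and its exact behavior are assumptions, not the code merged in this PR.

```python
import shutil
from pathlib import Path


def robust_copy_sketch(src, dst) -> Path:
    """Copy src to dst; if preserving metadata fails, copy contents only.

    Hypothetical illustration, not the NeMo implementation.
    """
    src, dst = Path(src), Path(dst)
    try:
        # copy2 copies the data and then tries to preserve metadata
        # (timestamps, permission bits), which some filesystems reject.
        return Path(shutil.copy2(src, dst))
    except OSError:
        # Fall back to a plain content copy. copyfile needs a file path,
        # so resolve a directory destination to a file inside it.
        if dst.is_dir():
            dst = dst / src.name
        shutil.copyfile(src, dst)
        return dst
```

Catching `OSError` is deliberately broad here, since `PermissionError` and similar metadata failures are subclasses of it; a genuinely missing source file still fails, because the fallback copy raises again.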
* feat: add callback group definition & callback ABC Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: "Zhengjiang Shao" <zshao@nvidia.com> Signed-off-by: Zhengjiang Shao <zshao@nvidia.com> * Apply isort and black reformatting Signed-off-by: PytLab <PytLab@users.noreply.github.com> * feat: insert callback functions of CallbackGroup Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: "Zhengjiang Shao" <zshao@nvidia.com> Signed-off-by: Zhengjiang Shao <zshao@nvidia.com> * Apply isort and black reformatting Signed-off-by: PytLab <PytLab@users.noreply.github.com> * chore: PR test for jiashang Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * feat: use __init_subclass__ to cover all ModelPT subclasses Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: "Zhengjiang Shao" <zshao@nvidia.com> Signed-off-by: Zhengjiang Shao <zshao@nvidia.com> * Apply isort and black reformatting Signed-off-by: PytLab <PytLab@users.noreply.github.com> * feat: Adding metadata config manager poc Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: "Saju Prasad" <sajup@dc2-container-xterm-023.prd.it.nvidia.com> Signed-off-by: Saju Prasad <sajup@dc2-container-xterm-023.prd.it.nvidia.com> * Apply isort and black reformatting Signed-off-by: sajup-oss <sajup-oss@users.noreply.github.com> * feat: revert test changes. 
Signed-off-by: liquor233 <jiashangh@nvidia.com> * fix: Updating metadata attributes Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: sajup <sajup@nvidia.com> * Apply isort and black reformatting Signed-off-by: sajup-oss <sajup-oss@users.noreply.github.com> * fix: Adding OneloggerCallback Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: sajup <sajup@nvidia.com> * fix: Reverting changes in examples/multimodal/speech_llm/modular_audio_gpt_train.py Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: sajup <sajup@nvidia.com> * Apply isort and black reformatting Signed-off-by: sajup-oss <sajup-oss@users.noreply.github.com> * fix: update modular models and megatron GPT models Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * feat: add on_app_start and on_app_end Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * fix: Adding small test example for testing Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: sajup <sajup@nvidia.com> * Apply isort and black reformatting Signed-off-by: sajup-oss <sajup-oss@users.noreply.github.com> * fix: Fixing review comments as discussed with Jiashang Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: "Saju Prasad" <sajup@draco-oci-login-02.cm.cluster> Signed-off-by: Saju Prasad <sajup@draco-oci-login-02.cm.cluster> * Apply isort and black reformatting Signed-off-by: sajup-oss <sajup-oss@users.noreply.github.com> * fix: updating nemo code to v2 Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: sajup <sajup@nvidia.com> * Apply isort and black reformatting Signed-off-by: sajup-oss <sajup-oss@users.noreply.github.com> * fix: updating wandb to get info from env 
Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: sajup <sajup@nvidia.com> * Apply isort and black reformatting Signed-off-by: sajup-oss <sajup-oss@users.noreply.github.com> * fix: fix som impl issue Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * fix: fix issue for exp manager. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * feat: remove callback_group Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com> * feat: fix timingtracker issue Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * feat: fix for startup callbcaks Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * feat: change to adapter Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * feat: use new nv-one-logger Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com> * feat: add on_app_end Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com> * Apply isort and black reformatting 
Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * feat: make OneLogger configurable Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * feat: remove NeMocallback import Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com> * feat: fix the enable_onelogger setting. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * feat: clean the code. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com> * feat: enable onelogger Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * test: Adding few unit tests Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: "Saju Prasad" <sajup@cw-dfw-cs-001-vscode-01.cm.cluster> Signed-off-by: Saju Prasad <sajup@cw-dfw-cs-001-vscode-01.cm.cluster> * Apply isort and black reformatting Signed-off-by: sajup-oss <sajup-oss@users.noreply.github.com> * feat: tmp fix for functional testing. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * fix: add on_app_end for NeMov2 Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com> * fix: typo. 
Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * fix: fix the get attributes Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com> * fix: moving test test_meta_info_manager.py to tests/collections/common/ Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: "Saju Prasad" <sajup@cw-dfw-cs-001-vscode-01.cm.cluster> Signed-off-by: Saju Prasad <sajup@cw-dfw-cs-001-vscode-01.cm.cluster> * fix: fix format issue. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * fix: fix lint errors Signed-off-by: liquor233 <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * Revert "Apply isort and black reformatting" This reverts commit de6994d. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * Revert "fix: fix lint errors" This reverts commit 8e47ecd. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: fix linting issues. 
Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * fix: fix linting issue Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * fix: add copyright info Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * fix: small fix. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com> * fix: fix small issues for t5 Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com> * fix: fix dataloader issue. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com> * fix: remove dataloader setting. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com> * feat: update OneLogger. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com> * fix: fix hydra runner. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * fix: start using partial config. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * fix: fix the unused variables Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: change get_one_logger name Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: code clean up. 
Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * fix: import more specific to avoid circular dependency. (#14306) Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: Peiyuan <qipeiyuan@outlook.com> * fix: use ptl callback from ls Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * feat: fix meta info manager. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com> * fix: fix meta data issue. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * fix: fix the lint issue Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: fix the unit tests. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: fix minor metadata issue. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * fix: fix some test issues Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: fix pytest issue for meta info manager Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: fix lint issues for optimizers. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: fix pytest issues. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * fix: fix CICD issues. 
Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: fix all pytests Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * chore: fix lint Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * chore: fix unused import issues. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * chore: fix CICD issues. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * fix: fix the CICD issues. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * fix: fix the linting issue Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: fix CICD issues. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: fix the circular import issue. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * fix: fix some pytests. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: revert some change. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: error handling for init onelogger Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * chore: fix one_logger code. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * chore: remove unused vars. 
Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: fix CICD for nemo Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * chore: fix NeMo CICD. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * chore: renaming onelogger Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * chore: fix some exception. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * chore: renaming. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * chore: resolve some comments. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * chore: remove duplicate init. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * chore: resolve some github comments. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * chore: fix the linting issue. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * chore(callbacks): restore generic CallbackGroup and route telemetry v… (#14628) * chore(callbacks): restore generic CallbackGroup and route telemetry via group
- Add BaseCallback and CallbackGroup with update_config and class init hook
- Register OneLoggerAdapterCallback into group; merge config update into class
- Replace direct OneLogger API usages with CallbackGroup across code
- Ensure trainer attaches registered callbacks via group.update_config
- Add nv-one-logger>=2.0.0 to base requirements
Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> * chore: renaming. 
* chore: revert the change to install nv-one-logger * chore: fix the linting issue Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> --------- Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Co-authored-by: liquor233 <liquor233@users.noreply.github.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com> * Add tests for callback group (#14632) * chore: fix some circular dependency issues. * chore: move the files to utils. * chore: add unit tests * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> * chore: fix nv-one-logger tests * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> * chore: fix lint issue. * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> * chore: change the location. * chore: remaining fix. * chore: remaining changes. * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> * chore: fix the tests * chore: fix some lint. * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> * Revert prompt_encoder.py to c5ef26c (Jason Wang) to undo auto-formatting * pre-commit: exclude prompt_encoder.py from black/isort formatting * chore: undo lasst commit. * fix: fix some part for nemocallback. 
* Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> * chore: fix some pytest * fix: verify the auto-hooked functions are called once Signed-off-by: Zhengjiang Shao <zshao@nvidia.com> --------- Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: Zhengjiang Shao <zshao@nvidia.com> Co-authored-by: liquor233 <liquor233@users.noreply.github.com> Co-authored-by: Zhengjiang Shao <zshao@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com> * fix: fix the double init issue Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> * fix: fix the push Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * Guarantee one logger on_app_end calls (#14691) * fix: guarantee on_app_end calls can be invoked finally Signed-off-by: Zhengjiang Shao <zshao@nvidia.com> * feat: add context manager creator in CallbackGroup * Revert "feat: add context manager creator in CallbackGroup" This reverts commit 381f83d. Signed-off-by: Zhengjiang Shao <zshao@nvidia.com> --------- Signed-off-by: Zhengjiang Shao <zshao@nvidia.com> * fix: remove meta info manager (#14689) * fix: remove meta info manager Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> --------- Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Co-authored-by: liquor233 <liquor233@users.noreply.github.com> * fix: fix some linting issues. * fix: fix unit tests. * chore: fix mcore Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: fix the installing problem Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: fix requirements Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: fix the mcore version. 
Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: fix the mcore version. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: fix the mcore version. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: fix the mcore version. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: fix the mcore version. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: fix the mcore version. Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: use correct global_step for async ckpt success event Signed-off-by: Zhengjiang Shao <zshao@nvidia.com> * fix: fix unit tests Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: fix requirements Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: refactor the unit tests Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: insert callbacks in CallbackGroup before other PTL callbacks Signed-off-by: Zhengjiang Shao <zshao@nvidia.com> * fix: fix call on app start flag Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> * fix: fix unit tests Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: bump nv-one-logger version Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: fix the unit tests Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> * fix: fix the cicd issues. 
* Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> * fix: fix some lint issues Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: fix unused import Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: make oneloggernemocallback singleton Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: fix lint issues Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: make oneloggernemocallback singleton * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> * fix: keep the original callbacks order in CallbackGroup when merging with trainer.callbacks * fix: fix the unit tests Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: fix unit tests Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> * fix: fix lint issues Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: fix the pickle issue. * Apply isort and black reformatting Signed-off-by: liquor233 <liquor233@users.noreply.github.com> * fix: fix issue. 
* fix: fix callback Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> * fix: fix callback group Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> --------- Signed-off-by: Zhengjiang Shao <zshao@nvidia.com> Signed-off-by: PytLab <PytLab@users.noreply.github.com> Signed-off-by: Jiashang Hu <jiashangh@nvidia.com> Signed-off-by: Saju Prasad <sajup@dc2-container-xterm-023.prd.it.nvidia.com> Signed-off-by: sajup-oss <sajup-oss@users.noreply.github.com> Signed-off-by: liquor233 <jiashangh@nvidia.com> Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: sajup <sajup@nvidia.com> Signed-off-by: sajup <sajup@nvidia.com> Signed-off-by: Saju Prasad <sajup@draco-oci-login-02.cm.cluster> Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com> Signed-off-by: Saju Prasad <sajup@cw-dfw-cs-001-vscode-01.cm.cluster> Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: Peiyuan <qipeiyuan@outlook.com> Signed-off-by: liquor233 <liquor233@users.noreply.github.com> Co-authored-by: PytLab <PytLab@users.noreply.github.com> Co-authored-by: Jiashang Hu <jiashangh@nvidia.com> Co-authored-by: Saju Prasad <sajup@dc2-container-xterm-023.prd.it.nvidia.com> Co-authored-by: sajup-oss <sajup-oss@users.noreply.github.com> Co-authored-by: sajup <sajup@nvidia.com> Co-authored-by: liquor233 <liquor233@users.noreply.github.com> Co-authored-by: Saju Prasad <sajup@draco-oci-login-02.cm.cluster> Co-authored-by: Saju Prasad <sajup@cw-dfw-cs-001-vscode-01.cm.cluster> Co-authored-by: Peiyuan <qipeiyuan@outlook.com> Co-authored-by: Peiyuan Qi <bqi@nvidia.com>
Signed-off-by: Pablo Garay <pagaray@nvidia.com>
Signed-off-by: Charlie Truong <chtruong@nvidia.com>
* Add mistral small3 24B config and recipe Signed-off-by: Joosung Yoon <joosungy@nvidia.com> --------- Signed-off-by: Joosung Yoon <joosungy@nvidia.com> Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> Signed-off-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com> Co-authored-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> Co-authored-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
* beep boop: Update changelog Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> * Update changelog for 2.3.3 Signed-off-by: Charlie Truong <chtruong@nvidia.com> * Fix changelog for 2.3.3 Signed-off-by: Charlie Truong <chtruong@nvidia.com> --------- Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Signed-off-by: Charlie Truong <chtruong@nvidia.com> Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: Charlie Truong <chtruong@nvidia.com>
* QWEN2.5-VL FP8 Recipe Signed-off-by: Lifu Zhang <tomzhanglf@gmail.com> * Apply isort and black reformatting Signed-off-by: tomlifu <tomlifu@users.noreply.github.com> * add model configs Signed-off-by: Lifu Zhang <tomzhanglf@gmail.com> --------- Signed-off-by: Lifu Zhang <tomzhanglf@gmail.com> Signed-off-by: tomlifu <tomlifu@users.noreply.github.com> Co-authored-by: tomlifu <tomlifu@users.noreply.github.com>
* Add Customization Capabilities to Cache-Aware Models Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> * Unify params with other transcription scripts Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> * Fix usage with manifests containing relative paths Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> * Fix decoding config setup Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> * Return back output_path Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> * Raise not implemented error if batched beam search performed with partial hypotheses Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> * Raise not implemented error if batched beam search in transducer performed with partial hypotheses Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> * Fix after merge Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> * Fix att_context_size param Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> * Use optional for left_chunks Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> * Apply isort and black reformatting Signed-off-by: artbataev <artbataev@users.noreply.github.com> * Unify parameters with transcribe_speech Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> * Fix docstring Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> * Unify dtype selection Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> * Fix unused variables Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> * Enhance inline documentation. Set compute_dtype=float32 by default. Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> --------- Signed-off-by: Vladimir Bataev <vbataev@nvidia.com> Signed-off-by: artbataev <artbataev@users.noreply.github.com> Co-authored-by: artbataev <artbataev@users.noreply.github.com>
* Address problems where sometimes in 1m dataset there are very large masked segments Signed-off-by: John St John <jstjohn@nvidia.com> * only flip the tag extra if the segment length is too long Signed-off-by: John St John <jstjohn@nvidia.com> * Undo the change to the pre commit config Signed-off-by: John St John <jstjohn@nvidia.com> * Add clarifying comments about the state flipping logic Signed-off-by: John St John <jstjohn@nvidia.com> --------- Signed-off-by: John St John <jstjohn@nvidia.com>
* Update cherry-pick workflow to use version 0.63.0 Signed-off-by: Pablo Garay <palenq@gmail.com> * Update cherry-pick workflow version tag Signed-off-by: Pablo Garay <palenq@gmail.com> --------- Signed-off-by: Pablo Garay <palenq@gmail.com>
Signed-off-by: Andrew Schilling <aschilling@nvidia.com>
* beep boop: Update changelog Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> * Fix changelog for 2.4.1 Signed-off-by: Charlie Truong <chtruong@nvidia.com> --------- Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Signed-off-by: Charlie Truong <chtruong@nvidia.com> Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: Charlie Truong <chtruong@nvidia.com>
* Adding to false_positives.json Signed-off-by: Andrew Schilling <aschilling@nvidia.com> * Fixing redirected URLs in nlp_all.bib Signed-off-by: Andrew Schilling <aschilling@nvidia.com> --------- Signed-off-by: Andrew Schilling <aschilling@nvidia.com>
* add documentation for gpu phrase boosting Signed-off-by: andrusenkoau <andrusenkoau@gmail.com> * minor fixes Signed-off-by: andrusenkoau <andrusenkoau@gmail.com> * minor fixes Signed-off-by: andrusenkoau <andrusenkoau@gmail.com> * fix depth_scaling description Signed-off-by: andrusenkoau <andrusenkoau@gmail.com> * change default depth_scaling value Signed-off-by: andrusenkoau <andrusenkoau@gmail.com> * use default depth_scaling=1 for AED models Signed-off-by: andrusenkoau <andrusenkoau@gmail.com> * Apply isort and black reformatting Signed-off-by: andrusenkoau <andrusenkoau@users.noreply.github.com> * fixe broken link Signed-off-by: andrusenkoau <andrusenkoau@gmail.com> --------- Signed-off-by: andrusenkoau <andrusenkoau@gmail.com> Signed-off-by: andrusenkoau <andrusenkoau@users.noreply.github.com> Co-authored-by: andrusenkoau <andrusenkoau@users.noreply.github.com> Co-authored-by: Pablo Garay <palenq@gmail.com>
* add vllm support Signed-off-by: stevehuang52 <heh@nvidia.com> * refactor Signed-off-by: stevehuang52 <heh@nvidia.com> * update cfg Signed-off-by: stevehuang52 <heh@nvidia.com> * update cfg Signed-off-by: stevehuang52 <heh@nvidia.com> * update for nano-v2 Signed-off-by: stevehuang52 <heh@nvidia.com> * Potential fix for code scanning alert no. 16177: Unused import Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com> * update and refactor Signed-off-by: stevehuang52 <heh@nvidia.com> * update readme Signed-off-by: stevehuang52 <heh@nvidia.com> * update to pipecat=0.0.84 Signed-off-by: stevehuang52 <heh@nvidia.com> * add auto start/stop vllm server Signed-off-by: stevehuang52 <heh@nvidia.com> * update readme Signed-off-by: stevehuang52 <heh@nvidia.com> * auto switch between vllm and hf Signed-off-by: stevehuang52 <heh@nvidia.com> * refactor Signed-off-by: stevehuang52 <heh@nvidia.com> * update default cfg Signed-off-by: stevehuang52 <heh@nvidia.com> * add qwen3 example, refactor Signed-off-by: stevehuang52 <heh@nvidia.com> * update readme according to feedback Signed-off-by: stevehuang52 <heh@nvidia.com> * pin package version Signed-off-by: stevehuang52 <heh@nvidia.com> * Adding config manager and llm-specific yamls with short default yaml Signed-off-by: taejinp <tango4j@gmail.com> * Apply isort and black reformatting Signed-off-by: tango4j <tango4j@users.noreply.github.com> * Adding unit test for config_manager.py Signed-off-by: taejinp <tango4j@gmail.com> * Resolving merge conflict on config manager Signed-off-by: taejinp <tango4j@gmail.com> * Apply isort and black reformatting Signed-off-by: tango4j <tango4j@users.noreply.github.com> * Resolving Code QL. 
Signed-off-by: taejinp <tango4j@gmail.com> * Adding Conflict resolved config manager and test Signed-off-by: taejinp <tango4j@gmail.com> * Apply isort and black reformatting Signed-off-by: tango4j <tango4j@users.noreply.github.com> * Moving test files to example folder for cofinguration testing Signed-off-by: taejinp <tango4j@gmail.com> * Removed backup file Signed-off-by: taejinp <tango4j@gmail.com> * Adding config manager and llm-specific yamls and fixed the bugs Signed-off-by: taejinp <tango4j@gmail.com> * Adding NeMoTron Nano-9B-v2 as a default Signed-off-by: taejinp <tango4j@gmail.com> * Apply isort and black reformatting Signed-off-by: tango4j <tango4j@users.noreply.github.com> * fix environment Signed-off-by: stevehuang52 <heh@nvidia.com> * fix hf param resolve Signed-off-by: stevehuang52 <heh@nvidia.com> * update readme Signed-off-by: stevehuang52 <heh@nvidia.com> * update config manager, add llama3.1 example, refactor config style Signed-off-by: stevehuang52 <heh@nvidia.com> * update default yaml Signed-off-by: stevehuang52 <heh@nvidia.com> * update readme Signed-off-by: stevehuang52 <heh@nvidia.com> * update readme Signed-off-by: stevehuang52 <heh@nvidia.com> * pin nemo to 2.5 Signed-off-by: stevehuang52 <heh@nvidia.com> * fix env and cfg Signed-off-by: stevehuang52 <heh@nvidia.com> * Removing Qwen from generic hf config Signed-off-by: taejinp <tango4j@gmail.com> --------- Signed-off-by: stevehuang52 <heh@nvidia.com> Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com> Signed-off-by: taejinp <tango4j@gmail.com> Signed-off-by: tango4j <tango4j@users.noreply.github.com> Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> Co-authored-by: Taejin Park <tango4j@gmail.com> Co-authored-by: tango4j <tango4j@users.noreply.github.com>
* remove local attn constraint Signed-off-by: Chen Cui <chcui@nvidia.com> * fix Signed-off-by: Chen Cui <chcui@nvidia.com> --------- Signed-off-by: Chen Cui <chcui@nvidia.com>
* beep boop: Update changelog Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> * Remove 2.4.0 cherry-picks Signed-off-by: Charlie Truong <chtruong@nvidia.com> * Add speech highlights Signed-off-by: Charlie Truong <chtruong@nvidia.com> * Update changelog Signed-off-by: Charlie Truong <chtruong@nvidia.com> * Update the changelog Signed-off-by: Charlie Truong <chtruong@nvidia.com> --------- Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Signed-off-by: Charlie Truong <chtruong@nvidia.com> Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
* add qwen3 flops fix Signed-off-by: gdeng <gdeng@nvidia.com> * update model flops Signed-off-by: gdeng <gdeng@nvidia.com> * fix the flop cal bug Signed-off-by: gdeng <gdeng@nvidia.com> * Apply isort and black reformatting Signed-off-by: gdengk <gdengk@users.noreply.github.com> --------- Signed-off-by: gdeng <gdeng@nvidia.com> Signed-off-by: gdengk <gdengk@users.noreply.github.com> Co-authored-by: gdengk <gdengk@users.noreply.github.com>
…bucket (#14891) * [lhotse][aistore] added support input_cfg.yaml directly from aistore bucket Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> * fix: convert pythoon dict obj into DictConf Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> * move OmegaConf.create() outside of for loop. Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com> --------- Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Jason <jasoli@nvidia.com>
Important
The Update branch button must only be pressed on very rare occasions. An outdated branch never blocks the merge of a PR.
Please reach out to the automation team before pressing that button.
What does this PR do ?
Add a one line overview of what this PR aims to accomplish.
Collection: [Note which collection this PR will affect]
Changelog
Usage
# Add a code snippet demonstrating how to use this
GitHub Actions CI
The Jenkins CI system has been replaced by GitHub Actions self-hosted runners.
The GitHub Actions CI will run automatically when the "Run CICD" label is added to the PR.
To re-run CI, remove the label and add it again.
To run CI on an untrusted fork, a NeMo user with write access must first click "Approve and run".
Before your PR is "Ready for review"
Pre checks:
PR Type:
If you haven't finished some of the above items, you can still open a "Draft" PR.
Who can review?
Anyone in the community is free to review the PR once the checks have passed.
The Contributor guidelines list specific people who can review PRs in various areas.
Additional Information