Skip to content

Magpietts 2508 merge main#81

Merged
blisc merged 94 commits intoblisc:magpietts_2508_test_mergefrom
NVIDIA-NeMo:magpietts_2508_merge_main
Oct 10, 2025
Merged

Magpietts 2508 merge main#81
blisc merged 94 commits intoblisc:magpietts_2508_test_mergefrom
NVIDIA-NeMo:magpietts_2508_merge_main

Conversation

@blisc
Copy link
Owner

@blisc blisc commented Oct 10, 2025

Important

The Update branch button must only be pressed in very rare occassions.
An outdated branch is never blocking the merge of a PR.
Please reach out to the automation team before pressing that button.

What does this PR do ?

Add a one line overview of what this PR aims to accomplish.

Collection: [Note which collection this PR will affect]

Changelog

  • Add specific line by line info of high level changes in this PR.

Usage

  • You can potentially add a usage example below
# Add a code snippet demonstrating how to use this 

GitHub Actions CI

The Jenkins CI system has been replaced by GitHub Actions self-hosted runners.

The GitHub Actions CI will run automatically when the "Run CICD" label is added to the PR.
To re-run CI remove and add the label again.
To run CI on an untrusted fork, a NeMo user with write access must first click "Approve and run".

Before your PR is "Ready for review"

Pre checks:

  • Make sure you read and followed Contributor guidelines
  • Did you write any new necessary tests?
  • Did you add or update any necessary documentation?
  • Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
    • Reviewer: Does the PR have correct import guards for all optional libraries?

PR Type:

  • New Feature
  • Bugfix
  • Documentation

If you haven't finished some of the above items you can still open "Draft" PR.

Who can review?

Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.

Additional Information

  • Related to # (issue)

meatybobby and others added 30 commits August 25, 2025 15:15
* Support QwenVL for inference engine

* Apply isort and black reformatting

Signed-off-by: meatybobby <meatybobby@users.noreply.github.com>

* Remove comment out

* Reformat

* Skip pylint check

* Add unit tests

* Apply isort and black reformatting

Signed-off-by: meatybobby <meatybobby@users.noreply.github.com>

---------

Signed-off-by: meatybobby <meatybobby@users.noreply.github.com>
Co-authored-by: meatybobby <meatybobby@users.noreply.github.com>
* Fix sequence packing loss calculation

Signed-off-by: Rayan Dasoriya <dasoriyarayan@gmail.com>

* Fix nemo2 path

Signed-off-by: Rayan Dasoriya <dasoriyarayan@gmail.com>

* Skip pylint

Signed-off-by: Rayan Dasoriya <dasoriyarayan@gmail.com>

---------

Signed-off-by: Rayan Dasoriya <dasoriyarayan@gmail.com>
Co-authored-by: Dmytro Pykhtar <37850217+dimapihtar@users.noreply.github.com>
* [Audio]: added streaming mode to SpectrogramToAudio

Signed-off-by: Rauf <rnasretdinov@nvidia.com>

* added time buffer

Signed-off-by: Rauf <rnasretdinov@nvidia.com>

* renamed Nf -> num_frames

Signed-off-by: Rauf <rnasretdinov@nvidia.com>

* added AudioToSpectrogram and scale and magnitude power

Signed-off-by: Rauf <rnasretdinov@nvidia.com>

* added multiple chunking support

Signed-off-by: Rauf <rnasretdinov@nvidia.com>

* added properties _stream_initialized, _eps, got rid of _prev_spec_frame

Signed-off-by: Rauf <rnasretdinov@nvidia.com>

* added hanning window

Signed-off-by: Rauf <rnasretdinov@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: nasretdinovr <nasretdinovr@users.noreply.github.com>

* added a docstring regarding streaming istft mode

Signed-off-by: Rauf <rnasretdinov@nvidia.com>

---------

Signed-off-by: Rauf <rnasretdinov@nvidia.com>
Signed-off-by: nasretdinovr <nasretdinovr@users.noreply.github.com>
Co-authored-by: nasretdinovr <nasretdinovr@users.noreply.github.com>
…rs (#14514)

* Update evo2 defaults so converted checkpoints have the right parameters

Signed-off-by: John St John <jstjohn@nvidia.com>

* Fix line too long issue

Signed-off-by: John St John <jstjohn@nvidia.com>

* Fix expected changes to configs that are locked into our tests

Signed-off-by: John St John <jstjohn@nvidia.com>

---------

Signed-off-by: John St John <jstjohn@nvidia.com>
Signed-off-by: dimapihtar <dpihtar@gmail.com>
Signed-off-by: Malay Nagda <malayn@nvidia.com>
…t_store flags (#14522)

* Add use te activation func and save act input in fp8 flags

Signed-off-by: Guyue Huang <guyueh@nvidia.com>

* Fix field name

Signed-off-by: Guyue Huang <guyueh@nvidia.com>

* Update scripts/performance/vlm/finetune_qwen25vl_32b.py

Co-authored-by: malay-nagda <malayn@nvidia.com>
Signed-off-by: Guyue Huang <140554423+guyueh1@users.noreply.github.com>

---------

Signed-off-by: Guyue Huang <guyueh@nvidia.com>
Signed-off-by: Guyue Huang <140554423+guyueh1@users.noreply.github.com>
Co-authored-by: malay-nagda <malayn@nvidia.com>
* Bump TE and Mcore

Signed-off-by: Charlie Truong <chtruong@nvidia.com>

* Use Mcore 69b65

Signed-off-by: Charlie Truong <chtruong@nvidia.com>

---------

Signed-off-by: Charlie Truong <chtruong@nvidia.com>
* remove sync in logging

Signed-off-by: qiyuw <qiyuw@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: WanZzzzzz <WanZzzzzz@users.noreply.github.com>

* add class and func docstrings in data_sampler.py for pylint

Signed-off-by: qiyuw <qiyuw@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: WanZzzzzz <WanZzzzzz@users.noreply.github.com>

---------

Signed-off-by: qiyuw <qiyuw@nvidia.com>
Signed-off-by: WanZzzzzz <WanZzzzzz@users.noreply.github.com>
Co-authored-by: qiyuw <qiyuw@nvidia.com>
Co-authored-by: WanZzzzzz <WanZzzzzz@users.noreply.github.com>
* add 1b arclongcontextconfig

Signed-off-by: Farhad Ramezanghorbani <farhadr@nvidia.com>

* fix device mess

Signed-off-by: Farhad Ramezanghorbani <farhadr@nvidia.com>

* add implicit_filter support

Signed-off-by: Farhad Ramezanghorbani <farhadr@nvidia.com>

* use padded input

Signed-off-by: Farhad Ramezanghorbani <farhadr@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: farhadrgh <farhadrgh@users.noreply.github.com>

* Revert "add 1b arclongcontextconfig"

This reverts commit 029969b.

---------

Signed-off-by: Farhad Ramezanghorbani <farhadr@nvidia.com>
Signed-off-by: farhadrgh <farhadrgh@users.noreply.github.com>
* fix gemma2 27b kv dimension

Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com>

* fix gemma2 27b kv dimension

Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com>

---------

Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com>
* feat: print expert groups on megatron init (#13874)

Signed-off-by: Alexander Zhipa <azzhipa@amazon.com>
Co-authored-by: Alexander Zhipa <azzhipa@amazon.com>
Signed-off-by: CarlosGomes98 <carlosmiguel.gomes@live.com.pt>

* set a different seed for each dp rank

Signed-off-by: CarlosGomes98 <carlosmiguel.gomes@live.com.pt>

* calculate loss inside autocast

Signed-off-by: CarlosGomes98 <carlosmiguel.gomes@live.com.pt>

* disable per token loss, grad acc fusion

Signed-off-by: CarlosGomes98 <carlosmiguel.gomes@live.com.pt>

* add missing self.seed

Signed-off-by: CarlosGomes98 <carlosmiguel.gomes@live.com.pt>

* black formatting

Signed-off-by: CarlosGomes98 <carlosmiguel.gomes@live.com.pt>

* Apply isort and black reformatting

Signed-off-by: gautham-kollu <gautham-kollu@users.noreply.github.com>

---------

Signed-off-by: Alexander Zhipa <azzhipa@amazon.com>
Signed-off-by: CarlosGomes98 <carlosmiguel.gomes@live.com.pt>
Signed-off-by: gautham-kollu <gautham-kollu@users.noreply.github.com>
Co-authored-by: Alexander Zhipa <alex.zhipa@proton.me>
Co-authored-by: Alexander Zhipa <azzhipa@amazon.com>
Co-authored-by: gautham-kollu <gkollu@nvidia.com>
Co-authored-by: gautham-kollu <gautham-kollu@users.noreply.github.com>
* [Flux] Add MXFP8 support.

Signed-off-by: Wil Kong <alpha0422@gmail.com>

* [Flux] Add current and block scaling.

Signed-off-by: Wil Kong <alpha0422@gmail.com>

---------

Signed-off-by: Wil Kong <alpha0422@gmail.com>
Signed-off-by: Ao Tang <aot@nvidia.com>
…li triplet dataset with NeMo Framework (#14584)

* Create E2E-Embedding-Finetuning

Signed-off-by: Hemant Giri <30834697+girihemant19@users.noreply.github.com>

* Update E2E-Embedding-Finetuning

Signed-off-by: Hemant Giri <30834697+girihemant19@users.noreply.github.com>

* Delete tutorials/llm/embedding/E2E-Embedding-Finetuning

Signed-off-by: Hemant Giri <30834697+girihemant19@users.noreply.github.com>

* Create README.md

Signed-off-by: Hemant Giri <30834697+girihemant19@users.noreply.github.com>

* Add files via upload

Signed-off-by: Hemant Giri <30834697+girihemant19@users.noreply.github.com>

* Add files via upload

This is a notebook for E2E finetuning a embedding model

Signed-off-by: Hemant Giri <30834697+girihemant19@users.noreply.github.com>

* Update README.md

Signed-off-by: Hemant Giri <30834697+girihemant19@users.noreply.github.com>

* Update README.md

Signed-off-by: Hemant Giri <30834697+girihemant19@users.noreply.github.com>

* Delete tutorials/llm/embedding/E2E-Embedding-Finetuning/download_dataset.py

Signed-off-by: Hemant Giri <30834697+girihemant19@users.noreply.github.com>

* Delete tutorials/llm/embedding/E2E-Embedding-Finetuning/finetune_e5.py

Signed-off-by: Hemant Giri <30834697+girihemant19@users.noreply.github.com>

* Delete tutorials/llm/embedding/E2E-Embedding-Finetuning/finetune_llama1b.py

Signed-off-by: Hemant Giri <30834697+girihemant19@users.noreply.github.com>

* Delete tutorials/llm/embedding/E2E-Embedding-Finetuning/import_e5_large.py

Signed-off-by: Hemant Giri <30834697+girihemant19@users.noreply.github.com>

* Delete tutorials/llm/embedding/E2E-Embedding-Finetuning/import_llama1b.py

Signed-off-by: Hemant Giri <30834697+girihemant19@users.noreply.github.com>

---------

Signed-off-by: Hemant Giri <30834697+girihemant19@users.noreply.github.com>
Co-authored-by: Ao Tang <aot@nvidia.com>
Signed-off-by: Guyue Huang <guyueh@nvidia.com>
Signed-off-by: dimapihtar <dpihtar@gmail.com>
Signed-off-by: jenchen13 <jennifchen@nvidia.com>
Signed-off-by: Chen Cui <chcui@nvidia.com>
Signed-off-by: Rauf <rnasretdinov@nvidia.com>
Signed-off-by: Chen Cui <chcui@nvidia.com>
Signed-off-by: Charlie Truong <chtruong@nvidia.com>
* fix flux seed as optional

Signed-off-by: Ao Tang <aot@nvidia.com>

* fix fluxcontrolnet

Signed-off-by: Ao Tang <aot@nvidia.com>

* Fix code checkout during test

Signed-off-by: Charlie Truong <chtruong@nvidia.com>

---------

Signed-off-by: Ao Tang <aot@nvidia.com>
Signed-off-by: Charlie Truong <chtruong@nvidia.com>
Co-authored-by: Charlie Truong <chtruong@nvidia.com>
Signed-off-by: Jason <jasoli@nvidia.com>
* Remove PEFT scheme condition from recipe

Signed-off-by: Ali Taghibakhshi <71892896+JRD971000@users.noreply.github.com>

* remove unnecessary peft conditioning 12b

---------

Signed-off-by: Ali Taghibakhshi <71892896+JRD971000@users.noreply.github.com>
* add gpt-oss lora exporter

Signed-off-by: Chen Cui <chcui@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: cuichenx <cuichenx@users.noreply.github.com>

* update lora exporter for experts

Signed-off-by: Chen Cui <chcui@nvidia.com>

* disallow exporting expert lora since nemo implementation is not equivalent to hf

Signed-off-by: Chen Cui <chcui@nvidia.com>

* linting

Signed-off-by: Chen Cui <chcui@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: cuichenx <cuichenx@users.noreply.github.com>

* address comment

Signed-off-by: Chen Cui <chcui@nvidia.com>

---------

Signed-off-by: Chen Cui <chcui@nvidia.com>
Signed-off-by: cuichenx <cuichenx@users.noreply.github.com>
Co-authored-by: cuichenx <cuichenx@users.noreply.github.com>
Co-authored-by: Charlie Truong <chtruong@nvidia.com>
* update streaming ASR

Signed-off-by: stevehuang52 <heh@nvidia.com>

* add voice agent

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update readme

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update websocket

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update readme

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update

Signed-off-by: stevehuang52 <heh@nvidia.com>

* clean up

Signed-off-by: stevehuang52 <heh@nvidia.com>

* clean up

Signed-off-by: stevehuang52 <heh@nvidia.com>

* fix typo

Signed-off-by: stevehuang52 <heh@nvidia.com>

* fix codeQL

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update cfg

Signed-off-by: stevehuang52 <heh@nvidia.com>

* remove unused

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update readme

Signed-off-by: stevehuang52 <heh@nvidia.com>

* change default models

Signed-off-by: stevehuang52 <heh@nvidia.com>

* fix diar diable

Signed-off-by: stevehuang52 <heh@nvidia.com>

* fix diar diable

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update ux

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update tts

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update readme

Signed-off-by: stevehuang52 <heh@nvidia.com>

* fix and update

Signed-off-by: stevehuang52 <heh@nvidia.com>

* fix asr

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update readmme

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update doc and llm dtype

Signed-off-by: stevehuang52 <heh@nvidia.com>

* refactor and add example prompts

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update readme

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update readme

Signed-off-by: stevehuang52 <heh@nvidia.com>

* clean up

Signed-off-by: stevehuang52 <heh@nvidia.com>

* clean up

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update info on streaming sortformer

Signed-off-by: stevehuang52 <heh@nvidia.com>

* move code to 'nemo/agents/voice_agent'

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update doc

Signed-off-by: stevehuang52 <heh@nvidia.com>

* clean up

Signed-off-by: stevehuang52 <heh@nvidia.com>

* refactor

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update doc

Signed-off-by: stevehuang52 <heh@nvidia.com>

* remove the unnecessary streaming state conversion and import it from sortformer_modules, remove PostProcessingParams

Signed-off-by: Weiqing Wang <weiqingw@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: weiqingw4ng <weiqingw4ng@users.noreply.github.com>

* update doc

Signed-off-by: stevehuang52 <heh@nvidia.com>

* clean up

Signed-off-by: stevehuang52 <heh@nvidia.com>

* fix for llama-nemotron template, and refactor

Signed-off-by: stevehuang52 <heh@nvidia.com>

* fix tts separator

Signed-off-by: stevehuang52 <heh@nvidia.com>

* fix for llama-nemotron

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update cfg

Signed-off-by: stevehuang52 <heh@nvidia.com>

* refactor and update doc

Signed-off-by: stevehuang52 <heh@nvidia.com>

* change default llm to qwen

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update doc

Signed-off-by: stevehuang52 <heh@nvidia.com>

---------

Signed-off-by: stevehuang52 <heh@nvidia.com>
Signed-off-by: Weiqing Wang <weiqingw@nvidia.com>
Signed-off-by: weiqingw4ng <weiqingw4ng@users.noreply.github.com>
Co-authored-by: Taejin Park <tango4j@gmail.com>
Co-authored-by: Kunal Dhawan <kunaldhawan97@gmail.com>
Co-authored-by: Weiqing Wang <weiqingw@nvidia.com>
Co-authored-by: weiqingw4ng <weiqingw4ng@users.noreply.github.com>
andrusenkoau and others added 29 commits September 23, 2025 08:33
* replace texterros with kaldialign for f-score computation

Signed-off-by: andrusenkoau <andrusenkoau@gmail.com>

* replace texterros with kaldialign for asr confidence

Signed-off-by: andrusenkoau <andrusenkoau@gmail.com>

* replace texterrors with kaldialign for ASR_Confidence_Estimation.ipynb

Signed-off-by: andrusenkoau <andrusenkoau@gmail.com>

* replace texterrors with kaldialing for ASR_Context_Biasing.ipynb

Signed-off-by: andrusenkoau <andrusenkoau@gmail.com>

* Apply isort and black reformatting

Signed-off-by: andrusenkoau <andrusenkoau@users.noreply.github.com>

* decrease kaldialign version

Signed-off-by: andrusenkoau <andrusenkoau@gmail.com>

---------

Signed-off-by: andrusenkoau <andrusenkoau@gmail.com>
Signed-off-by: andrusenkoau <andrusenkoau@users.noreply.github.com>
Co-authored-by: andrusenkoau <andrusenkoau@users.noreply.github.com>
* Update prune-distill notebooks to Qwen3 + simplify

Signed-off-by: Keval Morabia <28916987+kevalmorabia97@users.noreply.github.com>

* address comments

Signed-off-by: Keval Morabia <28916987+kevalmorabia97@users.noreply.github.com>

* Add readme.rst

Signed-off-by: Keval Morabia <28916987+kevalmorabia97@users.noreply.github.com>

---------

Signed-off-by: Keval Morabia <28916987+kevalmorabia97@users.noreply.github.com>
* add deprecation notice

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* add deprecation notice

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* add deprecation warning

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* remove import

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* move code

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* add more notices

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>

* Remove automodel cicd

Signed-off-by: Dong Hyuk Chang <donghyukc@nvidia.com>

* Add deprecation notice for Automodel

Signed-off-by: Dong Hyuk Chang <donghyukc@nvidia.com>

---------

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
Signed-off-by: Dong Hyuk Chang <donghyukc@nvidia.com>
Co-authored-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Co-authored-by: akoumpa <akoumpa@users.noreply.github.com>
Signed-off-by: Charlie Truong <chtruong@nvidia.com>
Signed-off-by: Chen Cui <chcui@nvidia.com>
Signed-off-by: Aditya Vavre <avavre@nvidia.com>
…rs (#14639)

* Add fallback for file copy to handle metadata errors

Signed-off-by: vipnydav <vipinydv@google.com>

* Add robust_copy for resilient file copy

Signed-off-by: vipnydav <vipinydv@google.com>

* Apply isort and black reformatting

Signed-off-by: vipnydav <vipnydav@users.noreply.github.com>

* remove imported Path from test_file.py

Signed-off-by: vipnydav <vipinydv@google.com>

* Move robust_copy method to util file

Signed-off-by: vipnydav <vipinydv@google.com>

* Apply isort and black reformatting

Signed-off-by: vipnydav <vipnydav@users.noreply.github.com>

* Fix lint

Signed-off-by: vipnydav <vipinydv@google.com>

---------

Signed-off-by: vipnydav <vipinydv@google.com>
Signed-off-by: vipnydav <vipnydav@users.noreply.github.com>
Co-authored-by: vipnydav <vipnydav@users.noreply.github.com>
* feat: add callback group definition & callback ABC

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: "Zhengjiang Shao" <zshao@nvidia.com>

Signed-off-by: Zhengjiang Shao <zshao@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: PytLab <PytLab@users.noreply.github.com>

* feat: insert callback functions of CallbackGroup

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: "Zhengjiang Shao" <zshao@nvidia.com>

Signed-off-by: Zhengjiang Shao <zshao@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: PytLab <PytLab@users.noreply.github.com>

* chore: PR test for jiashang

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* feat: use __init_subclass__ to cover all ModelPT subclasses

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: "Zhengjiang Shao" <zshao@nvidia.com>

Signed-off-by: Zhengjiang Shao <zshao@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: PytLab <PytLab@users.noreply.github.com>

* feat: Adding metadata config manager poc

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: "Saju Prasad" <sajup@dc2-container-xterm-023.prd.it.nvidia.com>

Signed-off-by: Saju Prasad <sajup@dc2-container-xterm-023.prd.it.nvidia.com>

* Apply isort and black reformatting

Signed-off-by: sajup-oss <sajup-oss@users.noreply.github.com>

* feat: revert test changes.

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* fix: Updating metadata attributes

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: sajup <sajup@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: sajup-oss <sajup-oss@users.noreply.github.com>

* fix: Adding OneloggerCallback

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: sajup <sajup@nvidia.com>

* fix: Reverting changes in examples/multimodal/speech_llm/modular_audio_gpt_train.py

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: sajup <sajup@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: sajup-oss <sajup-oss@users.noreply.github.com>

* fix: update modular models and megatron GPT models

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* feat: add on_app_start and on_app_end

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* fix: Adding small test example for testing

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: sajup <sajup@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: sajup-oss <sajup-oss@users.noreply.github.com>

* fix: Fixing review comments as discussed with Jiashang

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: "Saju Prasad" <sajup@draco-oci-login-02.cm.cluster>

Signed-off-by: Saju Prasad <sajup@draco-oci-login-02.cm.cluster>

* Apply isort and black reformatting

Signed-off-by: sajup-oss <sajup-oss@users.noreply.github.com>

* fix: updating nemo code to v2

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: sajup <sajup@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: sajup-oss <sajup-oss@users.noreply.github.com>

* fix: updating wandb to get info from env

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: sajup <sajup@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: sajup-oss <sajup-oss@users.noreply.github.com>

* fix: fix som impl issue

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* fix: fix issue for exp manager.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* feat: remove callback_group

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com>

* feat: fix timingtracker issue

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* feat: fix for startup callbcaks

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* feat: change to adapter

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* feat: use new nv-one-logger

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com>

* feat: add on_app_end

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* feat: make OneLogger configurable

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* feat: remove NeMocallback import

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com>

* feat: fix the enable_onelogger setting.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* feat: clean the code.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com>

* feat: enable onelogger

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* test: Adding few unit tests

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: "Saju Prasad" <sajup@cw-dfw-cs-001-vscode-01.cm.cluster>

Signed-off-by: Saju Prasad <sajup@cw-dfw-cs-001-vscode-01.cm.cluster>

* Apply isort and black reformatting

Signed-off-by: sajup-oss <sajup-oss@users.noreply.github.com>

* feat: tmp fix for functional testing.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* fix: add on_app_end for NeMov2

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com>

* fix: typo.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* fix: fix the get attributes

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com>

* fix: moving test test_meta_info_manager.py to tests/collections/common/

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: "Saju Prasad" <sajup@cw-dfw-cs-001-vscode-01.cm.cluster>

Signed-off-by: Saju Prasad <sajup@cw-dfw-cs-001-vscode-01.cm.cluster>

* fix: fix format issue.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* fix: fix lint errors

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* Revert "Apply isort and black reformatting"

This reverts commit de6994d.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* Revert "fix: fix lint errors"

This reverts commit 8e47ecd.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: fix linting issues.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* fix: fix linting issue

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* fix: add copyright info

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* fix: small fix.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com>

* fix: fix small issues for t5

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com>

* fix: fix dataloader issue.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com>

* fix: remove dataloader setting.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com>

* feat: update OneLogger.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com>

* fix: fix hydra runner.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* fix: start using partial config.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* fix: fix the unused variables

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: change get_one_logger name

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: code clean up.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* fix: import more specific to avoid circular dependency. (#14306)

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: Peiyuan <qipeiyuan@outlook.com>

* fix: use ptl callback from ls

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* feat: fix meta info manager.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com>

* fix: fix meta data issue.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* fix: fix the lint issue

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: fix the unit tests.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: fix minor metadata issue.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* fix: fix some test issues

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: fix pytest issue for meta info manager

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: fix lint issues for optimizers.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: fix pytest issues.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* fix: fix CICD issues.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: fix all pytests

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* chore: fix lint

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* chore: fix unused import issues.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* chore: fix CICD issues.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* fix: fix the CICD issues.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* fix: fix the linting issue

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: fix CICD issues.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: fix the circular import issue.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* fix: fix some pytests.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: revert some change.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: error handling for init onelogger

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* chore: fix one_logger code.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* chore: remove unused vars.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: fix CICD for nemo

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* chore: fix NeMo CICD.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* chore: renaming onelogger

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* chore: fix some exception.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* chore: renaming.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* chore: resolve some comments.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* chore: remove duplicate init.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* chore: resolve some github comments.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* chore: fix the linting issue.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* chore(callbacks): restore generic CallbackGroup and route telemetry v… (#14628)

* chore(callbacks): restore generic CallbackGroup and route telemetry via group\n\n- Add BaseCallback and CallbackGroup with update_config and class init hook\n- Register OneLoggerAdapterCallback into group; merge config update into class\n- Replace direct OneLogger API usages with CallbackGroup across code\n- Ensure trainer attaches registered callbacks via group.update_config\n- Add nv-one-logger>=2.0.0 to base requirements\n\nSigned-off-by: Jiashang Hu <jiashangh@nvidia.com>

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

* chore: renaming.

* chore: revert the change to install nv-one-logger

* chore: fix the linting issue

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

---------

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>
Signed-off-by: liquor233 <liquor233@users.noreply.github.com>
Co-authored-by: liquor233 <liquor233@users.noreply.github.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com>

* Add tests for callback group (#14632)

* chore: fix some circular dependency issues.

* chore: move the files to utils.

* chore: add unit tests

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

* chore: fix nv-one-logger tests

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

* chore: fix lint issue.

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

* chore: change the location.

* chore: remaining fix.

* chore: remaining changes.

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

* chore: fix the tests

* chore: fix some lint.

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

* Revert prompt_encoder.py to c5ef26c (Jason Wang) to undo auto-formatting

* pre-commit: exclude prompt_encoder.py from black/isort formatting

* chore: undo lasst commit.

* fix: fix some part for nemocallback.

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

* chore: fix some pytest

* fix: verify the auto-hooked functions are called once

Signed-off-by: Zhengjiang Shao <zshao@nvidia.com>

---------

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>
Signed-off-by: Zhengjiang Shao <zshao@nvidia.com>
Co-authored-by: liquor233 <liquor233@users.noreply.github.com>
Co-authored-by: Zhengjiang Shao <zshao@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com>

* fix: fix the double init issue

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

Signed-off-by: liquor233 <jiashangh@nvidia.com>

* fix: fix the push

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* Guarantee one logger on_app_end calls (#14691)

* fix: guarantee on_app_end calls can be invoked finally

Signed-off-by: Zhengjiang Shao <zshao@nvidia.com>

* feat: add context manager creator in CallbackGroup

* Revert "feat: add context manager creator in CallbackGroup"

This reverts commit 381f83d.

Signed-off-by: Zhengjiang Shao <zshao@nvidia.com>

---------

Signed-off-by: Zhengjiang Shao <zshao@nvidia.com>

* fix: remove meta info manager (#14689)

* fix: remove meta info manager

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

---------

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>
Signed-off-by: liquor233 <liquor233@users.noreply.github.com>
Co-authored-by: liquor233 <liquor233@users.noreply.github.com>

* fix: fix some linting issues.

* fix: fix unit tests.

* chore: fix mcore

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: fix the installing problem

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: fix requirements

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: fix the mcore version.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: fix the mcore version.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: fix the mcore version.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: fix the mcore version.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: fix the mcore version.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: fix the mcore version.

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: use correct global_step for async ckpt success event

Signed-off-by: Zhengjiang Shao <zshao@nvidia.com>

* fix: fix unit tests

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: fix requirements

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: refactor the unit tests

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: insert callbacks in CallbackGroup before other PTL callbacks

Signed-off-by: Zhengjiang Shao <zshao@nvidia.com>

* fix: fix call on app start flag

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

* fix: fix unit tests

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: bump nv-one-logger version

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: fix the unit tests

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

* fix: fix the cicd issues.

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

* fix: fix some lint issues

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: fix unused import

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: make oneloggernemocallback singleton

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: fix lint issues

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: make oneloggernemocallback singleton

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

* fix: keep the original callbacks order in CallbackGroup when merging with trainer.callbacks

* fix: fix the unit tests

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: fix unit tests

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

* fix: fix lint issues

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: fix the pickle issue.

* Apply isort and black reformatting

Signed-off-by: liquor233 <liquor233@users.noreply.github.com>

* fix: fix issue.

* fix: fix callback

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

* fix: fix callback group

Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>

---------

Signed-off-by: Zhengjiang Shao <zshao@nvidia.com>
Signed-off-by: PytLab <PytLab@users.noreply.github.com>
Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>
Signed-off-by: Saju Prasad <sajup@dc2-container-xterm-023.prd.it.nvidia.com>
Signed-off-by: sajup-oss <sajup-oss@users.noreply.github.com>
Signed-off-by: liquor233 <jiashangh@nvidia.com>
Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: sajup <sajup@nvidia.com>
Signed-off-by: sajup <sajup@nvidia.com>
Signed-off-by: Saju Prasad <sajup@draco-oci-login-02.cm.cluster>
Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: liquor233 <jiashangh@nvidia.com>
Signed-off-by: Saju Prasad <sajup@cw-dfw-cs-001-vscode-01.cm.cluster>
Signed-off-by: Jiashang Hu <jiashangh@nvidia.com>\nSigned-off-by: Peiyuan <qipeiyuan@outlook.com>
Signed-off-by: liquor233 <liquor233@users.noreply.github.com>
Co-authored-by: PytLab <PytLab@users.noreply.github.com>
Co-authored-by: Jiashang Hu <jiashangh@nvidia.com>
Co-authored-by: Saju Prasad <sajup@dc2-container-xterm-023.prd.it.nvidia.com>
Co-authored-by: sajup-oss <sajup-oss@users.noreply.github.com>
Co-authored-by: sajup <sajup@nvidia.com>
Co-authored-by: liquor233 <liquor233@users.noreply.github.com>
Co-authored-by: Saju Prasad <sajup@draco-oci-login-02.cm.cluster>
Co-authored-by: Saju Prasad <sajup@cw-dfw-cs-001-vscode-01.cm.cluster>
Co-authored-by: Peiyuan <qipeiyuan@outlook.com>
Co-authored-by: Peiyuan Qi <bqi@nvidia.com>
Signed-off-by: Pablo Garay <pagaray@nvidia.com>
Signed-off-by: Charlie Truong <chtruong@nvidia.com>
* Add mistral small3 24B config and recipe

Signed-off-by: Joosung Yoon <joosungy@nvidia.com>

---------

Signed-off-by: Joosung Yoon <joosungy@nvidia.com>
Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Signed-off-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
Co-authored-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Co-authored-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
* beep boop: Update changelog

Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

* Update changelog for 2.3.3

Signed-off-by: Charlie Truong <chtruong@nvidia.com>

* Fix changelog for 2.3.3

Signed-off-by: Charlie Truong <chtruong@nvidia.com>

---------

Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Signed-off-by: Charlie Truong <chtruong@nvidia.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: Charlie Truong <chtruong@nvidia.com>
* QWEN2.5-VL FP8 Recipe

Signed-off-by: Lifu Zhang <tomzhanglf@gmail.com>

* Apply isort and black reformatting

Signed-off-by: tomlifu <tomlifu@users.noreply.github.com>

* add model configs

Signed-off-by: Lifu Zhang <tomzhanglf@gmail.com>

---------

Signed-off-by: Lifu Zhang <tomzhanglf@gmail.com>
Signed-off-by: tomlifu <tomlifu@users.noreply.github.com>
Co-authored-by: tomlifu <tomlifu@users.noreply.github.com>
* Add Customization Capabilities to Cache-Aware Models

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

* Unify params with other transcription scripts

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

* Fix usage with manifests containing relative paths

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

* Fix decoding config setup

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

* Return back output_path

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

* Raise not implemented error if batched beam search performed with partial hypotheses

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

* Raise not implemented error if batched beam search in transducer performed with partial hypotheses

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

* Fix after merge

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

* Fix att_context_size param

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

* Use optional for left_chunks

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: artbataev <artbataev@users.noreply.github.com>

* Unify parameters with transcribe_speech

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

* Fix docstring

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

* Unify dtype selection

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

* Fix unused variables

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

* Enhance inline documentation. Set compute_dtype=float32 by default.

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>

---------

Signed-off-by: Vladimir Bataev <vbataev@nvidia.com>
Signed-off-by: artbataev <artbataev@users.noreply.github.com>
Co-authored-by: artbataev <artbataev@users.noreply.github.com>
* Address problems where sometimes in 1m dataset there are very large masked segments

Signed-off-by: John St John <jstjohn@nvidia.com>

* only flip the tag extra if the segment length is too long

Signed-off-by: John St John <jstjohn@nvidia.com>

* Undo the change to the pre commit config

Signed-off-by: John St John <jstjohn@nvidia.com>

* Add clarifying comments about the state flipping logic

Signed-off-by: John St John <jstjohn@nvidia.com>

---------

Signed-off-by: John St John <jstjohn@nvidia.com>
* Update cherry-pick workflow to use version 0.63.0

Signed-off-by: Pablo Garay <palenq@gmail.com>

* Update cherry-pick workflow version tag

Signed-off-by: Pablo Garay <palenq@gmail.com>

---------

Signed-off-by: Pablo Garay <palenq@gmail.com>
Signed-off-by: Andrew Schilling <aschilling@nvidia.com>
* beep boop: Update changelog

Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

* Fix changelog for 2.4.1

Signed-off-by: Charlie Truong <chtruong@nvidia.com>

---------

Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Signed-off-by: Charlie Truong <chtruong@nvidia.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: Charlie Truong <chtruong@nvidia.com>
* Adding to false_positives.json

Signed-off-by: Andrew Schilling <aschilling@nvidia.com>

* Fixing redirected URLs in nlp_all.bib

Signed-off-by: Andrew Schilling <aschilling@nvidia.com>

---------

Signed-off-by: Andrew Schilling <aschilling@nvidia.com>
* add documentation for gpu phrase boosting

Signed-off-by: andrusenkoau <andrusenkoau@gmail.com>

* minor fixes

Signed-off-by: andrusenkoau <andrusenkoau@gmail.com>

* minor fixes

Signed-off-by: andrusenkoau <andrusenkoau@gmail.com>

* fix depth_scaling description

Signed-off-by: andrusenkoau <andrusenkoau@gmail.com>

* change default depth_scaling value

Signed-off-by: andrusenkoau <andrusenkoau@gmail.com>

* use default depth_scaling=1 for AED models

Signed-off-by: andrusenkoau <andrusenkoau@gmail.com>

* Apply isort and black reformatting

Signed-off-by: andrusenkoau <andrusenkoau@users.noreply.github.com>

* fixe broken link

Signed-off-by: andrusenkoau <andrusenkoau@gmail.com>

---------

Signed-off-by: andrusenkoau <andrusenkoau@gmail.com>
Signed-off-by: andrusenkoau <andrusenkoau@users.noreply.github.com>
Co-authored-by: andrusenkoau <andrusenkoau@users.noreply.github.com>
Co-authored-by: Pablo Garay <palenq@gmail.com>
)

Signed-off-by: Keval Morabia <28916987+kevalmorabia97@users.noreply.github.com>
* add vllm support

Signed-off-by: stevehuang52 <heh@nvidia.com>

* refactor

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update cfg

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update cfg

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update for nano-v2

Signed-off-by: stevehuang52 <heh@nvidia.com>

* Potential fix for code scanning alert no. 16177: Unused import

Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>
Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>

* update and refactor

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update readme

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update to pipecat=0.0.84

Signed-off-by: stevehuang52 <heh@nvidia.com>

* add auto start/stop vllm server

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update readme

Signed-off-by: stevehuang52 <heh@nvidia.com>

* auto switch between vllm and hf

Signed-off-by: stevehuang52 <heh@nvidia.com>

* refactor

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update default cfg

Signed-off-by: stevehuang52 <heh@nvidia.com>

* add qwen3 example, refactor

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update readme according to feedback

Signed-off-by: stevehuang52 <heh@nvidia.com>

* pin package version

Signed-off-by: stevehuang52 <heh@nvidia.com>

* Adding config manager and llm-specific yamls with short default yaml

Signed-off-by: taejinp <tango4j@gmail.com>

* Apply isort and black reformatting

Signed-off-by: tango4j <tango4j@users.noreply.github.com>

* Adding unit test for config_manager.py

Signed-off-by: taejinp <tango4j@gmail.com>

* Resolving merge conflict on config manager

Signed-off-by: taejinp <tango4j@gmail.com>

* Apply isort and black reformatting

Signed-off-by: tango4j <tango4j@users.noreply.github.com>

* Resolving Code QL.

Signed-off-by: taejinp <tango4j@gmail.com>

* Adding Conflict resolved config manager and test

Signed-off-by: taejinp <tango4j@gmail.com>

* Apply isort and black reformatting

Signed-off-by: tango4j <tango4j@users.noreply.github.com>

* Moving test files to example folder for cofinguration testing

Signed-off-by: taejinp <tango4j@gmail.com>

* Removed backup file

Signed-off-by: taejinp <tango4j@gmail.com>

* Adding config manager and llm-specific yamls and fixed the bugs

Signed-off-by: taejinp <tango4j@gmail.com>

* Adding NeMoTron Nano-9B-v2 as a default

Signed-off-by: taejinp <tango4j@gmail.com>

* Apply isort and black reformatting

Signed-off-by: tango4j <tango4j@users.noreply.github.com>

* fix environment

Signed-off-by: stevehuang52 <heh@nvidia.com>

* fix hf param resolve

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update readme

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update config manager, add llama3.1 example, refactor config style

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update default yaml

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update readme

Signed-off-by: stevehuang52 <heh@nvidia.com>

* update readme

Signed-off-by: stevehuang52 <heh@nvidia.com>

* pin nemo to 2.5

Signed-off-by: stevehuang52 <heh@nvidia.com>

* fix env and cfg

Signed-off-by: stevehuang52 <heh@nvidia.com>

* Removing Qwen from generic hf config

Signed-off-by: taejinp <tango4j@gmail.com>

---------

Signed-off-by: stevehuang52 <heh@nvidia.com>
Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
Signed-off-by: taejinp <tango4j@gmail.com>
Signed-off-by: tango4j <tango4j@users.noreply.github.com>
Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>
Co-authored-by: Taejin Park <tango4j@gmail.com>
Co-authored-by: tango4j <tango4j@users.noreply.github.com>
* remove local attn constraint

Signed-off-by: Chen Cui <chcui@nvidia.com>

* fix

Signed-off-by: Chen Cui <chcui@nvidia.com>

---------

Signed-off-by: Chen Cui <chcui@nvidia.com>
* beep boop: Update changelog

Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

* Remove 2.4.0 cherry-picks

Signed-off-by: Charlie Truong <chtruong@nvidia.com>

* Add speech highlights

Signed-off-by: Charlie Truong <chtruong@nvidia.com>

* Update changelog

Signed-off-by: Charlie Truong <chtruong@nvidia.com>

* Update the changelog

Signed-off-by: Charlie Truong <chtruong@nvidia.com>

---------

Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Signed-off-by: Charlie Truong <chtruong@nvidia.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
* add qwen3 flops fix

Signed-off-by: gdeng <gdeng@nvidia.com>

* update model flops

Signed-off-by: gdeng <gdeng@nvidia.com>

* fix the flop cal bug

Signed-off-by: gdeng <gdeng@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: gdengk <gdengk@users.noreply.github.com>

---------

Signed-off-by: gdeng <gdeng@nvidia.com>
Signed-off-by: gdengk <gdengk@users.noreply.github.com>
Co-authored-by: gdengk <gdengk@users.noreply.github.com>
…bucket (#14891)

* [lhotse][aistore] added support input_cfg.yaml directly from aistore bucket

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>

* fix: convert pythoon dict obj into DictConf

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>

* move OmegaConf.create() outside of for loop.

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>

---------

Signed-off-by: Xuesong Yang <1646669+XuesongYang@users.noreply.github.com>
Signed-off-by: Jason <jasoli@nvidia.com>
@blisc blisc merged commit 2498ae2 into blisc:magpietts_2508_test_merge Oct 10, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.