Initial README commit #53

Merged
pablo-garay merged 31 commits into main from readme_init
Dec 3, 2025

Conversation

@abhinavg4
Contributor

Init README.md

Contributor Author

@abhinavg4 abhinavg4 left a comment

Tagging relevant people

- Corrected the link in the README for the performance summary to point to the correct file.
- Introduced a new `performance-summary.md` document detailing performance benchmarks for large language models using DFM, including nomenclature, performance metrics, and system configurations.
pablo-garay previously approved these changes Nov 17, 2025
Signed-off-by: sajadn <snorouzi@nvidia.com>
@copy-pr-bot

copy-pr-bot bot commented Nov 18, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

parthmannan and others added 3 commits November 19, 2025 11:26
Signed-off-by: Parth Mannan <pmannan@nvidia.com>
Signed-off-by: Parth Mannan <pmannan@nvidia.com>
- Removed redundant description of the framework.
- Clarified the relationship between Megatron Bridge and Megatron Core in the Dual-Path Architecture section.
README.md Outdated
Contributor Author

uv run --group megatron-bridge python -m torch.distributed.run --nproc_per_node=2 examples/megatron/recipes/wan/pretrain_wan.py --config-file examples/megatron/recipes/wan/config/1.3B_mock.yaml

abhinavg4 and others added 4 commits November 20, 2025 18:56
…m descriptions

- Updated the Megatron Bridge Path section to include 6D parallelism details.
- Added state-of-the-art performance optimizations to the Dual Training Paths section.
- Clarified parallelism terminology in the comparison table for better understanding.
Signed-off-by: Parth Mannan <pmannan@nvidia.com>
…init

Signed-off-by: Parth Mannan <pmannan@nvidia.com>
Signed-off-by: linnan wang <wangnan318@gmail.com>
abhinavg4 and others added 6 commits December 1, 2025 04:31
Co-authored-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>
Co-authored-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>
Co-authored-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>
Co-authored-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>
…ness

- Simplified descriptions of Megatron Bridge and AutoModel paths in README.md.
- Removed outdated comparison table to streamline content.
- Updated performance-summary.md to generalize model references and improve clarity.

Co-authored-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>
@abhinavg4
Contributor Author

/ok to test 31e7def

@abhinavg4 abhinavg4 enabled auto-merge (squash) December 1, 2025 18:04
@abhinavg4
Contributor Author

/ok to test f86c51e

@pablo-garay pablo-garay disabled auto-merge December 3, 2025 02:29
@pablo-garay pablo-garay merged commit b867706 into main Dec 3, 2025
6 checks passed
lbliii pushed a commit that referenced this pull request Dec 3, 2025
* Initial README commit

* Update README and add performance summary documentation

- Corrected the link in the README for the performance summary to point to the correct file.
- Introduced a new `performance-summary.md` document detailing performance benchmarks for large language models using DFM, including nomenclature, performance metrics, and system configurations.

* add DiT megatron links.

Signed-off-by: sajadn <snorouzi@nvidia.com>

* Performance Docs update

Signed-off-by: Parth Mannan <pmannan@nvidia.com>

* Performance Docs update fix

Signed-off-by: Parth Mannan <pmannan@nvidia.com>

* Update README to enhance clarity and accuracy

- Removed redundant description of the framework.
- Clarified the relationship between Megatron Bridge and Megatron Core in the Dual-Path Architecture section.

* Enhance README with detailed performance optimizations and parallelism descriptions

- Updated the Megatron Bridge Path section to include 6D parallelism details.
- Added state-of-the-art performance optimizations to the Dual Training Paths section.
- Clarified parallelism terminology in the comparison table for better understanding.

* Update perf doc

Signed-off-by: Parth Mannan <pmannan@nvidia.com>

* update

Signed-off-by: linnan wang <wangnan318@gmail.com>

* Update README with fine-tuning command

Removed TODO comment and added a command for fine-tuning a video diffusion model.

* Apply suggestion from @akoumpa

* Apply suggestion from @akoumpa

* Apply suggestion from @akoumpa

* Update README, Wan-related.

Updated command syntax and improved clarity in README.

* Apply suggestion from @akoumpa

* Fixing typo @akoumpa

* fix automodel section

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* fix

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* update DFM-specific readme

Signed-off-by: Pablo Garay <pagaray@nvidia.com>

* Update performance-summary.md

Thanks a lot @linnanwang for the bench numbers.

* Update performance-summary.md

* Update performance-summary.md

* Update README.md

Co-authored-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>

* Update README.md

Co-authored-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>

* Update README.md

Co-authored-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>

* Update README.md

Co-authored-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>

* Refactor README.md and performance-summary.md for clarity and conciseness

- Simplified descriptions of Megatron Bridge and AutoModel paths in README.md.
- Removed outdated comparison table to streamline content.
- Updated performance-summary.md to generalize model references and improve clarity.

Co-authored-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>

* Fix typo in README.md: changed "Built" to "Build" in the container section header for consistency.

---------

Signed-off-by: sajadn <snorouzi@nvidia.com>
Signed-off-by: Parth Mannan <pmannan@nvidia.com>
Signed-off-by: linnan wang <wangnan318@gmail.com>
Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Signed-off-by: Pablo Garay <pagaray@nvidia.com>
Co-authored-by: sajadn <snorouzi@nvidia.com>
Co-authored-by: Parth Mannan <pmannan@nvidia.com>
Co-authored-by: linnan wang <wangnan318@gmail.com>
Co-authored-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
Co-authored-by: Huy Vu <86480512+huvunvidia@users.noreply.github.com>
Co-authored-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Co-authored-by: Pablo Garay <pagaray@nvidia.com>
Co-authored-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>
Signed-off-by: Lawrence Lane <llane@nvidia.com>
@chtruong814 chtruong814 deleted the readme_init branch January 29, 2026 20:26
huvunvidia added a commit that referenced this pull request Feb 12, 2026