Initial README commit by abhinavg4 · Pull Request #53 · NVIDIA-NeMo/DFM

abhinavg4 · 2025-11-16T18:35:35Z

Init README.md

abhinavg4

Tagging relevant people

README.md

- Corrected the link in the README for the performance summary to point to the correct file. - Introduced a new `performance-summary.md` document detailing performance benchmarks for large language models using DFM, including nomenclature, performance metrics, and system configurations.

docs/performance-summary.md

README.md

Signed-off-by: sajadn <snorouzi@nvidia.com>

copy-pr-bot · 2025-11-18T22:13:16Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

README.md

Signed-off-by: Parth Mannan <pmannan@nvidia.com>

- Removed redundant description of the framework. - Clarified the relationship between Megatron Bridge and Megatron Core in the Dual-Path Architecture section.

abhinavg4 · 2025-11-20T17:49:46Z

README.md

uv run --group megatron-bridge python -m torch.distributed.run --nproc_per_node=2 examples/megatron/recipes/wan/pretrain_wan.py --config-file examples/megatron/recipes/wan/config/1.3B_mock.yaml

…m descriptions - Updated the Megatron Bridge Path section to include 6D parallelism details. - Added state-of-the-art performance optimizations to the Dual Training Paths section. - Clarified parallelism terminology in the comparison table for better understanding.

Signed-off-by: Parth Mannan <pmannan@nvidia.com>

…init Signed-off-by: Parth Mannan <pmannan@nvidia.com>

Signed-off-by: linnan wang <wangnan318@gmail.com>

docs/performance-summary.md

Co-authored-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>

…ness - Simplified descriptions of Megatron Bridge and AutoModel paths in README.md. - Removed outdated comparison table to streamline content. - Updated performance-summary.md to generalize model references and improve clarity. Co-authored-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>

abhinavg4 · 2025-12-01T12:59:05Z

/ok to test 31e7def

…ction header for consistency.

abhinavg4 · 2025-12-01T18:04:39Z

/ok to test f86c51e

@akoumpa

* Initial README commit * Update README and add performance summary documentation - Corrected the link in the README for the performance summary to point to the correct file. - Introduced a new `performance-summary.md` document detailing performance benchmarks for large language models using DFM, including nomenclature, performance metrics, and system configurations. * add DiT megatron links. Signed-off-by: sajadn <snorouzi@nvidia.com> * Performance Docs update Signed-off-by: Parth Mannan <pmannan@nvidia.com> * Performance Docs update fix Signed-off-by: Parth Mannan <pmannan@nvidia.com> * Update README to enhance clarity and accuracy - Removed redundant description of the framework. - Clarified the relationship between Megatron Bridge and Megatron Core in the Dual-Path Architecture section. * Enhance README with detailed performance optimizations and parallelism descriptions - Updated the Megatron Bridge Path section to include 6D parallelism details. - Added state-of-the-art performance optimizations to the Dual Training Paths section. - Clarified parallelism terminology in the comparison table for better understanding. * Update perf doc Signed-off-by: Parth Mannan <pmannan@nvidia.com> * update Signed-off-by: linnan wang <wangnan318@gmail.com> * Update README with fine-tuning command Removed TODO comment and added a command for fine-tuning a video diffusion model. * Apply suggestion from @akoumpa * Apply suggestion from @akoumpa * Apply suggestion from @akoumpa * Update README, Wan-related. Updated command syntax and improved clarity in README. * Apply suggestion from @akoumpa * Fixing typo @akoumpa * fix automodel section Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * fix Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * update DFM-specific readme Signed-off-by: Pablo Garay <pagaray@nvidia.com> * Update performance-summary.md Thanks a lot @linnanwang for the bench numbers. * Update performance-summary.md * Update performance-summary.md * Update README.md Co-authored-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com> * Update README.md Co-authored-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com> * Update README.md Co-authored-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com> * Update README.md Co-authored-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com> * Refactor README.md and performance-summary.md for clarity and conciseness - Simplified descriptions of Megatron Bridge and AutoModel paths in README.md. - Removed outdated comparison table to streamline content. - Updated performance-summary.md to generalize model references and improve clarity. Co-authored-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com> * Fix typo in README.md: changed "Built" to "Build" in the container section header for consistency. --------- Signed-off-by: sajadn <snorouzi@nvidia.com> Signed-off-by: Parth Mannan <pmannan@nvidia.com> Signed-off-by: linnan wang <wangnan318@gmail.com> Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> Signed-off-by: Pablo Garay <pagaray@nvidia.com> Co-authored-by: sajadn <snorouzi@nvidia.com> Co-authored-by: Parth Mannan <pmannan@nvidia.com> Co-authored-by: linnan wang <wangnan318@gmail.com> Co-authored-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com> Co-authored-by: Huy Vu <86480512+huvunvidia@users.noreply.github.com> Co-authored-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> Co-authored-by: Pablo Garay <pagaray@nvidia.com> Co-authored-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com> Signed-off-by: Lawrence Lane <llane@nvidia.com>

@akoumpa

* Initial README commit * Update README and add performance summary documentation - Corrected the link in the README for the performance summary to point to the correct file. - Introduced a new `performance-summary.md` document detailing performance benchmarks for large language models using DFM, including nomenclature, performance metrics, and system configurations. * add DiT megatron links. Signed-off-by: sajadn <snorouzi@nvidia.com> * Performance Docs update Signed-off-by: Parth Mannan <pmannan@nvidia.com> * Performance Docs update fix Signed-off-by: Parth Mannan <pmannan@nvidia.com> * Update README to enhance clarity and accuracy - Removed redundant description of the framework. - Clarified the relationship between Megatron Bridge and Megatron Core in the Dual-Path Architecture section. * Enhance README with detailed performance optimizations and parallelism descriptions - Updated the Megatron Bridge Path section to include 6D parallelism details. - Added state-of-the-art performance optimizations to the Dual Training Paths section. - Clarified parallelism terminology in the comparison table for better understanding. * Update perf doc Signed-off-by: Parth Mannan <pmannan@nvidia.com> * update Signed-off-by: linnan wang <wangnan318@gmail.com> * Update README with fine-tuning command Removed TODO comment and added a command for fine-tuning a video diffusion model. * Apply suggestion from @akoumpa * Apply suggestion from @akoumpa * Apply suggestion from @akoumpa * Update README, Wan-related. Updated command syntax and improved clarity in README. * Apply suggestion from @akoumpa * Fixing typo @akoumpa * fix automodel section Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * fix Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * update DFM-specific readme Signed-off-by: Pablo Garay <pagaray@nvidia.com> * Update performance-summary.md Thanks a lot @linnanwang for the bench numbers. * Update performance-summary.md * Update performance-summary.md * Update README.md Co-authored-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com> * Update README.md Co-authored-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com> * Update README.md Co-authored-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com> * Update README.md Co-authored-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com> * Refactor README.md and performance-summary.md for clarity and conciseness - Simplified descriptions of Megatron Bridge and AutoModel paths in README.md. - Removed outdated comparison table to streamline content. - Updated performance-summary.md to generalize model references and improve clarity. Co-authored-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com> * Fix typo in README.md: changed "Built" to "Build" in the container section header for consistency. --------- Signed-off-by: sajadn <snorouzi@nvidia.com> Signed-off-by: Parth Mannan <pmannan@nvidia.com> Signed-off-by: linnan wang <wangnan318@gmail.com> Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> Signed-off-by: Pablo Garay <pagaray@nvidia.com> Co-authored-by: sajadn <snorouzi@nvidia.com> Co-authored-by: Parth Mannan <pmannan@nvidia.com> Co-authored-by: linnan wang <wangnan318@gmail.com> Co-authored-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com> Co-authored-by: Huy Vu <86480512+huvunvidia@users.noreply.github.com> Co-authored-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> Co-authored-by: Pablo Garay <pagaray@nvidia.com> Co-authored-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>

Initial README commit

3266077

copy-pr-bot bot temporarily deployed to test November 16, 2025 18:35 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci November 16, 2025 18:35 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci November 16, 2025 18:37 Inactive

abhinavg4 commented Nov 16, 2025

View reviewed changes

copy-pr-bot bot temporarily deployed to nemo-ci November 16, 2025 18:53 Inactive

copy-pr-bot bot temporarily deployed to test November 16, 2025 19:35 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci November 16, 2025 19:36 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci November 16, 2025 19:38 Inactive

abhinavg4 commented Nov 16, 2025

View reviewed changes

docs/performance-summary.md Show resolved Hide resolved

copy-pr-bot bot temporarily deployed to nemo-ci November 16, 2025 19:53 Inactive

euronymous-aithal reviewed Nov 16, 2025

View reviewed changes

README.md Outdated Show resolved Hide resolved

README.md Outdated Show resolved Hide resolved

pablo-garay previously approved these changes Nov 17, 2025

View reviewed changes

add DiT megatron links.

79f9d26

Signed-off-by: sajadn <snorouzi@nvidia.com>

sajadn dismissed pablo-garay’s stale review via 79f9d26 November 18, 2025 22:13

abhinavg4 commented Nov 19, 2025

View reviewed changes

README.md Show resolved Hide resolved

bernardwin reviewed Nov 19, 2025

View reviewed changes

README.md Outdated Show resolved Hide resolved

parthmannan and others added 3 commits November 19, 2025 11:26

Performance Docs update

b96cf8f

Signed-off-by: Parth Mannan <pmannan@nvidia.com>

Performance Docs update fix

2b00158

Signed-off-by: Parth Mannan <pmannan@nvidia.com>

Update README to enhance clarity and accuracy

8e471a0

- Removed redundant description of the framework. - Clarified the relationship between Megatron Bridge and Megatron Core in the Dual-Path Architecture section.

abhinavg4 commented Nov 20, 2025

View reviewed changes

abhinavg4 and others added 4 commits November 20, 2025 18:56

Update perf doc

2233811

Signed-off-by: Parth Mannan <pmannan@nvidia.com>

Merge branch 'readme_init' of github.com:NVIDIA-NeMo/DFM into readme_…

60fae1d

…init Signed-off-by: Parth Mannan <pmannan@nvidia.com>

update

88ddbf1

Signed-off-by: linnan wang <wangnan318@gmail.com>

abhinavg4 commented Nov 21, 2025

View reviewed changes

docs/performance-summary.md Outdated Show resolved Hide resolved

abhinavg4 and others added 6 commits December 1, 2025 04:31

Update README.md

796103e

Co-authored-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>

Update README.md

9ea6116

Co-authored-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>

Update README.md

ebf00bf

Co-authored-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>

Update README.md

7083f86

Co-authored-by: Wenwen Gao <94138584+snowmanwwg@users.noreply.github.com>

Merge branch 'main' into readme_init

31e7def

copy-pr-bot bot temporarily deployed to test December 1, 2025 12:59 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci December 1, 2025 12:59 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci December 1, 2025 13:39 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci December 1, 2025 14:38 Inactive

Fix typo in README.md: changed "Built" to "Build" in the container se…

f86c51e

…ction header for consistency.

abhinavg4 enabled auto-merge (squash) December 1, 2025 18:04

copy-pr-bot bot temporarily deployed to test December 1, 2025 18:04 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci December 1, 2025 18:05 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci December 1, 2025 19:14 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci December 1, 2025 21:07 Inactive

ntajbakhsh self-requested a review December 2, 2025 01:11

ntajbakhsh approved these changes Dec 2, 2025

View reviewed changes

Merge branch 'main' into readme_init

8640f3f

pablo-garay disabled auto-merge December 3, 2025 02:29

pablo-garay merged commit b867706 into main Dec 3, 2025
6 checks passed

chtruong814 deleted the readme_init branch January 29, 2026 20:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Initial README commit#53

Initial README commit#53
pablo-garay merged 31 commits intomainfrom
readme_init

abhinavg4 commented Nov 16, 2025

Uh oh!

abhinavg4 left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

copy-pr-bot bot commented Nov 18, 2025

Uh oh!

Uh oh!

Uh oh!

abhinavg4 Nov 20, 2025

Uh oh!

Uh oh!

abhinavg4 commented Dec 1, 2025

Uh oh!

abhinavg4 commented Dec 1, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

11 participants

Comments

Conversation

abhinavg4 commented Nov 16, 2025

Uh oh!

abhinavg4 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

copy-pr-bot bot commented Nov 18, 2025

Uh oh!

Uh oh!

Uh oh!

abhinavg4 Nov 20, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

abhinavg4 commented Dec 1, 2025

Uh oh!

abhinavg4 commented Dec 1, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

11 participants

Comments