Skip to content

Add Wan 2.1 convergence comparison documentation#74

Merged
pablo-garay merged 1 commit intomainfrom
move_report
Dec 3, 2025
Merged

Add Wan 2.1 convergence comparison documentation#74
pablo-garay merged 1 commit intomainfrom
move_report

Conversation

@abhinavg4
Copy link
Contributor

  • Introduced a new document detailing the comparison between Diffusers (Automodel path) and Megatron-Core (Megatron-Bridge path) for Wan 2.1.
  • Included experiment overview, dataset specifications, training setup, and results with visual training curves.
  • Added two binary images illustrating loss vs. steps for both text-to-image and text-to-video stages.

This documentation aims to provide insights into the model's performance and training dynamics during the partial convergence test.

- Introduced a new document detailing the comparison between Diffusers (Automodel path) and Megatron-Core (Megatron-Bridge path) for Wan 2.1.
- Included experiment overview, dataset specifications, training setup, and results with visual training curves.
- Added two binary images illustrating loss vs. steps for both text-to-image and text-to-video stages.

This documentation aims to provide insights into the model's performance and training dynamics during the partial convergence test.
@pablo-garay pablo-garay merged commit 7270c9f into main Dec 3, 2025
8 checks passed
lbliii pushed a commit that referenced this pull request Dec 3, 2025
- Introduced a new document detailing the comparison between Diffusers (Automodel path) and Megatron-Core (Megatron-Bridge path) for Wan 2.1.
- Included experiment overview, dataset specifications, training setup, and results with visual training curves.
- Added two binary images illustrating loss vs. steps for both text-to-image and text-to-video stages.

This documentation aims to provide insights into the model's performance and training dynamics during the partial convergence test.

Signed-off-by: Lawrence Lane <llane@nvidia.com>
@chtruong814 chtruong814 deleted the move_report branch January 29, 2026 20:26
huvunvidia pushed a commit that referenced this pull request Feb 12, 2026
- Introduced a new document detailing the comparison between Diffusers (Automodel path) and Megatron-Core (Megatron-Bridge path) for Wan 2.1.
- Included experiment overview, dataset specifications, training setup, and results with visual training curves.
- Added two binary images illustrating loss vs. steps for both text-to-image and text-to-video stages.

This documentation aims to provide insights into the model's performance and training dynamics during the partial convergence test.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants

Comments