Refactor megatron to mcore_bridge by tastelikefeet · Pull Request #134 · modelscope/twinkle

tastelikefeet · 2026-03-30T08:44:26Z

PR type

Bug Fix
New Feature
Document Updates
More Models or Datasets Support

PR information

Write the detailed information belonging to this PR.

Experiment results

Paste your experiment results here (if needed).

gemini-code-assist

Code Review

This pull request refactors the Megatron-Core integration by offloading model configuration, creation, and weight loading logic to the mcore_bridge dependency, which allows for the removal of significant internal boilerplate code. However, the review highlights several critical issues introduced during the refactoring: the removal of the _BASE_LAYER_SUFFIXES constant and the self.hf_config attribute will lead to runtime errors since they are still referenced in the codebase. Furthermore, the send_weights method contains a NameError due to the use of an undefined args variable, which should be replaced with values from the strategy configuration.
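To illustrate the NameError described in this review, here is a minimal sketch. The class names, the field names, and the method bodies are hypothetical stand-ins and are not taken from the twinkle codebase; only the pattern (a method reading an undefined `args` instead of the strategy configuration it already holds) follows the review text.

```python
from dataclasses import dataclass


@dataclass
class StrategyConfig:
    """Hypothetical stand-in for the strategy configuration."""
    tensor_parallel_size: int = 2
    pipeline_parallel_size: int = 1


class WeightSender:
    """Hypothetical class; it only illustrates the name-resolution issue."""

    def __init__(self, strategy: StrategyConfig):
        self.strategy = strategy

    def send_weights_broken(self) -> int:
        # `args` is not defined in any enclosing scope, so calling this
        # method raises NameError at runtime (not at import time).
        return args.tensor_parallel_size  # noqa: F821

    def send_weights_fixed(self) -> int:
        # Read the value from the strategy object the instance already holds.
        return self.strategy.tensor_parallel_size
```

Because Python resolves names only when the line executes, a leftover reference like this passes import and surfaces only once the method is actually called, which is why such errors tend to slip through a refactor.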

tastelikefeet · 2026-03-31T10:25:13Z

/gemini review

gemini-code-assist

Code Review

This pull request refactors the Megatron-Core integration by delegating model configuration and creation to a new MegatronStrategy class and the mcore_bridge library, while removing the internal args.py and the TorchSampler module. The review identifies a critical AttributeError in multi_lora.py where get_target_modules is called on the wrong class. Additionally, the review highlights that the refactored _add_base_layer_suffix is now too generic and may incorrectly modify non-LoRA layer names, and that MultiLoraMegatron initializes parameter configurations without an optimizer, which could prevent the setup of training-specific features like gradient reduction overlap.
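The concern about an overly generic `_add_base_layer_suffix` can be sketched as follows. This is an assumption-laden illustration, not the project's implementation: the function name, the `base_layer` wrapper attribute, and the example parameter names are hypothetical. The idea is to rewrite a parameter name only when its owning module is a known LoRA target.

```python
from typing import Iterable

BASE_LAYER = 'base_layer'  # assumed name of the wrapped-module attribute


def add_base_layer_suffix(name: str, lora_targets: Iterable[str]) -> str:
    """Insert `base_layer` before the parameter name for LoRA targets only."""
    # Split 'a.b.linear_qkv.weight' into 'a.b.linear_qkv' and 'weight'.
    module_path, _, param = name.rpartition('.')
    # The last path component is the name of the module owning the parameter.
    module_name = module_path.rsplit('.', 1)[-1]
    if module_name in set(lora_targets):
        return f'{module_path}.{BASE_LAYER}.{param}'
    # Non-target layers (and already wrapped names) are returned unchanged.
    return name
```

With `lora_targets={'linear_qkv'}`, a name ending in `linear_qkv.weight` gains the extra component, while a layer-norm weight or a name that already contains `base_layer` before the parameter is left alone.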

tastelikefeet · 2026-04-03T06:15:40Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces a GRPO training script for the OlympiadBench dataset, updates the lazy dataset implementation, and includes several improvements to the checkpointing and reward computation logic. My feedback highlights an unsafe mutation of _model_keys during iteration, a likely bug in the packing detection logic, inefficient instantiation of reward classes, and excessive logging that could impact performance.
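The "unsafe mutation during iteration" point can be shown with a small sketch. It assumes `_model_keys` behaves like a Python set; the review does not state its actual type, and the helper names and the predicate here are hypothetical.

```python
from typing import Callable, Set


def prune_unsafe(keys: Set[str], should_drop: Callable[[str], bool]) -> None:
    # Removing from a set while iterating over it raises
    # "RuntimeError: Set changed size during iteration".
    for key in keys:
        if should_drop(key):
            keys.discard(key)


def prune_safe(keys: Set[str], should_drop: Callable[[str], bool]) -> None:
    # Iterate over a snapshot so the underlying set can be mutated freely.
    for key in list(keys):
        if should_drop(key):
            keys.discard(key)
```

Taking a snapshot with `list(keys)` costs one extra copy; building a new filtered set and rebinding the attribute would be an equally valid alternative.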

wip

8fb9991

gemini-code-assist Bot reviewed Mar 30, 2026

Comment thread src/twinkle/model/megatron/megatron.py

Comment thread src/twinkle/model/megatron/megatron.py

Comment thread src/twinkle/model/megatron/megatron.py Outdated

tastelikefeet added 12 commits March 30, 2026 17:15

wip

35efd81

wip

01e7535

wip

399960a

fix

f2bd846

wip

096c193

wip

53c19a7

wip

d951d47

wip

ad82a77

wip

48cbf13

wip

43ef29e

fix

c3c7620

lint code

dec91c9

gemini-code-assist Bot reviewed Mar 31, 2026

Comment thread src/twinkle/model/multi_lora.py Outdated

Comment thread src/twinkle/model/megatron/megatron.py

Comment thread src/twinkle/model/megatron/multi_lora_megatron.py

tastelikefeet added 14 commits March 31, 2026 21:56

fix

0a1c34c

Merge commit 'a89ede55e3daa4fc36f0319c77847e0bf257fcce' into feat/mbridge

13ec7f5

fix

ab0b161

fix

d50465f

wip

08d2daf

wip

fa6b463

wip

db71d2e

Merge commit 'a222914cae55cca628bf5154bf88ae037cebe7f7' into feat/mbridge

a218b9d

wip

6c23ca1

Merge branch 'feat/mbridge' of https://github.com/tastelikefeet/twinkle into feat/mbridge

3862735

wip

ec2ac0b

wip

dc1ee9c

wip

f44fe4c

wip

4555bfc

tastelikefeet and others added 10 commits April 2, 2026 17:38

wip

e9cf0da

fix

30e1907

wip

1e56c29

Merge branch 'feat/mbridge' of https://github.com/tastelikefeet/twinkle into feat/mbridge

2a8895c

wip

682afab

wip

9e1f823

lint code

d823f7e

Merge branch 'main' into feat/mbridge

7c9d854

wip

9eb15b8

Merge branch 'feat/mbridge' of https://github.com/tastelikefeet/twinkle into feat/mbridge

5f2d047

gemini-code-assist Bot reviewed Apr 3, 2026

tastelikefeet added 6 commits April 3, 2026 14:42

fix

ec64959

wip

afa2bdf

wip

8af8b82

wip

9a819e7

wip

29cb370

fix

153afd8

hjh0119 approved these changes Apr 3, 2026

tastelikefeet added 4 commits April 3, 2026 23:34

wip

347bd6b

Merge branch 'feat/mbridge' of https://github.com/tastelikefeet/twinkle into feat/mbridge

ec6cf8c

fix

6488169

fix

5752d45

tastelikefeet merged commit 4db3c40 into modelscope:main Apr 3, 2026
1 of 3 checks passed

tastelikefeet merged 47 commits into modelscope:main from tastelikefeet:feat/mbridge

2 participants
