[QEff. Finetuning]: Fixed Data Parallel issue #845

Merged
quic-swatia merged 3 commits into quic:ft_experimental from quic-swatia:pp_config
Mar 11, 2026


@quic-swatia quic-swatia commented Mar 10, 2026

  1. Fixed Data Parallel issue.
  2. Removed the sample config for PP, since the config manager changes are now merged.
  3. Updated the PP documentation accordingly.

Signed-off-by: Swati Allabadi <sallabad@qti.qualcomm.com>
Signed-off-by: Swati Allabadi <sallabad@qti.qualcomm.com>
@quic-swatia quic-swatia changed the title from "[QEff. Finetuning]: Update PP config/doc" to "[QEff. Finetuning]: Fixed Data Parallel issue" Mar 11, 2026
Signed-off-by: Swati Allabadi <sallabad@qti.qualcomm.com>
python -m QEfficient.cloud.finetune_experimental configs/sft_single_device_gsm8k_config.yaml
```
- where `sample_pp_config.yaml` contains `pp_degree: 2` under `training:`.
+ where `sft_single_device_gsm8k_config.yaml` contains `pp_degree: 2` under `training:`.

@quic-akuruvil quic-akuruvil Mar 11, 2026

For `pp_degree: 2`, two devices are required, right? But the config file name is ambiguous here. Can we not create a different file and keep this one for single-device runs?

Contributor Author

As discussed offline, since we only need one extra param (i.e. `pp_degree`) to be set to a value greater than 1, it is better to reuse the existing config file.
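
For readers following the thread, here is a minimal sketch of what "reuse the existing config and set one extra param" could look like when the config is already loaded as a plain dictionary. The helper names `with_pp_degree` and `min_devices` are hypothetical and not part of QEfficient. The rule that `pp_degree: N` needs at least N devices is an assumption taken from the reviewer's question, not something this PR confirms.

```python
import copy


def with_pp_degree(config: dict, pp_degree: int) -> dict:
    """Return a copy of `config` with `training.pp_degree` overridden.

    Hypothetical helper: the original dictionary is left untouched so the
    single-device settings stay available for single-device runs.
    """
    if pp_degree < 1:
        raise ValueError("pp_degree must be >= 1")
    updated = copy.deepcopy(config)
    updated.setdefault("training", {})["pp_degree"] = pp_degree
    return updated


def min_devices(config: dict) -> int:
    """Assumption: one device per pipeline stage; a missing key means 1."""
    return config.get("training", {}).get("pp_degree", 1)


# Stand-in for the parsed contents of the existing single-device config.
base = {"training": {"pp_degree": 1}}
pp_run = with_pp_degree(base, 2)

print(min_devices(base), min_devices(pp_run))
```

Deriving the PP variant from the base settings keeps a single file as the source of truth, which is the trade-off the reply above prefers over maintaining a separate sample config.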

@quic-swatia quic-swatia merged commit 828cdc3 into quic:ft_experimental Mar 11, 2026
3 checks passed
smedhe pushed a commit to smedhe/QEff_Sharvari that referenced this pull request Mar 17, 2026
1) Fixed Data Parallel issue.
2) Removed sample config for PP as the changes of config manager are now
merged.
3) Updated the PP documentation accordingly.

---------

Signed-off-by: Swati Allabadi <sallabad@qti.qualcomm.com>
Co-authored-by: Swati Allabadi <sallabad@qti.qualcomm.com>
smedhe pushed a commit to smedhe/QEff_Sharvari that referenced this pull request Mar 23, 2026
smedhe pushed a commit to smedhe/QEff_Sharvari that referenced this pull request Mar 24, 2026