Conversation
…opt, fishr and erm
…rain epoch. Reverted prior backpack changes
Codecov Report
All modified and coverable lines are covered by tests ✅
Additional details and impacted files

@@           Coverage Diff            @@
##  mhof_dev_merge    #843      +/-  ##
========================================
+ Coverage    90.77%  90.90%   +0.12%
========================================
  Files          137     137
  Lines         5853    5858       +5
========================================
+ Hits          5313    5325      +12
+ Misses         540     533       -7
Flags with carried-forward coverage won't be shown.
smilesun
left a comment
I think we need a comment on "flag_info" so that the code reader/reviewer knows what this variable does in general.
Has this yaml file been tested?
Yes, it was a separate issue, 831, which is also linked in this PR.
I tested it, and it resulted in an error. Will paste it below.
Looks like nothing big: zdata does not have pacs yet.
domainlab/zdata/pacs/PACS/art_painting
Now I get:
OutOfMemoryError in file
/ictstr01/home/aih/xudong.sun/domainlab_master/domainlab/exp_protocol/benchmark.smk, line 154:
zoutput/slurm_logs/run_experiment/run_experiment-index=14-21649209.err-251-CUDA out of memory. Tried to allocate 1.98 GiB. GPU 0 has a total capacty of 19.50 GiB of which 221.88 MiB is free. Including
non-PyTorch memory, this process has 19.24 GiB memory in use. Process 1322808 has 19.24 GiB memory in use. Of the allocated memory 18.93 GiB is allocated by PyTorch, and 71.99 MiB is rese
Is it because some GPUs have larger memory, so your run went through? @MatteoWohlrapp
Does it say which of the two experiments in the yaml it was? We could try a different dataset to see if it works then. I do remember that it ran on the cluster.
You introduced 'flag_info' in your mhof_dev branch. Can you give a brief explanation? I don't think I fully understand the naming.

I added it because otherwise training was not possible. It is set to self.flag_setpoint_updated in train_fbopt_b.py.
Added functionality to use ERM with the hyperparameter scheduling. As an alternative to adding the hyper init and hyper update methods to ERM, we could also add them to the a_model superclass, or check whether the methods exist before invoking them in the scheduler.
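The last option (checking whether a model provides the hooks before the scheduler invokes them) can be sketched with plain duck typing. All class and function names below are hypothetical illustrations, not domainlab's actual API:

```python
# Hypothetical sketch: the scheduler probes for optional hyperparameter
# hooks instead of requiring every model (e.g. plain ERM) to implement them.
# `ErmModel`, `TradeoffModel`, and `schedule_step` are illustrative names.


class ErmModel:
    """Plain ERM model: no hyperparameter-scheduling hooks."""


class TradeoffModel:
    """Model exposing the optional scheduling hooks."""

    def __init__(self):
        self.mu = 0.0

    def hyper_init(self, value):
        self.mu = value

    def hyper_update(self, epoch):
        # toy schedule: ramp the trade-off weight up toward 1.0
        self.mu = min(1.0, 0.1 * epoch)


def schedule_step(model, epoch):
    # Invoke the hook only if the model actually provides it, so ERM can
    # pass through the same training loop unchanged.
    if hasattr(model, "hyper_update"):
        model.hyper_update(epoch)


erm = ErmModel()
schedule_step(erm, 3)  # no-op for ERM

model = TradeoffModel()
model.hyper_init(0.0)
schedule_step(model, 3)  # model.mu is now 0.1 * 3
```

Compared with adding no-op hooks to the a_model superclass, this keeps ERM untouched but puts the burden on the scheduler; the superclass variant keeps the scheduler simple at the cost of widening the base-class interface.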