🐛 Describe the bug
My training process is blocked when saving an optimizer state via `Booster.save_optimizer`. The program blocks here: `ColossalAI/colossalai/zero/gemini/gemini_optimizer.py`, line 474 (commit 089c365).
Although I have no idea why the program blocks at `gc.collect`, I believe `gc.collect` is called there to free the storage as quickly as possible. So why don't we use the `free_storage` helper defined in `ColossalAI/colossalai/zero/gemini/chunk/chunk.py`, line 43 (commit 089c365):
```python
def free_storage(tensor: torch.Tensor) -> None:
```
Replacing `gc.collect` with `free_storage` solves my problem perfectly.
If replacing `gc.collect` with `free_storage` is a reasonable fix, I'd be happy to create a PR for it.
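For context, here is a minimal sketch of what such a storage-freeing helper could look like. Only the signature comes from the linked `chunk.py`; the body below is my assumption of the typical pattern (shrinking the tensor's underlying storage to zero bytes so the memory is released immediately, without waiting for a garbage-collection cycle):

```python
import torch


def free_storage(tensor: torch.Tensor) -> None:
    """Release the tensor's underlying storage immediately.

    The tensor's metadata (shape, dtype) is left untouched; only the
    backing storage is resized to zero bytes, so the memory is returned
    right away instead of waiting for gc.collect().
    """
    storage = tensor.untyped_storage()
    if storage.size() > 0:
        storage.resize_(0)


# Example: the storage is freed synchronously.
t = torch.empty(1024)
free_storage(t)
print(t.untyped_storage().size())  # 0
```

Unlike `gc.collect`, which scans all tracked objects (and can stall or deadlock when called from certain contexts), this frees exactly one tensor's storage and returns immediately.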
Environment
No response