Fix weird interaction between disk offload and group offload tests #10822

a-r-r-o-w · 2025-02-18T16:44:20Z

Not quite sure what causes the following test failures: https://github.com/huggingface/diffusers/actions/runs/13384935248/job/37379809841#step:6:10497

From some debugging, the following seems to be happening:

test_disk_offload_without_safetensors and test_disk_offload_with_safetensors run first. They add Accelerate hooks to the model so that the device map is handled correctly.
When the group offloading tests run afterwards, every newly created model instance also contains Accelerate hooks, for a reason I haven't pinned down yet.

This makes me believe Accelerate is applying hooks at the class level instead of the instance level (I'm not sure yet and will look into the Accelerate code as soon as I can).

I've added a new test (for repro purposes only) that shows the above behaviour happening consistently on some models, while on others it passes without problems 🤷‍♂️

pytest -s tests/models/transformers/test_models_transformer_cogvideox.py -k test_error_when_disk_offload_run_together_with_group_offloading

FAILED tests/models/transformers/test_models_transformer_cogvideox.py::CogVideoX1_5TransformerTests::test_error_when_disk_offload_run_together_with_group_offloading - ValueError: Cannot apply group offloading to a module that is already applying an alternative offloading strategy from Accelerate. If you want to apply group offloading, please disable the existing offloading strategy first. Offending module: time_embedding.act (<class 'torch.nn.modules.activation.SiLU'>)

cc @SunMarc @DN6

SunMarc · 2025-02-19T14:50:19Z

The linked PR (#10832) should fix your issue! Feel free to merge this PR so that we don't run into the same issue again.

update

207fb07

SunMarc mentioned this pull request Feb 19, 2025

store activation cls instead of function #10832

Merged

SunMarc closed this in #10832 Feb 20, 2025
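
For readers following along: the thread does not spell out the root cause, but the error message (offending module time_embedding.act, a SiLU) and the title of #10832 ("store activation cls instead of function") suggest that a single activation object was being reused across model instances. The pure-Python sketch below only illustrates that pattern under this assumption. The `SiLU` stand-in, both registries, `Model`, and `attach_hook` are hypothetical names made up for the example; none of this is diffusers or Accelerate code.

```python
class SiLU:
    """Hypothetical stand-in for an activation module (not torch.nn.SiLU)."""

    def __init__(self):
        # Stand-in for an offloading hook attached to the module.
        self.hook = None


# Pattern A (assumed old behaviour): the registry stores ready-made instances,
# so every lookup hands back the very same object.
SHARED_INSTANCES = {"silu": SiLU()}

# Pattern B (what "store activation cls" suggests): the registry stores classes,
# so every lookup builds a fresh object.
ACTIVATION_CLASSES = {"silu": SiLU}


def get_activation_shared(name):
    return SHARED_INSTANCES[name]


def get_activation_fresh(name):
    return ACTIVATION_CLASSES[name]()


class Model:
    """Hypothetical model that owns a single activation submodule."""

    def __init__(self, get_activation):
        self.act = get_activation("silu")


def attach_hook(model):
    # Stand-in for the disk offload test attaching a hook to each submodule.
    model.act.hook = "disk_offload"


# Pattern A: the hook attached through the first model shows up on the second.
first = Model(get_activation_shared)
attach_hook(first)
second = Model(get_activation_shared)
leaked = second.act.hook is not None

# Pattern B: the second model gets its own activation, so nothing carries over.
first_fresh = Model(get_activation_fresh)
attach_hook(first_fresh)
second_fresh = Model(get_activation_fresh)
clean = second_fresh.act.hook is None

print(leaked, clean)
```

If this reading is right, the hooks are still attached per instance; they only appear to be class-level because the hooked instance is shared by every model built in the same process, which would also explain why only models using such a shared activation fail.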
