[CI] Slow Test Updates #8870

DN6 · 2024-07-16T08:24:48Z

What does this PR do?

We're experiencing some issues reading/writing to the mounted cache. In this PR we

Remove the use of the mounted cache in favour of using HF Transfer and downloading the models to the default cache inside the container for every job. This won't provide too much of a slow down on tests as we tend to use just a few models across multiple slow tests. e.g. Runway's SD 1.5 is used in almost all SD slow tests. So only a few downloads will happen per job. Additionally, reading/writing from the default cache inside the container is much faster that using the mounted cache. So we should see some speed ups in load times for pipelines.
Move all our slow tests with checkpoints to the nightly tests. We usually only consider the latest slow tests when identifying errors. Therefore we don't necessarily need to run checkpoint tests on every merge. It's also a bit more practical/actionable since we will get only a single set of notifications per day related to test failures.
Only run Fast/Fast GPU tests on merge. This will speed up the merge tests quite significantly.
Move log_reports.py script into the utils folder so it lives with our other CI utils.

Fixes # (issue)

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you read our philosophy doc (important for complex PRs)?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

DN6 · 2024-07-16T08:25:51Z

.github/workflows/nightly_tests.yml

      matrix:
-        module: [models, schedulers, others, examples]
+        max-parallel: 2
+        module: [models, schedulers, lora, others, single_file, examples]


Add Lora tests here.

DN6 · 2024-07-16T08:26:43Z

.github/workflows/nightly_tests.yml

        pip install slack_sdk tabulate
-        python scripts/log_reports.py >> $GITHUB_STEP_SUMMARY
-
-  run_lora_nightly_tests:


We run LoRA tests in the Nightly Torch CUDA Tests job since PEFT is a needed dependency for LoRA loading. We don't need a job dedicated PEFT job anymore. LoRA Tests == PEFT Tests basically.

DN6 · 2024-07-16T08:27:31Z

.github/workflows/push_tests.yml

        name: torch_cuda_test_reports
        path: reports

-  peft_cuda_tests:


Not needed as LoRA Tests require PEFT. We can just run the LoRA tests.

The LoRA tests are basically PEFT tests, no?

sayakpaul

Left a couple of suggestions. I am not sure if removing LoRA related tests from push_tests.yml is a good idea though.

sayakpaul · 2024-07-16T10:34:20Z

.github/workflows/nightly_tests.yml

          python -m uv pip install accelerate@git+https://github.com/huggingface/accelerate.git
          python -m uv pip install pytest-reportlog
-
+          python -m uv pip install hf_transfer


Let's have this installed in our Dockerfile installed.

LoRA tests still run here

diffusers/.github/workflows/push_tests.yml

Line 113 in 96b0e1d

module: [models, schedulers, lora, others, single_file]

I added installing PEFT from source as well.

sayakpaul · 2024-07-16T10:39:31Z

.github/workflows/push_tests.yml

        name: torch_cuda_test_reports
        path: reports

-  peft_cuda_tests:


The LoRA tests are basically PEFT tests, no?

DN6 · 2024-07-17T05:55:52Z

Not entirely sure what's happening with the tests here. They pass locally.

* update * update * update

update

96b0e1d

DN6 commented Jul 16, 2024

View reviewed changes

DN6 requested a review from sayakpaul July 16, 2024 08:28

sayakpaul reviewed Jul 16, 2024

View reviewed changes

DN6 added 3 commits July 16, 2024 12:12

update

f2519b0

update

f65d4c9

Merge branch 'main' into ci-updates

8587608

DN6 added 3 commits July 19, 2024 15:14

Merge branch 'main' into ci-updates

2fed921

Merge branch 'main' into ci-updates

24f8b96

Merge branch 'main' into ci-updates

7d3d999

DN6 merged commit 5fbb4d3 into main Jul 25, 2024

sayakpaul pushed a commit that referenced this pull request Dec 23, 2024

[CI] Slow Test Updates (#8870)

e42c333

* update * update * update

DN6 mentioned this pull request Jul 8, 2025

[ci] enable hotswapping tests on our nightly CI. #11826

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[CI] Slow Test Updates #8870

[CI] Slow Test Updates #8870

Uh oh!

DN6 commented Jul 16, 2024 •

edited

Loading

Uh oh!

DN6 Jul 16, 2024

Uh oh!

DN6 Jul 16, 2024 •

edited

Loading

Uh oh!

DN6 Jul 16, 2024

Uh oh!

sayakpaul Jul 16, 2024

Uh oh!

DN6 Jul 16, 2024

Uh oh!

sayakpaul left a comment

Uh oh!

sayakpaul Jul 16, 2024

Uh oh!

DN6 Jul 16, 2024

Uh oh!

DN6 Jul 16, 2024

Uh oh!

sayakpaul Jul 16, 2024

Uh oh!

DN6 commented Jul 17, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[CI] Slow Test Updates #8870

[CI] Slow Test Updates #8870

Uh oh!

Conversation

DN6 commented Jul 16, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Before submitting

Who can review?

Uh oh!

Choose a reason for hiding this comment

Uh oh!

DN6 Jul 16, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sayakpaul left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

DN6 commented Jul 17, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

DN6 commented Jul 16, 2024 •

edited

Loading

DN6 Jul 16, 2024 •

edited

Loading