[quant] QoL improvements for pipeline-level quant config #11876

sayakpaul · 2025-07-07T06:34:58Z

What does this PR do?

Shifts PipelineQuantizationConfig to its module to keep the src/diffusers/quantizers/__init__.py clean.
Adds a __repr__ method to PipelineQuantizationConfig so that users can investigate it in between their workflows.
Sets the quantization_config attribute of a pipeline if it's not None.

I tested with the following script and got the quantization config printed nicely as below:

text_encoder_2 BitsAndBytesConfig {
  "_load_in_4bit": true,
  "_load_in_8bit": false,
  "bnb_4bit_compute_dtype": "bfloat16",
  "bnb_4bit_quant_storage": "uint8",
  "bnb_4bit_quant_type": "nf4",
  "bnb_4bit_use_double_quant": false,
  "llm_int8_enable_fp32_cpu_offload": false,
  "llm_int8_has_fp16_weight": false,
  "llm_int8_skip_modules": null,
  "llm_int8_threshold": 6.0,
  "load_in_4bit": true,
  "load_in_8bit": false,
  "quant_method": "bitsandbytes"
}
transformer BitsAndBytesConfig {
  "_load_in_4bit": true,
  "_load_in_8bit": false,
  "bnb_4bit_compute_dtype": "bfloat16",
  "bnb_4bit_quant_storage": "uint8",
  "bnb_4bit_quant_type": "nf4",
  "bnb_4bit_use_double_quant": false,
  "llm_int8_enable_fp32_cpu_offload": false,
  "llm_int8_has_fp16_weight": false,
  "llm_int8_skip_modules": null,
  "llm_int8_threshold": 6.0,
  "load_in_4bit": true,
  "load_in_8bit": false,
  "quant_method": "bitsandbytes"
}

Code

from diffusers import PipelineQuantizationConfig, DiffusionPipeline
import torch

components_to_quantize = ["transformer", "text_encoder_2"]
quant_config = PipelineQuantizationConfig(
    quant_backend="bitsandbytes_4bit",
    quant_kwargs={
        "load_in_4bit": True,
        "bnb_4bit_quant_type": "nf4",
        "bnb_4bit_compute_dtype": torch.bfloat16,
    },
    components_to_quantize=components_to_quantize,
)
pipe = DiffusionPipeline.from_pretrained(
    "hf-internal-testing/tiny-flux-pipe",
    quantization_config=quant_config,
    torch_dtype=torch.bfloat16,
).to("cuda")
print(pipe.quantization_config)

HuggingFaceDocBuilderDev · 2025-07-07T06:44:29Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

sayakpaul · 2025-07-07T08:54:30Z

src/diffusers/__init__.py

    "loaders": ["FromOriginalModelMixin"],
    "models": [],
    "pipelines": [],
+    "quantizers.pipe_quant_config": ["PipelineQuantizationConfig"],


So that we can do from diffusers import PipelineQuantizationConfig.

SunMarc

Thanks for this !

yiyixuxu

thanks!

sayakpaul · 2025-07-10T03:22:36Z

Failing tests are unrelated.

sayakpaul added 2 commits July 7, 2025 11:10

add repr for pipelinequantconfig.

f2f5c68

update

d954644

Merge branch 'main' into pipe-quant-config-repr

e6b5433

sayakpaul requested review from SunMarc and yiyixuxu July 7, 2025 08:53

sayakpaul commented Jul 7, 2025

View reviewed changes

SunMarc approved these changes Jul 7, 2025

View reviewed changes

sayakpaul added 2 commits July 7, 2025 22:33

Merge branch 'main' into pipe-quant-config-repr

4e62432

Merge branch 'main' into pipe-quant-config-repr

755ae77

yiyixuxu approved these changes Jul 9, 2025

View reviewed changes

Merge branch 'main' into pipe-quant-config-repr

ce54f4d

sayakpaul merged commit b41abb2 into main Jul 10, 2025
30 of 32 checks passed

sayakpaul deleted the pipe-quant-config-repr branch July 10, 2025 03:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[quant] QoL improvements for pipeline-level quant config #11876

[quant] QoL improvements for pipeline-level quant config #11876

Uh oh!

sayakpaul commented Jul 7, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Jul 7, 2025

Uh oh!

sayakpaul Jul 7, 2025

Uh oh!

SunMarc left a comment

Uh oh!

yiyixuxu left a comment

Uh oh!

sayakpaul commented Jul 10, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

[quant] QoL improvements for pipeline-level quant config #11876

[quant] QoL improvements for pipeline-level quant config #11876

Uh oh!

Conversation

sayakpaul commented Jul 7, 2025

What does this PR do?

Uh oh!

HuggingFaceDocBuilderDev commented Jul 7, 2025

Uh oh!

sayakpaul Jul 7, 2025

Choose a reason for hiding this comment

Uh oh!

SunMarc left a comment

Choose a reason for hiding this comment

Uh oh!

yiyixuxu left a comment

Choose a reason for hiding this comment

Uh oh!

sayakpaul commented Jul 10, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants