feat: refine LoRA diffusers to flux conversion logic #7708
base: main
Conversation
Thanks for digging into this!
To get it merged, we'll need to:
- Fix the shift/scale transformation.
- Add a unit test in test_flux_diffusers_lora_conversion_utils.py for this new LoRA format. See the other tests in that file for reference.
for _key in values.keys():
    # In the original SD3 implementation of AdaLayerNormContinuous, the linear projection output is split into (shift, scale),
    # while in diffusers it is split into (scale, shift). Swap the linear projection weights here so the diffusers
    # implementation can be used.
    scale, shift = values[_key].chunk(2, dim=0)
    values[_key] = torch.cat([shift, scale], dim=0)
This doesn't look right to me. If I'm understanding correctly, in the case of a vanilla LoRA layer, we should only be flipping one of the LoRA components.
The required transformation would be a bit more involved for other LoRA variants (LoHA, LoKR, etc.), so I'm fine with only supporting vanilla LoRAs. But we should assert that the result of any_lora_layer_from_state_dict() is a LoRALayer.
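For illustration, here is a standalone check (plain PyTorch with made-up shapes, not code from this PR) of why flipping only the up/"B" matrix is enough for a vanilla LoRA: the shift/scale swap is a row permutation of delta_W = up @ down, so it can be folded entirely into the up matrix.

```python
import torch

# Illustrative sizes only; real AdaLN projections are much larger.
out_features, rank, in_features = 6, 2, 4
up = torch.randn(out_features, rank)    # lora_up / "B"
down = torch.randn(rank, in_features)   # lora_down / "A"


def swap_shift_scale(t: torch.Tensor) -> torch.Tensor:
    scale, shift = t.chunk(2, dim=0)
    return torch.cat([shift, scale], dim=0)


delta_w = up @ down

# Swapping the halves of the full delta is equivalent to swapping the halves
# of the `up` matrix alone, because the swap only permutes output rows.
assert torch.allclose(swap_shift_scale(delta_w), swap_shift_scale(up) @ down)
```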
Hi @RyanJDick, thanks for spending time 👯
I have to confess it is more complex than I expected; sorry for not asking the team beforehand.
As I understand it:
# for normal LoRA layer
delta_W = up @ down
W = W + delta_W

# for AdaLN in diffusers
W_prime = swap_shift_scale(W)
delta_W_prime = swap_shift_scale(delta_W)

# => We may need to add a custom LoRA layer to swap them in `get_weight`
class AdaLN_LoRALayer(LoRALayer):
    def get_weight(self, orig_weight: torch.Tensor) -> torch.Tensor:
        '''swap shift and scale before returning real weight'''
        weight = super().get_weight(orig_weight)
        scale, shift = weight.chunk(2, dim=0)
        return torch.cat([shift, scale], dim=0)

# we need to build and return this layer in our function
What do you think?
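A small self-contained sanity check of that reasoning (plain PyTorch, illustrative shapes; not code from this PR): the swap is a fixed row permutation, so it distributes over addition. Adding the swapped delta to the flux-ordered base weight therefore gives the same result as patching the diffusers-ordered weight and swapping once at the end.

```python
import torch


def swap_shift_scale(t: torch.Tensor) -> torch.Tensor:
    scale, shift = t.chunk(2, dim=0)
    return torch.cat([shift, scale], dim=0)


W = torch.randn(6, 4)        # base AdaLN projection weight (illustrative size)
delta_W = torch.randn(6, 4)  # merged LoRA delta, i.e. up @ down

# swap(W) + swap(delta_W) == swap(W + delta_W): swapping the delta inside
# get_weight() composes correctly with a base weight that is already swapped.
assert torch.allclose(
    swap_shift_scale(W) + swap_shift_scale(delta_W),
    swap_shift_scale(W + delta_W),
)
```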
Summary
This PR updates the LoRA Diffusers -> Flux conversion logic based on its original source (a rough sketch of the key handling follows the list):
- guidance_in layer keys, as in: https://github.com/huggingface/diffusers/blob/55ac421f7bb12fd00ccbef727be4dc2f3f920abb/scripts/convert_flux_to_diffusers.py#L103-L115
- norm_out with shift/scale swapping, as in: https://github.com/huggingface/diffusers/blob/55ac421f7bb12fd00ccbef727be4dc2f3f920abb/scripts/convert_flux_to_diffusers.py#L263-L268
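For reference, a rough sketch of that key handling (this is not the PR's actual code; the layer names below are read off the linked convert_flux_to_diffusers.py script, and the exact prefixes in real LoRA state dicts may differ):

```python
import torch

# Assumed diffusers -> flux layer-name mapping for the newly handled keys
# (inverse of the linked flux -> diffusers conversion script).
DIFFUSERS_TO_FLUX_KEYS = {
    "time_text_embed.guidance_embedder.linear_1": "guidance_in.in_layer",
    "time_text_embed.guidance_embedder.linear_2": "guidance_in.out_layer",
    # norm_out additionally needs the shift/scale swap below
    "norm_out.linear": "final_layer.adaLN_modulation.1",
}


def swap_shift_scale(weight: torch.Tensor) -> torch.Tensor:
    # diffusers orders the AdaLN projection output as (scale, shift);
    # the original flux implementation expects (shift, scale)
    scale, shift = weight.chunk(2, dim=0)
    return torch.cat([shift, scale], dim=0)
```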
Related Issues / Discussions
I couldn't load Hyper-FLUX.1-dev-Nsteps-lora.safetensors from https://huggingface.co/ByteDance/Hyper-SD via InvokeUI.
QA Instructions
Search for ByteDance/Hyper-SD in the search box.
Merge Plan
Just apply the change; it's a small one.
Checklist
- What's New copy (if doing a release after this PR)