
Issue merging a LoRA model into a SANA transformer #2318

frutiemax92 opened this issue Jan 10, 2025 · 6 comments
Comments


frutiemax92 commented Jan 10, 2025

System Info

peft=0.14.0

Who can help?

@BenjaminBossan @sayakpaul

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder
  • My own task or dataset (give details below)

Reproduction

from diffusers import SanaPipeline, SanaPAGPipeline, SanaTransformer2DModel
from peft import PeftModel

# Load the fine-tuned SANA transformer and the attached rank-4 LoRA adapter
# (the '0' directory extracted from 0.zip below)
transformer = SanaTransformer2DModel.from_pretrained("frutiemax/twistedreality-sana-1600m-1024px")
print(transformer)
peft_model = PeftModel.from_pretrained(transformer, '0')
model = peft_model.merge_and_unload()  # raises a shape mismatch error

Expected behavior

I've trained a LoRA model with PEFT on a SANA checkpoint. I can train and run inference with the PEFT model. However, when I try to merge the LoRA into the base checkpoint, I encounter a shape mismatch. I've attached the LoRA model with rank 4.

[Screenshot of the shape-mismatch traceback]

Attachment: 0.zip (the rank-4 LoRA adapter)

@sayakpaul (Member) commented:

Please make the code fully reproducible.

@frutiemax92 (Author) commented:

Updated the code with missing imports.

@BenjaminBossan (Member) commented:

Could you please show how you initialized the model (training code not necessary) and how you saved it? Also, do you know whether frutiemax/twistedreality-sana-1600m-1024px corresponds to the official Sana model? What is different?

@frutiemax92 (Author) commented:

I've investigated this a bit more and found that only the target module 'conv_depth' makes it crash; PEFT with the other modules works fine.

from diffusers import SanaTransformer2DModel
from peft import LoraConfig, get_peft_model

# Load the official Sana transformer and attach a rank-4 LoRA on conv_depth only
transformer = SanaTransformer2DModel.from_pretrained("Efficient-Large-Model/Sana_1600M_1024px_diffusers", subfolder='transformer')
lora_config = LoraConfig(r=4, target_modules=['conv_depth'], lora_alpha=4)
model = get_peft_model(transformer, lora_config)
model = model.merge_and_unload()  # the shape mismatch is raised here
model.save_pretrained('merged_model')

@frutiemax92 (Author) commented:

I also want to add that calling the forward function with LoKr and LoHa also crashes on the conv_depth module, but not with the regular LoRA algorithm.
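
For reference, a minimal sketch of how the LoKr/LoHa variants might be set up; only the conv_depth target and rank come from the experiments above, the alpha value is a placeholder:

from diffusers import SanaTransformer2DModel
from peft import LoHaConfig, LoKrConfig, get_peft_model

transformer = SanaTransformer2DModel.from_pretrained(
    "Efficient-Large-Model/Sana_1600M_1024px_diffusers", subfolder='transformer')

# Swap LoHaConfig for LoKrConfig to test the other variant
config = LoHaConfig(r=4, alpha=4, target_modules=['conv_depth'])
model = get_peft_model(transformer, config)
# Running a forward pass through this model (e.g. via the Sana pipeline)
# then crashes on the conv_depth module, as described above.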

@BenjaminBossan (Member) commented:

Thanks for the snippet. The LoRA adapter does not have the right shape because we're not honoring the groups argument of this Conv2d layer. Usually it's 1, so it doesn't matter, but here it's 11200. This issue was already reported in #2153 but is still awaiting a PR.

> I also want to add that calling the forward function with LoKr and LoHa also crashes on the conv_depth module, but not with the regular LoRA algorithm.

Most likely, the same issue is the cause here. I'm not sure why forward works for LoRA but I would not rely on the result being correct.
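
To illustrate the shape problem, here is a minimal, hypothetical sketch (not PEFT's actual merge code): for a grouped nn.Conv2d the weight has shape (out_channels, in_channels // groups, kH, kW), so a LoRA delta computed as if groups were 1 cannot be added onto the base weight. Small channel counts are used here instead of Sana's 11200.

import torch
import torch.nn as nn

in_channels = out_channels = groups = 8  # stand-in for conv_depth's 11200
conv = nn.Conv2d(in_channels, out_channels, kernel_size=3, groups=groups, padding=1)
print(conv.weight.shape)  # torch.Size([8, 1, 3, 3])

# A rank-4 delta built without honoring groups has shape (out, in, 3, 3) ...
r = 4
lora_A = torch.randn(r, in_channels * 3 * 3)
lora_B = torch.randn(out_channels, r)
delta = (lora_B @ lora_A).view(out_channels, in_channels, 3, 3)
print(delta.shape)  # torch.Size([8, 8, 3, 3])

# ... so merging fails because (8, 8, 3, 3) cannot be added in place to (8, 1, 3, 3)
try:
    conv.weight.data += delta
except RuntimeError as e:
    print(e)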
