CogView4 Control Block #10809
base: main
Conversation
…diffusers into cogview4_control
Thanks for the PR!
Do we already have a checkpoint for a CogView4 control LoRA, or is this mainly to support training?
@@ -35,7 +36,7 @@ class CogView4PatchEmbed(nn.Module):
     def __init__(
         self,
         in_channels: int = 16,
-        hidden_size: int = 2560,
+        hidden_size: int = 4096,
Is this not a breaking change?
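One way to avoid the break would be to keep the old default and have new configs pass the larger width explicitly. A minimal sketch of that idea (the module body here is illustrative, not the PR's actual code):

import torch.nn as nn

class CogView4PatchEmbed(nn.Module):
    def __init__(self, in_channels: int = 16, hidden_size: int = 2560):
        # Keep the old default so existing instantiations are unaffected.
        super().__init__()
        # Illustrative projection only; the real module body differs.
        self.proj = nn.Linear(in_channels * 2 * 2, hidden_size)

# Checkpoints that need the larger width opt in via their config:
embed = CogView4PatchEmbed(in_channels=16, hidden_size=4096)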
""" | ||
|
||
|
||
def calculate_shift( |
We can add a # Copied from comment here.
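For reference, diffusers marks duplicated helpers with a # Copied from annotation so tooling keeps them in sync. A sketch of what that could look like here, assuming the helper mirrors the Flux one (the source path and defaults are taken from Flux and may need adjusting):

# Copied from diffusers.pipelines.flux.pipeline_flux.calculate_shift
def calculate_shift(
    image_seq_len,
    base_seq_len: int = 256,
    max_seq_len: int = 4096,
    base_shift: float = 0.5,
    max_shift: float = 1.15,
):
    # Linearly interpolate the shift `mu` between base and max as the
    # image token sequence length grows.
    m = (max_shift - base_shift) / (max_seq_len - base_seq_len)
    b = base_shift - m * base_seq_len
    mu = image_seq_len * m + b
    return mu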
>>> import torch
>>> from diffusers import CogView4Pipeline

>>> pipe = CogView4Pipeline.from_pretrained("THUDM/CogView4-6B", torch_dtype=torch.bfloat16)
We need to update the pipeline in this example. Do we have a checkpoint?
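A sketch of the corrected docstring example, assuming the new class is named CogView4ControlPipeline and can load the base checkpoint (both are assumptions until a control checkpoint is published):

>>> import torch
>>> from diffusers import CogView4ControlPipeline  # assumed class name

>>> pipe = CogView4ControlPipeline.from_pretrained("THUDM/CogView4-6B", torch_dtype=torch.bfloat16)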
self.vae_scale_factor = 2 ** (len(self.vae.config.block_out_channels) - 1) if getattr(self, "vae", None) else 8
self.image_processor = VaeImageProcessor(vae_scale_factor=self.vae_scale_factor)

def _get_glm_embeds(
Add a # Copied from comment here as well.
prompt_embeds = prompt_embeds.view(batch_size * num_images_per_prompt, seq_len, -1)
return prompt_embeds

def encode_prompt(
Same here.
    if timesteps is None
    else np.array(timesteps)
)
timesteps = timesteps.astype(np.int64)
Suggested change:
- timesteps = timesteps.astype(np.int64)
+ timesteps = timesteps.astype(np.int64).astype(np.float32)
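A standalone sketch of what the suggestion does (the linspace construction is an assumption about the surrounding code): values are snapped to integer timesteps, then cast back to float32 because the downstream model consumes floating-point timesteps.

import numpy as np

num_inference_steps = 50
num_train_timesteps = 1000

# Assumed construction of a descending timestep schedule.
timesteps = np.linspace(num_train_timesteps, 1.0, num_inference_steps)

# Round to integer timesteps, then cast back to float32 so the
# transformer receives float inputs rather than int64.
timesteps = timesteps.astype(np.int64).astype(np.float32)
print(timesteps.dtype)  # float32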
self.scheduler.config.get("base_shift", 0.25), | ||
self.scheduler.config.get("max_shift", 0.75), | ||
) | ||
_, num_inference_steps = retrieve_timesteps(self.scheduler, num_inference_steps, device, sigmas=sigmas, mu=mu) |
Suggested change:
- _, num_inference_steps = retrieve_timesteps(self.scheduler, num_inference_steps, device, sigmas=sigmas, mu=mu)
+ timesteps, num_inference_steps = retrieve_timesteps(self.scheduler, num_inference_steps, device, timesteps, sigmas=sigmas, mu=mu)
We updated our scheduler to work with CogView4 - is there any reason we still cannot use scheduler.set_timesteps to set the timesteps?
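A sketch of what that would look like, assuming the scheduler is a FlowMatchEulerDiscreteScheduler with dynamic shifting enabled (the sigma schedule and mu value below are illustrative):

import numpy as np
from diffusers import FlowMatchEulerDiscreteScheduler

# Dynamic shifting must be enabled for `mu` to take effect.
scheduler = FlowMatchEulerDiscreteScheduler(use_dynamic_shifting=True)

num_inference_steps = 50
sigmas = np.linspace(1.0, 1.0 / num_inference_steps, num_inference_steps)
mu = 0.5  # illustrative; normally computed from the image sequence length

scheduler.set_timesteps(num_inference_steps, device="cpu", sigmas=sigmas, mu=mu)
timesteps = scheduler.timesteps  # already a torch.Tensor on `device`; no torch.from_numpy needed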
self.scheduler.config.get("max_shift", 0.75), | ||
) | ||
_, num_inference_steps = retrieve_timesteps(self.scheduler, num_inference_steps, device, sigmas=sigmas, mu=mu) | ||
timesteps = torch.from_numpy(timesteps).to(device) |
Suggested change (remove this line):
- timesteps = torch.from_numpy(timesteps).to(device)
What does this pull request do?
This PR adds a Control module to CogView4, following the Flux Control implementation.
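For context, a hypothetical usage sketch mirroring how the Flux control pipeline is used; the class name CogView4ControlPipeline, the control_image argument, and reuse of the base checkpoint are all assumptions:

import torch
from diffusers import CogView4ControlPipeline  # assumed class name
from diffusers.utils import load_image

pipe = CogView4ControlPipeline.from_pretrained("THUDM/CogView4-6B", torch_dtype=torch.bfloat16)
pipe.to("cuda")

# Conditioning image, e.g. a Canny edge map (path is illustrative).
control_image = load_image("canny_edge_map.png")

image = pipe(
    prompt="a photo of a cat on a windowsill",
    control_image=control_image,
    num_inference_steps=50,
).images[0]
image.save("cogview4_control_output.png")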
Who can review?
@arrow