flux-dev Double RAM usage on Apple Silicon #1220

d-z-m · 2024-11-17T16:55:36Z

Problem

I'm trying to generate images with the flux-dev fp16 safetensors and am running into unexpectedly high memory usage. From what I understand, flux-dev should be able to run in under 24 GB of VRAM; instead I'm seeing close to 60 GB allocated (before any image generation takes place) on my 36 GB M3 MacBook with unified memory (24 GB or so of that in swap). I could have some bad assumptions about RAM requirements, though. Also, I'm running Kobold with no GGUF model for text completion, only flux.

loading tensors from /Users/username/flux/clip_l.safetensors
loading tensors from /Users/username/flux/t5xxl_fp16.safetensors
unknown tensor 'text_encoders.t5xxl.transformer.encoder.embed_tokens.weight | f16 | 2 [4096, 32128, 1, 1, 1]' in model file
loading tensors from /Users/username/flux/ae.safetensors
loading tensors from /Users/username/flux/flux1-dev.safetensors
total params memory size = 54879.10MB (VRAM 45560.27MB, RAM 9318.83MB): clip 9318.83MB(RAM), unet 45400.27MB(VRAM), vae 160.00MB(VRAM), controlnet 0.00MB(VRAM), pmid 0.00MB(RAM)
loading model from '/Users/username/flux/flux1-dev.safetensors' completed, taking 16.25s
running in Flux FLOW mode
finished loaded file
Load Image Model OK: True
Embedded KoboldAI Lite loaded.
Embedded API docs loaded.
Embedded SDUI loaded.

Perhaps this has something to do with the way stable-diffusion.cpp handles model initialization?


stduhpf · 2024-11-17T20:30:08Z

The original flux-dev weights are bf16, not fp16. sdcpp does not support bf16, so the weights are converted to fp32 (a lossless conversion), which takes twice as much memory.

https://github.com/leejet/stable-diffusion.cpp/blob/ac54e0076052a196b7df961eb1f792c9ff4d7f22/model.cpp#L1626C21-L1629C22
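
To illustrate the point above, here is a minimal Python sketch, not the sdcpp code itself. It relies on the fact that a bf16 value is the upper 16 bits of an fp32 value, so widening is a 16-bit left shift and loses nothing. It also redoes the memory arithmetic from the log. The helper names are hypothetical, and the size calculation assumes "MB" in the log means 1024 * 1024 bytes.

```python
import struct


def bf16_bits_to_f32(bits: int) -> float:
    """Widen a 16-bit bf16 pattern to fp32 by placing it in the upper 16 bits."""
    return struct.unpack("<f", struct.pack("<I", (bits & 0xFFFF) << 16))[0]


def f32_to_bf16_bits(value: float) -> int:
    """Narrow an fp32 value back to bf16 by dropping the lower 16 bits (truncation)."""
    return struct.unpack("<I", struct.pack("<f", value))[0] >> 16


def param_memory_mb(n_params: float, bytes_per_param: int) -> float:
    """Size of n_params weights in MB (assumption: 1 MB = 1024 * 1024 bytes)."""
    return n_params * bytes_per_param / (1024 * 1024)


# "unet 45400.27MB(VRAM)" from the log above, stored as fp32 (4 bytes per weight).
UNET_FP32_MB = 45400.27
implied_params = UNET_FP32_MB * 1024 * 1024 / 4

if __name__ == "__main__":
    print(f"implied unet parameter count: {implied_params / 1e9:.1f} billion")
    print(f"unet at 2 bytes per weight: {param_memory_mb(implied_params, 2):.2f} MB")
    print(f"unet at 4 bytes per weight: {param_memory_mb(implied_params, 4):.2f} MB")
```

Under that assumption, the 45400.27 MB reported for the unet corresponds to about half that size at 2 bytes per weight, which is in line with the "under 24 GB" expectation in the original post.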
