Impact of batch size on training performance #55

Open
XingtongGe opened this issue Dec 18, 2024 · 1 comment

XingtongGe commented Dec 18, 2024

Hi, I would like to know how much the batch size affects training performance. For example, is there a significant performance gap between training DMD2-SDXL with 8 GPUs (total batch size of 16) and training with 64 GPUs (total batch size of 128)? Thanks!

tzhu01 commented Jan 16, 2025

I followed the author's README and trained on 8 GPUs (A100, 80 GB), and the results were almost identical to the numbers reported in the paper.
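
For readers with fewer GPUs who still want to approximate the larger total batch asked about above, a common generic workaround (not something this thread or the DMD2 README prescribes) is gradient accumulation: run several micro-batches per optimizer step so the effective batch matches the multi-node setup. Below is a minimal PyTorch sketch using the numbers from the question (per-GPU batch 2, so 8 GPUs give a total of 16 and 64 GPUs give 128); the toy linear model and all variable names are illustrative stand-ins, not the actual DMD2-SDXL training code:

```python
import torch
from torch import nn

# Numbers from the question: 8 GPUs at per-GPU batch 2 -> total batch 16;
# 64 GPUs -> total batch 128. To emulate the larger total batch on the
# smaller setup, accumulate gradients over several micro-batches.
per_gpu_batch = 2
num_gpus = 8                      # assumed single 8-GPU node
target_total_batch = 128          # the 64-GPU total from the question
accum_steps = target_total_batch // (per_gpu_batch * num_gpus)  # -> 8

model = nn.Linear(16, 1)          # toy stand-in for the real network
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
loss_fn = nn.MSELoss()

for step in range(32):
    x = torch.randn(per_gpu_batch, 16)   # synthetic data for the sketch
    y = torch.randn(per_gpu_batch, 1)
    # Scale the loss so the accumulated gradient equals the gradient of
    # one large batch of size target_total_batch.
    loss = loss_fn(model(x), y) / accum_steps
    loss.backward()
    if (step + 1) % accum_steps == 0:
        optimizer.step()
        optimizer.zero_grad()
```

Note that this matches the gradient of the large batch for per-sample losses like the one above, but it changes throughput and is not exactly equivalent for batch-dependent components (e.g. batch norm or GAN discriminator statistics), so some gap versus a true 64-GPU run may remain.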
