Why use the intermediate estimated x_0|t as the distribution matching input， but not the final image of generator? #45

vv12kant · 2024-08-30T02:48:45Z

As far as I understand，the input of ditribution mathing should be the final output of generator, but under the circumstance of multi-step generator, we use the intermediate estimated x_0|t as the input，why?

tianweiy · 2024-08-30T04:22:30Z

you are right. However, using the final output requires us to back propagate through the time (or step) which will adds more gpu memory consumption. It should be possible to do this with some system optimization but we didn't get chance to try because of this concern

vv12kant · 2024-09-04T02:52:15Z

Thanks. I think I understand something: the multi-step generator is like an LCM model. The difference is that LCM needs to align the final output x_0 of all steps on the ODE, while DMD needs to match the distribution to the pre-trained DM at all steps on the ODE.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why use the intermediate estimated x_0|t as the distribution matching input， but not the final image of generator? #45

Why use the intermediate estimated x_0|t as the distribution matching input， but not the final image of generator? #45

vv12kant commented Aug 30, 2024

tianweiy commented Aug 30, 2024 •

edited

Loading

vv12kant commented Sep 4, 2024 •

edited

Loading

Why use the intermediate estimated x_0|t as the distribution matching input， but not the final image of generator? #45

Why use the intermediate estimated x_0|t as the distribution matching input， but not the final image of generator? #45

Comments

vv12kant commented Aug 30, 2024

tianweiy commented Aug 30, 2024 • edited Loading

vv12kant commented Sep 4, 2024 • edited Loading

tianweiy commented Aug 30, 2024 •

edited

Loading

vv12kant commented Sep 4, 2024 •

edited

Loading