Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Why use the intermediate estimated x_0|t as the distribution matching input, but not the final image of generator? #45

Open
vv12kant opened this issue Aug 30, 2024 · 2 comments

Comments

@vv12kant
Copy link

As far as I understand,the input of ditribution mathing should be the final output of generator, but under the circumstance of multi-step generator, we use the intermediate estimated x_0|t as the input,why?

@tianweiy
Copy link
Owner

tianweiy commented Aug 30, 2024

you are right. However, using the final output requires us to back propagate through the time (or step) which will adds more gpu memory consumption. It should be possible to do this with some system optimization but we didn't get chance to try because of this concern

@vv12kant
Copy link
Author

vv12kant commented Sep 4, 2024

Thanks. I think I understand something: the multi-step generator is like an LCM model. The difference is that LCM needs to align the final output x_0 of all steps on the ODE, while DMD needs to match the distribution to the pre-trained DM at all steps on the ODE.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants