Are you training the CLIP model from scratch, or are you using the pretrained weights? #39

Open
Maram-Helmy opened this issue Feb 20, 2024 · 1 comment

Comments

@Maram-Helmy

I'm really confused: in the code you initialize the layers from a normal distribution, but what I understood from the paper is that you are using the CLIP model.

Your answer will really help me understand.

Thanks

@hoahoa1808

As I read the code, the CORE_MODEl can be split into 2 parts:

  • the backbone: downloads and loads the pretrained CLIP model from OpenAI (see model/build.py, lines 17/18)
  • the heads: the MLM head + cross_modal_transformer and the classification head are initialized from a normal distribution and trained from scratch.
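
To make the split concrete, here is a toy, framework-free sketch of the idea. It is not the repository's code: `build_model`, `init_normal`, the fake checkpoint, the head shapes, and `std=0.02` are all hypothetical stand-ins chosen for illustration. It only shows that backbone parameters are copied from an existing checkpoint while head parameters are freshly sampled from a normal distribution.

```python
import random


def init_normal(shape, std, rng):
    """Return a flat list of prod(shape) values drawn from N(0, std^2)."""
    count = 1
    for dim in shape:
        count *= dim
    return [rng.gauss(0.0, std) for _ in range(count)]


def build_model(pretrained_backbone, head_shapes, std=0.02, seed=0):
    """Combine pretrained backbone weights with freshly initialized heads.

    std=0.02 is an assumed value for illustration, not taken from the repo.
    """
    rng = random.Random(seed)
    params = {}
    # Backbone: weights are copied as-is from the (pretrained) checkpoint.
    for name, weights in pretrained_backbone.items():
        params["backbone." + name] = list(weights)
    # Heads: no checkpoint exists for them, so they start from random values.
    for name, shape in head_shapes.items():
        params["heads." + name] = init_normal(shape, std, rng)
    return params


# Hypothetical stand-in for a downloaded CLIP checkpoint.
fake_clip_checkpoint = {
    "visual.proj": [0.5, -0.25, 0.125],
    "text.proj": [1.0, 2.0],
}
# Hypothetical head shapes; the real ones depend on the model config.
head_shapes = {
    "mlm_head": (4, 3),
    "cross_modal_transformer": (2, 2),
    "classifier": (5,),
}

model = build_model(fake_clip_checkpoint, head_shapes)
for name, values in model.items():
    print(name, len(values))
```

So both things in the question are true at once: the normal-distribution initialization applies only to the new heads, while the backbone starts from the pretrained CLIP weights.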
