Add new pretrained weights #1043
Comments
1 M models were implemented
From there we are missing
Regarding these weights, there are three things I am running into:
It feels like the above two issues could happen again down the line with other pretrained weights as well, if their license doesn't allow us to upload them to Hugging Face in the format we prefer.
For 1 and 2, my preference would be to modify and save only the backbone on the CPU so that users or Lightning can map it to the GPU themselves. But of course that requires a favorable license. I would start by inquiring about licenses. If there's an issue with the license, we can suggest changing it and describe the use case we have in mind. If they don't respond or won't change, then and only then can we think about more complex code in TorchGeo or changes to torchvision.
Yep, I think re-saving the weights is the way to go.
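For reference, the re-save approach discussed here could look roughly like the following. This is only a minimal sketch under stated assumptions, not what TorchGeo actually does: the `backbone.` key prefix, the `state_dict` checkpoint key, the `Wrapper` module, and the `extract_backbone_state_dict` helper are all hypothetical stand-ins, and real checkpoints will use different layouts.

```python
import io
from collections import OrderedDict

import torch
from torch import nn


def extract_backbone_state_dict(checkpoint, prefix="backbone."):
    """Keep only keys under `prefix`, strip the prefix, and move tensors to CPU.

    Hypothetical helper: the prefix and the "state_dict" key are assumptions
    about how a published checkpoint might be organized.
    """
    state_dict = checkpoint.get("state_dict", checkpoint)
    backbone = OrderedDict()
    for key, value in state_dict.items():
        if key.startswith(prefix):
            backbone[key[len(prefix):]] = value.detach().cpu()
    return backbone


class Wrapper(nn.Module):
    """Hypothetical stand-in for a released model: a backbone plus a task head."""

    def __init__(self):
        super().__init__()
        self.backbone = nn.Linear(4, 2)
        self.head = nn.Linear(2, 1)


# Pretend this is the original checkpoint with extra training metadata.
full_checkpoint = {"state_dict": Wrapper().state_dict(), "epoch": 10}
backbone_weights = extract_backbone_state_dict(full_checkpoint)

# Re-save only the backbone tensors (an in-memory buffer stands in for a file).
buffer = io.BytesIO()
torch.save(backbone_weights, buffer)
buffer.seek(0)

# A plain dict of CPU tensors loads without unpickling arbitrary objects;
# users (or Lightning) can then move the model to whatever device they want.
restored = torch.load(buffer, map_location="cpu", weights_only=True)
model = nn.Linear(4, 2)
model.load_state_dict(restored)
```

Whether redistributing a re-saved file like this is allowed still depends on the license of the original weights.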
The list to end all lists: https://github.com/Jack-bo1220/Awesome-Remote-Sensing-Foundation-Models
I will work on adding bigger ViTs from SSL4EO-S12, FG-MAE, DeCUR, and SoftCon over the next few days.
Summary
This issue is to track progress on implementing new pretrained weights from related literature into torchgeo:
and many, many more:
Rationale
Foundation models (FMs) are one of the most substantial developments in recent ML research. FMs pretrained on ImageNet are a core component of torchvision and transformers, and a large part of what makes those libraries so popular. TorchGeo serves as a collection of Earth observation (EO) FMs, allowing researchers to quickly and easily experiment with and design new FMs. This is critical for applying FMs to transfer learning on downstream tasks with small labeled datasets.
Implementation
See #2057, #1903, #1884, etc. for recent PRs adding new FMs.
If you would like to volunteer to add a particular FM, please comment on this issue to say that you're working on this.
Not sure where to get started? FMs that can be considered "multi-modal" (the same set of pre-trained weights can dynamically handle imagery from many different sensors) are the highest priority!