SHI Labs

Neighborhood-Attention-Transformer Public

Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022

Python 1.1k 86

Versatile-Diffusion Public

Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023

Python 1.3k 85

OneFormer Public

[CVPR 2023] OneFormer: One Transformer to Rule Universal Image Segmentation

Jupyter Notebook 1.6k 136

Prompt-Free-Diffusion Public

Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024

Python 745 37

Smooth-Diffusion Public

Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models arXiv 2023 / CVPR 2024

Python 330 9

VCoder Public

[CVPR 2024] VCoder: Versatile Vision Encoders for Multimodal Large Language Models

Python 272 17

Provide feedback