    Repositories list

    • (CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision
      Jupyter Notebook
      Apache License 2.0
      311100 Updated Nov 16, 2024
    • LETR

      Public
      (CVPR 2021 Oral) LETR: Line Segment Detection Using Transformers without Edges
      Jupyter Notebook
      Apache License 2.0
      40210193 Updated Jul 5, 2024
    • Patch-DM

      Public
      Code Release for Patch-DM (ICLR 2024)
      Python
      13630 Updated May 17, 2024
    • BDM

      Public
      (CVPR 2024) Bayesian Diffusion Models for 3D Shape Reconstruction
      Python
      32400 Updated May 6, 2024
    • BLIVA

      Public
      (AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions
      Python
      BSD 3-Clause "New" or "Revised" License
      28269190 Updated Apr 14, 2024
    • Uni-3D

      Public
      (ICCV 2023) Uni-3D: A Universal Model for Panoptic 3D Scene Reconstruction
      Python
      Apache License 2.0
      02300 Updated Feb 23, 2024
    • MaskCLIP

      Public
      Code Release for MaskCLIP (ICML 2023)
      Python
      Other
      35860 Updated Nov 29, 2023
    • MasQCLIP

      Public
      (ICCV 2023) MasQCLIP for Open-Vocabulary Universal Image Segmentation
      Python
      Other
      23440 Updated Oct 18, 2023
    • XTRA

      Public
      On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning
      Python
      Apache License 2.0
      01710 Updated Apr 30, 2023
    • TESTR

      Public
      (CVPR 2022) Text Spotting Transformers
      Python
      Apache License 2.0
      2217990 Updated Jan 30, 2023
    • DC-VAE

      Public
      (CVPR 2021) DC-VAE: Dual Contradistinctive Generative Autoencoder
      Python
      Apache License 2.0
      63621 Updated Jan 3, 2023
    • (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.
      Python
      32100 Updated Jul 13, 2022
    • Code for CVPR 2022 paper: Instance Segmentation with Mask-supervised Polygonal Boundary Transformers
      Python
      Apache License 2.0
      99840 Updated Jul 10, 2022
    • ViTGAN

      Public
      Python
      MIT License
      95490 Updated Jun 8, 2022
    • (ICLR 2021) ConstellationNet: Attentional Constellation Nets for Few-Shot Learning
      Python
      Apache License 2.0
      81400 Updated Apr 4, 2022
    • CoaT

      Public
      (ICCV 2021 Oral) CoaT: Co-Scale Conv-Attentional Image Transformers
      Jupyter Notebook
      Apache License 2.0
      3122830 Updated Feb 3, 2022
    • (CVPR 2020) Guided-VAE: Guided Variational Autoencoder for Disentanglement Learning
      Python
      Apache License 2.0
      52610 Updated Sep 17, 2021
    • PRTR

      Public
      (CVPR 2021) PRTR: Pose Recognition with Cascade Transformers
      Jupyter Notebook
      Apache License 2.0
      2914140 Updated Jun 21, 2021