# token-pruning

Here are 4 public repositories matching this topic...

microsoft / Moonlit

This is a collection of our research on efficient AI, covering hardware-aware NAS and model compression.

model-compression neural-architecture-search inference-efficiency token-pruning

Updated Oct 25, 2024
Python

mlvlab / vid-TLDR

Official implementation of CVPR 2024 paper "vid-TLDR: Training Free Token merging for Light-weight Video Transformer".

computer-vision video-transformer token-pruning efficient-vision-transformers cvpr2024 token-merging

Updated May 7, 2024
Python

Adam-Mazur / Lazy-Llama

An implementation of LazyLLM token pruning for the LLaMa 2 model family.

transformers llama huggingface huggingface-transformers token-pruning llama2

Updated Sep 22, 2024
Python

Jungmin-YUN-0 / Attention_Lightweight

lightweight pytorch transformer classification self-attention token-pruning

Updated Jul 24, 2023
Python
