-
Notifications
You must be signed in to change notification settings - Fork 65
Pull requests: intel/xFasterTransformer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Bump transformers from 4.40.0 to 4.48.0
dependencies
Pull requests that update a dependency file
#490
opened Feb 11, 2025 by
dependabot
bot
Loading…
Add env param KV_CACHE_LOCATION to control kv cache memory numanode location
#462
opened Jun 28, 2024 by
a3213105
Loading…
[Layers] Increased the threshold for enabling flashAttn
performance
performance related.
#428
opened Jun 3, 2024 by
abenmao
Loading…
[Kernel] Add dynamic onednn matmul.
performance
performance related.
#425
opened May 28, 2024 by
changqi1
Loading…
[Eval] Add eval test with opencompass.
benchmark
performance or accuracy benchmark
enhancement
New feature or request
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.