ModelTC
https://light-ai.top/
Pinned Loading
Repositories
Showing 10 of 65 repositories
- SageAttention Public Forked from thu-ml/SageAttention
Quantized Attention achieves speedup of 2-5x and 3-11x compared to FlashAttention and xformers, without lossing end-to-end metrics across language, image, and video models.
ModelTC/SageAttention’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…