inference-acceleration - 主题项目

追踪 GitHub 趋势，把握技术发展脉搏

探索趋势了解更多

inference-acceleration - 主题项目

追踪 GitHub 趋势，把握技术发展脉搏

探索趋势了解更多

成就解锁

❤️❤️❤️❤️❤️❤️ 我们已经正式推出微信小程序，在微信中搜索 TrendForge Pro 即可使用小程序，如果使用 Telegram 请搜索 trendforge_tg ❤️❤️❤️❤️❤️❤️

inference-acceleration

话题找到数量

thu-ml/SageAttention

量化注意力机制相比FlashAttention和xformers实现了2-5倍和3-11倍的速度提升，且在语言、图像和视频模型上保持端到端指标无损。

attention cuda efficient-attention

thu-ml

thu-ml 开发者

3.4k

422

351

+149

排名 #15

5月25日

thu-ml/SpargeAttn

SpargeAttention：一种免训练的稀疏注意力机制，可加速任何模型推理

ai-infra attention inference-acceleration

thu-ml

thu-ml 开发者

951

87

346

+4

排名 #16

2月25日

首页上一页

1

1

下一页末页

助手