热点
关于我们
xx
xx
"
稀疏化
" 相关文章
1-bit大模型还能再突破!新一代BitNet架构启用4位激活值
智源社区
2024-12-06T03:22:05.000000Z
1-bit大模型还能再突破,新一代BitNet架构启用4位激活值
36氪 - 科技频道
2024-12-05T05:10:06.000000Z
Neural Magic Releases 2:4 Sparse Llama 3.1 8B: Smaller Models for Efficient GPU Inference
MarkTechPost@AI
2024-11-25T17:50:01.000000Z
This AI Paper Introduces BitNet a4.8: A Highly Efficient and Accurate 4-bit LLM
MarkTechPost@AI
2024-11-10T09:04:58.000000Z
This AI Paper from China Proposes a Novel dReLU-based Sparsification Method that Increases Model Sparsity to 90% while Maintaining Performance, Achieving a 2-5× Speedup in Inference
MarkTechPost@AI
2024-06-15T07:01:53.000000Z
Researchers from Cerebras & Neural Magic Introduce Sparse Llama: The First Production LLM based on Llama at 70% Sparsity
MarkTechPost@AI
2024-05-18T04:00:52.000000Z