热点
"闪电注意力" 相关文章
MiniMax-Text-01 and MiniMax-VL-01 Released: Scalable Models with Lightning Attention, 456B Parameters, 4B Token Contexts, and State-of-the-Art Accuracy
MarkTechPost@AI 2025-01-15T20:02:58.000000Z