Trending
Articles related to "Mixture-of-Experts Models"
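Huawei Pangu makes its first appearance: an Ascend-native 72B MoE architecture, tied for first in China on SuperCLUE among models under 100 billion parameters
Juejin (掘金) AI 2025-05-28T09:08:02.000000Z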
The Rise of Mixture-of-Experts: How Sparse AI Models Are Shaping the Future of Machine Learning
Unite.AI 2025-05-06T23:22:34.000000Z
DeepSeek R2 reportedly launching next month: cost 97% lower than GPT, Huawei chip performance on par with Nvidia
Sina Tech Science Exploration (Latest) 2025-04-29T14:13:27.000000Z
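Qwen3 arrives in force: top-tier reasoning, more than 100 languages, and leading agent collaboration, setting the pace for open-source large AI models
Juejin (掘金) AI 2025-04-29T07:28:04.000000Z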
DeepSeek R2 reportedly launching next month: cost 97% lower than GPT, Huawei chip performance on par with Nvidia
Kuai Keji (快科技) News 2025-04-29T01:16:24.000000Z
Alibaba Qwen Team Just Released Qwen3: The Latest Generation of Large Language Models in Qwen Series, Offering a Comprehensive Suite of Dense and Mixture-of-Experts (MoE) Models
MarkTechPost@AI 2025-04-29T01:10:35.000000Z
Multimodal Models Don’t Need Late Fusion: Apple Researchers Show Early-Fusion Architectures are more Scalable, Efficient, and Modality-Agnostic
MarkTechPost@AI 2025-04-14T22:20:29.000000Z
Meta announces Llama 4, built on an MoE architecture, and open-sources the 400-billion- and 109-billion-parameter Maverick
AI & Big Data 2025-04-07T01:28:48.000000Z
Ant Group reportedly trains AI on domestic chips from Alibaba, Huawei, and others: performance rivals Nvidia's H800 at 20% lower cost
ITHome (IT之家) 2025-03-24T06:12:49.000000Z
Observations About LLM Inference Pricing
LessWrong 2025-03-04T03:04:11.000000Z
DeepSeek's big open-source move: FlashMLA sends H800 compute soaring and reveals the secret behind its low costs
BAAI Community (智源社区) 2025-02-25T03:18:07.000000Z
Moonshot AI's Kimi open-sources Moonlight: a 3-billion / 16-billion-parameter Mixture-of-Experts model
ITHome (IT之家) 2025-02-24T01:07:38.000000Z
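Solving composite problems in a single inference pass: MeteoRA, an MoE-based architecture for scalable fusion of knowledge modules in large language models
Synced (机器之心) 2025-02-22T05:53:46.000000Z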
Lex Fridman's five-hour conversation on DeepSeek: DeepSeek's innovations and 2025 AI trends in one read
Founder Park 2025-02-11T16:31:22.000000Z
Autonomy-of-Experts (AoE): A Router-Free Paradigm for Efficient and Adaptive Mixture-of-Experts Models
MarkTechPost@AI 2025-01-27T01:04:01.000000Z
DeepSeek V3 and the actual cost of training frontier AI models
Interconnects 2025-01-09T20:55:35.000000Z
Noteworthy AI Research Papers of 2024 (Part One)
Ahead of AI 2024-12-31T12:28:58.000000Z
Researchers from Tsinghua University Propose ReMoE: A Fully Differentiable MoE Architecture with ReLU Routing
MarkTechPost@AI 2024-12-29T08:09:40.000000Z
The pace of China's AI progress draws US media attention; AI models set to become a new technology hallmark
Cnbeta 2024-12-25T02:36:35.000000Z
DeepSeek-VL2 AI vision model open-sourced: supports dynamic resolution, handles scientific charts, interprets memes, and more
ITHome (IT之家) 2024-12-14T02:34:30.000000Z