MoE模型_Fishai

热点

"MoE模型" 相关文章

WAIC 2025镇馆之宝！“算力核弹”华为昇腾384超节点一图看懂

快科技资讯 2025-07-28T16:19:28.000000Z

GSPO: Towards Scalable Reinforcement Learning for Language Models

Qwen 技术博客 2025-07-27T10:18:46.000000Z

美图旗下Wink上线“全能修复”功能

36氪 2025-07-24T08:13:24.000000Z

Kimi K2官方技术报告出炉：采用384个专家，训练不靠刷题靠“用自己的话再讲一遍”

智源社区 2025-07-23T09:51:47.000000Z

【新模型速递】PAI-Model Gallery云上一键部署Kimi K2模型

掘金人工智能 2025-07-21T07:18:25.000000Z

Deploying Kimi K2 with PD Disaggregation and Large-Scale Expert Parallelism on 128 H200 GPUs

Large Model Systems Organization 2025-07-20T08:55:14.000000Z

Kimi 员工复盘 K2：为什么聚焦 Agent、为什么开源，为什么选择 DSV3 架构？

智源社区 2025-07-19T11:42:39.000000Z

杨植麟摸着DeepSeek过河

36kr-科技 2025-07-19T04:27:28.000000Z

一文看懂 MOE 模型：让大模型像医院看病一样高效工作

掘金人工智能 2025-07-18T06:08:18.000000Z

华丰科技，正宗的昇腾 384 超节点！核心高速总线互联技术！

韭研公社 2025-07-18T05:27:28.000000Z

AI圈水太深：OpenAI保密、Meta作弊！国产MoE却异军突起

智源社区 2025-07-17T03:02:27.000000Z

腾讯混元开源首款混合推理MoE模型，擅长Agent工具调用和长文理解

夕小瑶科技说 2025-07-16T19:09:06.000000Z

硅基流动 SiliconCloud 上线月之暗面 Kimi K2

硅基流动 2025-07-13T16:49:59.000000Z

杨植麟被梁文锋叫醒了！Kimi新模型发布即开源，1T参数全线SOTA

智源社区 2025-07-13T13:38:01.000000Z

Moonshot AI Releases Kimi K2: A Trillion-Parameter MoE Model Focused on Long Context, Code, Reasoning, and Agentic Behavior

MarkTechPost@AI 2025-07-12T04:26:06.000000Z

“狠人”闫俊杰，闯关IPO

36氪 - AI相关文章 2025-07-11T07:59:31.000000Z

Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models

cs.AI updates on arXiv.org 2025-07-04T04:08:40.000000Z

华为又开源了个大的：超大规模 MoE 推理秘籍

掘金人工智能 2025-07-01T10:05:15.000000Z

Tencent Open Sources Hunyuan-A13B: A 13B Active Parameter MoE Model with Dual-Mode Reasoning and 256K Context

MarkTechPost@AI 2025-06-28T20:40:48.000000Z

Qwen1.5-MoE: Matching 7B Model Performance with 1/3 Activated Parameters

Qwen 技术博客 2025-06-25T07:54:01.000000Z

Copyright © 2019 FISHAI.All Rights Reserved