模型蒸馏_Fishai

热点

"模型蒸馏" 相关文章

Datawhale AI夏令营：Baseline与调优

掘金人工智能 2025-07-28T03:23:23.000000Z

基于模型蒸馏的大模型文案生成最佳实践

掘金人工智能 2025-07-25T03:08:41.000000Z

NVIDIA AI Releases OpenReasoning-Nemotron: A Suite of Reasoning-Enhanced LLMs Distilled from DeepSeek R1 0528

MarkTechPost@AI 2025-07-20T04:40:53.000000Z

Distilling Invariant Representations with Dual Augmentation

cs.AI updates on arXiv.org 2025-07-17T04:14:25.000000Z

Towards Interpretable Time Series Foundation Models

cs.AI updates on arXiv.org 2025-07-11T04:04:09.000000Z

Sakana AI Introduces Reinforcement-Learned Teachers (RLTs): Efficiently Distilling Reasoning in LLMs Using Small-Scale Reinforcement Learning

MarkTechPost@AI 2025-06-23T21:35:13.000000Z

深夜突袭！DeepSeek-R1重磅升级：媲美OpenAl最高o3模型，编码能力直逼Claude4

机器学习初学者 2025-05-30T05:32:11.000000Z

5%参数比肩DeepSeek满血R1！北大“小”模型靠分合蒸馏，打破推理成本下限

智源社区 2025-05-28T01:17:53.000000Z

Reasoning模型蒸馏实践：用大模型提升小模型能力

魔搭ModelScope社区 2025-05-23T15:01:06.000000Z

瘦身不降智！大模型训推效率提升30%，京东大模型开发计算研究登Nature旗下期刊

智源社区 2025-05-22T11:53:22.000000Z

瘦身不降智！大模型训推效率提升30%，京东大模型开发计算研究登Nature旗下期刊

量子位 2025-05-21T08:26:30.000000Z

DeepSeek-R1 发布，性能对标 OpenAI o1 正式版

DeepSeek 2025-05-13T16:51:11.000000Z

亚马逊功能最强模型Amazon Nova Premier现已正式可用

互联网数据资讯网-199IT 2025-05-06T07:56:35.000000Z

亚马逊推出Nova Premier模型旗下最先进但性价比不高

Cnbeta 2025-05-01T06:47:37.000000Z

Amazon Bedrock Model Distillation: Boost function calling accuracy while reducing cost and latency

AWS Machine Learning Blog 2025-05-01T01:15:54.000000Z

Import AI 400: Distillation scaling laws; recursive GPU kernel improvement; and wafer-scale computation

Import AI 2025-04-09T10:38:26.000000Z

漫画趣解：一口气搞懂模型蒸馏！

特大号 2025-04-09T09:54:29.000000Z

通过DeepSeek现象思考大模型落地的正确路径

安全村SecUN 2025-04-04T03:40:42.000000Z

通过DeepSeek现象思考大模型落地的正确路径

安全村SecUN 2025-03-25T02:10:40.000000Z

AI公司集体＂抄作业＂：白菜价训练＂小模型＂时代来了？

Cnbeta 2025-03-10T02:07:55.000000Z

Copyright © 2019 FISHAI.All Rights Reserved