热点
关于我们
xx
xx
"
模型融合
" 相关文章
ICML 2025 | 还在裸跑LoRA?CoTo用渐进激活杀出新路,融合剪枝全起飞
PaperWeekly
2025-07-30T03:06:46.000000Z
ICML 2025 | CoTo:让LoRA训练「渐入佳境」,模型融合、剪枝样样精通
机器之心
2025-07-27T09:00:39.000000Z
ICML 2025 | CoTo:让LoRA训练「渐入佳境」,模型融合、剪枝样样精通
机器之心
2025-07-26T18:56:53.000000Z
ChipAlign: Instruction Alignment in Large Language Models for Chip Design via Geodesic Interpolation
cs.AI updates on arXiv.org
2025-07-17T04:14:26.000000Z
Low-Rank and Sparse Model Merging for Multi-Lingual Speech Recognition and Translation
cs.AI updates on arXiv.org
2025-07-09T04:02:08.000000Z
DeepSeek R1T2 Chimera: 200% Faster Than R1-0528 With Improved Reasoning and Compact Output
MarkTechPost@AI
2025-07-03T11:45:35.000000Z
不用等R2了!第三方给新版DeepSeek V3添加深度思考,推理101秒破解7米甘蔗过2米门
智源社区
2025-04-29T14:14:45.000000Z
DeepSeek-R1T-Chimera:当R1的智慧,遇上V3的速度!开源AI新物种驾到!
掘金 人工智能
2025-04-29T02:27:58.000000Z
不用等R2了!第三方给新版DeepSeek V3添加深度思考,推理101秒破解7米甘蔗过2米门
掘金 人工智能
2025-04-28T09:22:52.000000Z
2025.3 OpenAI CPO Kevin Weil 访谈
孔某人的低维认知
2025-03-20T13:15:42.000000Z
靠这个免费的开源库 人人都能手搓DeepSeek应用了
快科技资讯
2025-03-06T22:59:53.000000Z
亚马逊放大招,多模型融合Alexa+来袭,剑指千亿级智能家居市场
36氪 AI
2025-02-27T10:51:53.000000Z
观点|从Deepseek-R1看2025模型的未来
Zilliz
2025-02-19T23:41:47.000000Z
Enhancing Reasoning Capabilities in Low-Resource Language Models through Efficient Model Merging
MarkTechPost@AI
2025-02-17T06:05:10.000000Z
网易有道全线AI应用接入DeepSeek-R1|钛媒体独家
钛媒体:引领未来商业与生活新知
2025-02-06T09:31:22.000000Z
Kimi思考模型k1.5是怎么练成的?细节曝光
PaperAgent
2025-01-21T17:16:11.000000Z
NVIDIA Research Introduces ChipAlign: A Novel AI Approach that Utilizes a Training-Free Model Merging Strategy, Combining the Strengths of a General Instruction-Aligned LLM with a Chip-Specific LLM
MarkTechPost@AI
2025-01-03T00:28:52.000000Z
Top5团队!全球AI攻防挑战赛系列分享(四)
智源社区
2024-12-16T04:07:03.000000Z
TIME Framework: A Novel Machine Learning Unifying Framework Breaking Down Temporal Model Merging
MarkTechPost@AI
2024-12-14T06:34:49.000000Z
DeepSeek等团队新作JanusFlow: 1.3B大模型统一视觉理解和生成
智源社区
2024-11-23T05:07:17.000000Z