Coding with Intelligence, January 2
DeepSeek-V3: the model everyone is talking about

 

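This article covers a range of AI developments from the final two weeks of 2024, including the release and performance of models such as DeepSeek-V3, QwQ, and QVQ, as well as applications and research such as Groq Appgen and Epoch AI's work, illustrating how quickly the field is advancing.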

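DeepSeek-V3 is a GPT-4o-level open-source model with a large parameter count and low cost.

QwQ is the first truly open-source reasoning model; QVQ is a visual reasoning model.

o3 performs well on ARC-AGI-1; Groq Appgen enables instant web app generation.

ModernBERT is a base model well suited to fine-tuning; Epoch AI shows that frontier models are getting smaller.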

Dear readers,

Happy New Year and welcome to 2025! This week’s edition is a collection of everything that happened in the final two weeks (weeks 51 and 52) of 2024, and BOY did it get busy during that final sprint of the year. If nothing else, I think it signals that 2025 is going to be an incredible year for AI. With the democratization of frontier performance (DeepSeek-V3, QwQ, QVQ, Llama 3.3 70B, Qwen 2.5 72B), an incredible installed base of compute clusters (multiple interconnected 100k-accelerator clusters, with 1M-accelerator clusters in the works), and new frontier heights (o3) that fully automate most run-of-the-mill software engineering (71.7% on SWE-bench Verified), the pace of progress is bound to be electric. Strap in and enjoy the ride!

- Rick Lamers

DeepSeek-V3 Perf/Cost chart: a new position on the Pareto front

News

Repos

Papers

Products

Resources


Want more? Follow me on X! @ricklamers
