Interconnects 05月29日 21:45
Latest open artifacts (#10): New DeepSeek R1 0528!, more permissive licenses, everything as a reasoner, and from artifacts to agents
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

本文探讨了近期AI模型领域的最新动态,特别关注了开源模型的崛起以及推理能力的提升。文章分析了中国模型在开放许可方面的显著进展,并讨论了西方生态系统面临的压力。此外,文章还介绍了多个值得关注的开源模型,如DeepSeek-R1、Skywork-R1V2等,以及它们在推理、多模态和RAG等方面的应用。文章最后还提到了与AI相关的其他资源和讨论,如AI产品、在线关系、深度伪造等,为读者提供了全面的视角。

💡中国开源模型蓬勃发展:文章指出,中国模型在开放许可方面取得了显著进展,推动了其他开源模型许可的改进。Qwen模型在微调方面表现突出,受到中国公司和小型美国初创企业的青睐。

🚀推理模型成为主流:推理模型在AI领域占据主导地位,GRPO算法仍被广泛应用。文章介绍了多个新发布的推理模型,如DeepSeek-R1、Skywork-R1V2、Pleias-RAG-1B等,涵盖了推理、多模态、RAG等多个方面。

🛠️工具与模型的结合:领先的AI发布更倾向于工具与模型的结合,例如Claude Code、OpenAI的Codex以及Gemini的Jules。开放系统需要采取不同的形式,以充分利用可交换和迭代的开放模型的优势。

A consistent trend over the last few months has been in the surge of Chinese models with permissive licenses, which has been translating into improvements in the licenses used by other open models. The major players in the Western ecosystem — Meta’s Llama and Google’s Gemma — are yet to do this, but pressure is building.

Mirroring this, we’re seeing far more Qwen finetunes than Llama. Llama, for it’s first 3 versions, was by far and away the leading model for fine-tuners. “Qwen as the default” is not only the view of other Chinese companies, but it is championed by many smaller American startups looking to break through wit hstrong models. While Qwen2.5, Qwen2.5-VL and QwQ are the leading base models, we also see first models based on Qwen3.

These trends are on top of the transition we’ve seen wrapping up on top of the entire industry where reasoning models are the default. GRPO is still the most common algorithm in practice (see our research overview for expansions of the method).

A trend that is beginning, and one that will take longer, is that leading AI releases are much more often about tools than models alone. From Claude Code to OpenAI’s Codex (agent) and Gemini’s Jules, open replications of these systems will be much slower. The open systems will need to take on different forms in order to take the benefits of open models that can be swapped and iterated upon, all of which we hope to highlight in future issues.

Share

Our Picks

Links

Reasoning

Models

Datasets

Models

Read more

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

开源模型 推理能力 AI进展
相关文章