Interconnects 02月20日
The latest open artifacts (#7): Alpaca era of reasoning models, China's continued dominance, and tons of multimodal advancements
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

本文介绍了开源生态系统中的诸多新进展,包括多种模型的发布、数据集的出现、训练资源及工具等内容。

DeepSeek R1相关数据集大量出现,选择需谨慎

中国持续发布强模型,部分将开源或免费

多种新模型发布,各有特点及应用场景

出现多个用于RL训练的代码库及相关工具

It’s one of those months where it feels like a year passes in the open-source ecosystem. Things are the liveliest they’ve been in quite some time, obviously thanks to DeepSeek R1. We’re here to map that out for you.

The biggest things to know are:

    Tons of DeepSeek R1 datasets are appearing, knowing which one is best is nuanced.

    China continues to release most of the strongest models across the AI stack with more permissive licenses.

      Aside from the many models covered in this and the previous episode, Baidu has announced their plan to make their upcoming Ernie 4.5 model open source on June 30th, while making their premium tier free in April.

      A lot of Chinese labs have started their presence on X (formerly Twitter), including Qwen, Hailuo, StepFun, and Hunyuan, among many others and made their models easily accessible with either HuggingFace Spaces or dedicated sites, like QwenLM Chat.

As usual, the artifacts in this post are available under this HuggingFace collection.

Share

Our Picks

Links

Interconnects is a reader-supported publication. Consider becoming a subscriber.

Reasoning

Models

Datasets

Tools

Lots of codebases are being spun up for RL training these days. Some of the first ones are libraries that implement verifiers. Two of these are:

    Math-Verify from HuggingFace: Tools for extracting math answers from LLM text. This is very similar to what Ai2 uses for RLVR training with open-instruct.

    Reasoning-Gym from Open-Thought: Verifiers for RL training across more domains. This library is a bit more complex, but it’s the most aggregated source of verifiers we’ve seen.

    Verdict from haizelabs: A new repository/toolkit for scaling up compound calls and inference-time compute for LLM-as-a-judge workflows.

Read more

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

开源生态 模型发布 数据集 训练工具
相关文章