TechCrunch News 01月29日
Hugging Face researchers are trying to build a more open version of DeepSeek’s AI ‘reasoning’ model
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

Hugging Face 正在发起 Open-R1 项目,旨在复刻 DeepSeek 的 R1 推理 AI 模型并开源其所有组件,包括训练数据。此举源于对 DeepSeek “黑盒”发布理念的挑战,尽管 R1 模型许可宽松,但其构建工具不透明。Hugging Face 工程师认为,完全开源 R1 不仅关乎透明度,更在于释放其潜力。R1 在多项基准测试中表现出色,甚至超越了 OpenAI 的 o1 模型。Open-R1 项目希望通过社区合作,利用 Hugging Face 的研究服务器,在几周内复制 R1,并为未来开源推理模型奠定基础。

🚀Hugging Face 发起 Open-R1 项目,旨在复刻 DeepSeek 的 R1 推理 AI 模型,并开源其所有组件,包括训练数据,以挑战 DeepSeek 的“黑盒”发布模式。

🛠️Open-R1 项目的目标是复制 R1 模型,并开源其训练代码和指令,以便深入研究模型、控制其行为,并解决潜在的偏见。项目利用 Hugging Face 的 Science Cluster,包含 768 个 Nvidia H100 GPU,以生成类似 DeepSeek 用于训练 R1 的数据集。

🤝Open-R1 项目正在 GitHub 上进行,并积极寻求 AI 和更广泛技术社区的帮助,以确保算法和方法的正确实施。该项目在三天内获得了 10,000 个 GitHub 星标,显示了社区的广泛兴趣。

💡Open-R1 成功后,将为 AI 研究人员提供一个基础,以开发下一代开源推理模型,并促进 AI 技术的更广泛应用。项目强调开源的益处,认为其能让所有人受益,包括前沿实验室和模型供应商。

Barely a week after DeepSeek released its R1 “reasoning” AI model — which sent markets into a tizzy — researchers at Hugging Face are trying to replicate the model from scratch in what they’re calling a pursuit of “open knowledge.”

Hugging Face head of research Leandro von Werra and several company engineers have launched Open-R1, a project that seeks to build a duplicate of R1 and open source all of its components, including the data used to train it.

The engineers said they were compelled to act by DeepSeek’s “black box” release philosophy. Technically, R1 is “open” in that the model is permissively licensed, which means it can be deployed largely without restrictions. However, R1 isn’t “open source” because many of the tools used to build it are shrouded in mystery. Like many high-flying AI companies, DeepSeek is loathe to reveal its secret sauce.

“The R1 model is impressive, but there’s no open data set, experiment details, or intermediate models available, which makes replication and further research difficult,” Elie Bakouch, one of the Hugging Face engineers on the Open-R1 project, told TechCrunch. “Fully open-sourcing R1’s complete architecture isn’t just about transparency — it’s about unlocking its potential.”

DeepSeek, a Chinese AI lab funded in part by a quantitative hedge fund, released R1 last week. On a number of benchmarks, R1 matches — and even surpasses — the performance of OpenAI’s o1 reasoning model.

Being a reasoning model, R1 effectively fact-checks itself, which helps it avoid some of the pitfalls that normally trip up models. Reasoning models take a little longer — usually seconds to minutes longer — to arrive at solutions compared to a typical non-reasoning model. The upside is that they tend to be more reliable in domains such as physics, science, and math.

R1 broke into the mainstream consciousness after DeepSeek’s chatbot app, which provides free access to R1, rose to the top of the Apple App Store charts. The speed and efficiency with which R1 was developed — DeepSeek released the model just weeks after OpenAI released o1 — has led many Wall Street analysts and technologists to question whether the U.S. can maintain its lead in the AI race.

The Open-R1 project is less concerned about U.S. AI dominance than “fully opening the black box of model training,” Bakouch told TechCrunch. He noted that, because R1 wasn’t released with training code or training instructions, it’s challenging to study the model in depth — much less steer its behavior.

“Having control over the data set and process is critical for deploying a model responsibly in sensitive areas,” Bakouch said. “It also helps with understanding and addressing biases in the model. Researchers require more than fragments […] to push the boundaries of what’s possible.”

The goal of the Open-R1 project is to replicate R1 in a few weeks, relying in part on Hugging Face’s Science Cluster, a dedicated research server with 768 Nvidia H100 GPUs.

The Hugging Face engineers plan to tap the Science Cluster to generate data sets similar to those DeepSeek used to create R1. To build a training pipeline, the team is soliciting help from the AI and broader tech communities on Hugging Face and GitHub, where the Open-R1 project is being hosted.

“We need to make sure that we implement the algorithms and recipes [correctly,]” Von Werra told TechCrunch, “but it’s something a community effort is perfect at tackling, where you get as many eyes on the problem as possible.”

There’s a lot of interest already. The Open-R1 project racked up 10,000 stars in just three days on GitHub. Stars are a way for GitHub users to indicate that they like a project or find it useful.

If the Open-R1 project is successful, AI researchers will be able to build on top of the training pipeline and work on developing the next generation of open source reasoning models, Bakouch said. He hopes the Open-R1 project will not only yield a strong open source replication of R1, but a foundation for better models to come.

“Rather than being a zero-sum game, open source development immediately benefits everyone, including the frontier labs and the model providers, as they can all use the same innovations,” Bakouch said.

While some AI experts have raised concerns about the potential for open source AI abuse, Bakouch believes that the benefits outweigh the risks.

“When the R1 recipe has been replicated, anyone who can rent some GPUs can build their own variant of R1 with their own data, further diffusing the technology everywhere,” he said. “We’re really excited about the recent open source releases that are strengthening the role of openness in AI. It’s an important shift for the field that changes the narrative that only a handful of labs are able to make progress, and that open source is lagging behind.”

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

Open-R1 DeepSeek 开源 AI模型 推理模型
相关文章