钛媒体:引领未来商业与生活新知 02月10日
DeepSeek Researcher Predicts Significant Progress in 2025
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

DeepSeek研究员郭达亚在X平台上透露,DeepSeek R1的训练仅耗时两到三周,并表示在春节期间见证了R1-Zero曲线的持续增长,真切感受到了强化学习的力量。研究团队在春节期间观察到R1-Zero的显著改进,展示了强化学习的巨大潜力。郭达亚还提到,他们使用未被RL提示覆盖的领域的基准来评估泛化能力,目前看来R1-Zero具备泛化能力。他认为强化学习仍处于早期阶段,未来还有很长的路要走,并相信今年将取得重大进展。此前有传言称阿里巴巴计划以100亿美元的估值投资DeepSeek,但阿里巴巴方面已否认。

🚀 DeepSeek研究员透露,DeepSeek R1的训练仅耗时两到三周,突显了其高效性。

📈 研究团队在春节期间观察到R1-Zero的显著改进,表明强化学习在提升模型性能方面的巨大潜力。

🌐 DeepSeek使用未被RL提示覆盖的领域的基准来评估泛化能力,结果显示R1-Zero具备良好的泛化能力。

💰 尽管有传言称阿里巴巴计划投资DeepSeek,但阿里巴巴方面已否认,目前DeepSeek的估值约为80亿美元。

(Image source: Photo by Lin Zhijia, TMTPost AGI Editor)

AsianFin -- In a surprising revelation, DeepSeek researcher Daya Guo shared on X platform that DeepSeek R1 training took only two to three weeks, saying “The happiest moment during the Spring Festival was witnessing R1Zero's curves continuously increase and truly feeling the power of reinforcement learning.”

Additionally, during the Chinese New Year, the research team observed significant improvements in R1-Zero, demonstrating the immense potential of reinforcement learning (RL).

On February 1, the fourth day of the Chinese New Year, Guo took to X to express his excitement over the performance of R1-Zero.

In replies to netizens, Guo said “We use benchmarks from domains not covered by the RL prompt to evaluate generalization. So far, it appears to have generalization capability.”

“I think we are still in an early stage, and there is still a long way to explore in RL. I believe there will be significant progress this year,” Guo added.

Last Friday evening, there were reports that Alibaba plans to invest US$1 billion to acquire a 10% stake in DeepSeek based on a valuation of $10 billion, and that both parties are currently discussing transaction details.

Yan Qiao, a vice president at Alibaba, responded on her WeChat Moments saying that Alibaba, as a fellow Hangzhou-based company, applauds DeepSeek, but the circulating rumors about Alibaba investing in DeepSeek are false. 

DeepSeek's current valuation is about $8 billion, according to industry insiders. The rumors initially spread within investment circles and quantitative groups, attracting significant interest from several investment institutions.

Zhu Xiaohu, a managing partner at GSR Ventures, previously said that he would definitely invest if DeepSeek opens for financing. Zhu believes DeepSeek should remain open to financing because moving forward will require significant investment, particularly in computational resources like GPU cards.

更多精彩内容,关注钛媒体微信号(ID:taimeiti),或者下载钛媒体App

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

DeepSeek R1 强化学习 阿里巴巴 估值
相关文章