Mashable, January 22
DeepSeek AI might be smarter than OpenAI's smartest AI, and you can try it out now

 

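Chinese artificial intelligence company DeepSeek has launched DeepSeek R1, an open-source large language model that outperforms other well-known models, such as OpenAI's, on several benchmarks, and is especially strong at mathematical, coding, and reasoning tasks. R1 is a refinement of DeepSeek R1 Zero; it uses multi-stage training and cold-start data to fix Zero's problems with readability and language mixing. R1 is open source, so experts can scrutinize it for privacy and security, and it is also free to use as a web app, with very cheap API access, which gives it a clear cost advantage. In testing, R1 did well at complex programming, practical advice, and showing its reasoning, demonstrating that strong AI reasoning does not have to come with high training costs.

🚀 DeepSeek R1 is an open-source large language model that stands out in mathematical, coding, and reasoning tasks, outperforming many other well-known LLMs.

🛠️ R1 builds on DeepSeek R1 Zero: multi-stage training and cold-start data fix Zero's problems with readability and language mixing, and reinforcement learning is applied as well.

💰 DeepSeek R1 is not only open source but also free to use as a web app, and API access is very cheap: $0.14 per one million input tokens, compared to $7.50 for OpenAI's o1 model, which greatly lowers the cost of use.

🧠 In testing, DeepSeek R1 did well at complex programming tasks, useful advice, and showing its reasoning, demonstrating strong AI capability at a comparatively low training cost.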

There's a new AI player in town, and you might want to pay attention to this one.

On Monday, Chinese artificial intelligence company DeepSeek launched a new, open-source large language model called DeepSeek R1.

According to DeepSeek, R1 beats other popular large language models (LLMs), such as those from OpenAI, in several important benchmarks, and it's especially good at mathematical, coding, and reasoning tasks.

DeepSeek R1 is actually a refinement of DeepSeek R1 Zero, an LLM that was trained without supervised fine-tuning, a step conventionally used in training such models. This made it very capable in certain tasks, but as DeepSeek itself puts it, Zero had "poor readability and language mixing." Enter R1, which fixes these issues by incorporating "multi-stage training and cold-start data" before it was trained with reinforcement learning.

Arcane technical language aside (the details are online if you're interested), there are several key things you should know about DeepSeek R1. First, it's open source, meaning it's open to scrutiny from experts, which should alleviate concerns about privacy and security. Second, it's free to use as a web app, while API access is very cheap ($0.14 per one million input tokens, compared to $7.50 for OpenAI's most powerful reasoning model, o1).
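To put those per-token prices in perspective, here is a minimal sketch of the arithmetic. It uses only the two input-token prices quoted above; the function name `input_cost` and the 250,000-token workload are hypothetical, chosen for illustration, and output-token pricing is not covered.

```python
# Prices in USD per one million input tokens, as quoted in the article.
PRICE_PER_MILLION_INPUT = {
    "DeepSeek R1": 0.14,
    "OpenAI o1": 7.50,
}


def input_cost(model: str, tokens: int) -> float:
    """Return the input-token cost in USD for the given number of tokens."""
    return PRICE_PER_MILLION_INPUT[model] * tokens / 1_000_000


# Hypothetical workload: 250,000 input tokens.
tokens = 250_000
for model in PRICE_PER_MILLION_INPUT:
    print(f"{model}: ${input_cost(model, tokens):.4f}")

# How many times more expensive the o1 input price is than R1's.
ratio = PRICE_PER_MILLION_INPUT["OpenAI o1"] / PRICE_PER_MILLION_INPUT["DeepSeek R1"]
print(f"o1 / R1 input price ratio: {ratio:.1f}x")
```

At the quoted prices, o1's input tokens cost roughly 54 times as much as R1's.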

Most importantly, this thing is very, very capable. To test it out, I immediately threw it into deep waters, asking it to code a fairly complex web app which needed to parse publicly available data, and create a dynamic website with travel and weather information for tourists. Amazingly, DeepSeek produced completely acceptable HTML code right away, and was able to further refine the site based on my input while improving and optimizing the code on its own along the way.

I'll do all of that...tomorrow. Credit: Stan Schroeder / Mashable / DeepSeek

I also asked it to improve my chess skills in five minutes, to which it replied with a number of neatly organized and very useful tips (my chess skills did not improve, but only because I was too lazy to actually go through with DeepSeek's suggestions).

I then asked DeepSeek to prove how smart it is in exactly three sentences. Bad move by me, as I, the human, am not nearly smart enough to verify or even fully understand any of the three sentences. Notice, in the screenshot below, that you can see DeepSeek's "thought process" as it figures out the answer, which is perhaps even more fascinating than the answer itself.

We get it, you're smart. Credit: Stan Schroeder / Mashable / DeepSeek

It's impressive to use. But as ZDNET noted, in the background of all this are training costs that are orders of magnitude lower than for some competing models, as well as chips that aren't as powerful as those at the disposal of U.S. AI companies. DeepSeek thus shows that extremely clever AI with reasoning ability doesn't have to be extremely expensive to train, or to use.
