TechCrunch News 2024年12月20日
Google releases its own ‘reasoning’ AI model
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

谷歌推出了名为Gemini 2.0 Flash Thinking Experimental的实验性推理AI模型,该模型在AI Studio平台上线。它旨在提升多模态理解、推理和编码能力,解决编程、数学和物理等领域的复杂问题。该模型基于Gemini 2.0 Flash,通过“思考”来增强推理能力,并在给出答案前进行自我检查,以避免错误。然而,该模型在计算推理过程较慢,且在简单的计数任务上表现不佳。目前,推理模型领域竞争激烈,除了谷歌,DeepSeek和阿里巴巴等公司也推出了类似模型,但其高昂的计算成本和可持续性仍存在挑战。

💡Gemini 2.0 Flash Thinking Experimental是谷歌推出的新型实验性推理AI模型,旨在提升多模态理解、推理和编码能力,可解决编程、数学和物理等领域的复杂问题。

🧠该模型基于Gemini 2.0 Flash,通过“思考”来加强推理,在给出答案前会进行自我检查,以避免错误,这与传统AI模型有显著差异。

⏱️推理模型通常需要更长的计算时间来得出答案,Gemini 2.0 Flash Thinking Experimental在处理问题时会暂停数秒,并解释其思考过程,但有时在简单任务(如数数)上表现不佳。

🚀目前,推理模型领域竞争激烈,除了谷歌,DeepSeek和阿里巴巴等公司也推出了类似模型,但其高昂的计算成本和可持续性仍存在挑战。

Google has released what it’s calling a new “reasoning” AI model — but it’s in the experimental stages, and from our brief testing, there’s certainly room for improvement.

The new model, called Gemini 2.0 Flash Thinking Experimental (a mouthful, to be sure), is available in AI Studio, Google’s AI prototyping platform. A model card describes it as “best for multimodal understanding, reasoning, and coding,” with the ability to “reason over the most complex problems” in fields such as programming, math, and physics.

In a post on X, Logan Kilpatrick, who leads product for AI Studio, called Gemini 2.0 Flash Thinking Experimental “the first step in [Google’s] reasoning journey.” Jeff Dean, chief scientist for Google DeepMind, Google’s AI research division, said in his own post that Gemini 2.0 Flash Thinking Experimental is “trained to use thoughts to strengthen its reasoning.”

“We see promising results when we increase inference time computation,” Dean said, referring to the amount of computing used to “run” the model as it considers a question.

Built on Google’s recently announced Gemini 2.0 Flash model, Gemini 2.0 Flash Thinking Experimental appears to be similar in design to OpenAI’s o1 and other so-called reasoning models. Unlike most AI, reasoning models effectively fact-check themselves, which helps them avoid some of the pitfalls that normally trip up models.

As a drawback, reasoning models often take longer — usually seconds to minutes longer — to arrive at solutions.

Given a prompt, Gemini 2.0 Flash Thinking Experimental pauses for a matter of seconds before responding, considering a number of related prompts and “explaining” its thinking along the way. After a while, the model summarizes what appears to be the best answer.

Well — that’s what’s supposed to happen. When I asked Gemini 2.0 Flash Thinking Experimental how many R’s were in the word “strawberry,” it said “two.”

Google’s reasoning model struggles with counting letters, sometimees.Image Credits:Google

Your mileage may vary.

In the wake of the release of o1, there’s been an explosion of reasoning models from rival AI labs — not just Google. In early November, DeepSeek, an AI research company funded by quant traders, launched a preview of its first reasoning model, DeepSeek-R1. That same month, Alibaba’s Qwen team unveiled what it claimed was the first “open” challenger to o1.

What opened the floodgates? Well, for one, the search for novel approaches to refine generative AI. As my colleague Max Zeff recently reported, “brute force” techniques to scale up models are no longer yielding the improvements they once did.

Not everyone’s convinced that reasoning models are the best path forward. They tend to be expensive, for one, thanks to the large amount of computing power needed to run them. And while they’ve performed well on benchmarks so far, it’s lot clear the rate of progress is sustainable.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

Gemini 2.0 推理AI模型 人工智能 多模态理解 深度学习
相关文章