TechCrunch News, April 9, 04:02
Deep Cogito emerges from stealth with hybrid AI ‘reasoning’ models

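Deep Cogito has released a family of open AI models, called Cogito 1, that can switch between “reasoning” and non-reasoning modes. These hybrid models combine reasoning and non-reasoning components, so they can answer simple questions quickly while thinking more deeply about harder queries. Cogito 1 is built on Meta’s Llama and Alibaba’s Qwen models and uses novel training methods to improve performance. By the company’s own benchmarks, it performs well on math and language evaluations, outperforming DeepSeek’s R1 and Meta’s Llama 4 Scout on some tests. The Cogito 1 models can be downloaded or used via API on cloud providers Fireworks AI and Together AI.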

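🧠 Deep Cogito is a newly launched company that has released a family of AI models called Cogito 1, which can switch between reasoning and non-reasoning modes. Reasoning mode excels in domains like math and physics but costs more compute; non-reasoning mode is faster.

⚙️ The Cogito 1 models are hybrids that combine reasoning and non-reasoning components. This lets them answer simple questions quickly and spend more time on more complex ones. Cogito claims its models outperform the best open models of the same size from companies such as Meta and DeepSeek.

🚀 Cogito 1 is built on Meta’s Llama and Alibaba’s Qwen models, with novel training approaches used to boost performance and enable toggleable reasoning. The Cogito 1 models currently range from 3 billion to 70 billion parameters, with larger models to follow in the coming months.

📈 According to Deep Cogito’s internal benchmarks, the largest Cogito 1 model, Cogito 70B, outperforms DeepSeek’s R1 reasoning model on some math and language evaluations when reasoning is enabled. With reasoning disabled, Cogito 70B also beats Meta’s Llama 4 Scout on LiveBench, a general-purpose AI test.

🏢 Deep Cogito was founded in June 2024 and is based in San Francisco. Its founders previously worked at Google, including its AI lab DeepMind, and the company aims to build “general superintelligence”: AI that can outperform humans and uncover entirely new capabilities.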

A new company, Deep Cogito, has emerged from stealth with a family of openly available AI models that can be switched between “reasoning” and non-reasoning modes.

Reasoning models like OpenAI’s o1 have shown great promise in domains like math and physics, thanks to their ability to effectively fact-check themselves by working through complex problems step by step. This reasoning comes at a cost, however: higher compute requirements and latency. That’s why labs like Anthropic are pursuing “hybrid” model architectures that combine reasoning components with standard, non-reasoning elements. Hybrid models can quickly answer simple questions while spending additional time considering more challenging queries.

All of Deep Cogito’s models, called Cogito 1, are hybrid models. Cogito claims that they outperform the best open models of the same size, including models from Meta and Chinese AI startup DeepSeek.

“Each model can answer directly […] or self-reflect before answering (like reasoning models),” the company explained in a blog post. “[All] were developed by a small team in approximately 75 days.”

The Cogito 1 models range from 3 billion parameters to 70 billion parameters, and Cogito says that models ranging up to 671 billion parameters will join them in the coming weeks and months. Parameters roughly correspond to a model’s problem-solving skills, with more parameters generally being better.

Cogito 1 wasn’t developed from scratch, to be clear. Deep Cogito built on top of Meta’s open Llama and Alibaba’s Qwen models to create its own. The company says that it applied novel training approaches to boost the base models’ performance and enable toggleable reasoning.

According to the results of Cogito’s internal benchmarking, the largest Cogito 1 model, Cogito 70B, outperforms DeepSeek’s R1 reasoning model on a few mathematics and language evaluations when reasoning is enabled. With reasoning disabled, Cogito 70B also eclipses Meta’s recently released Llama 4 Scout model on LiveBench, a general-purpose AI test.

Every Cogito 1 model is available for download or use via APIs on cloud providers Fireworks AI and Together AI.

Cogito 1’s performance compared to other popular openly available AI models. Image Credits: Deep Cogito

“Currently, we’re still in the early stages of [our] scaling curve, having used only a fraction of compute typically reserved for traditional large language model post/continued training,” wrote Cogito in its blog post. “Moving forward, we’re investigating complementary post-training approaches for self-improvement.”

According to filings with the state of California, San Francisco-based Deep Cogito was founded in June 2024. The company’s LinkedIn page lists two co-founders, Drishan Arora and Dhruv Malhotra. Malhotra was previously a product manager at Google AI lab DeepMind, where he worked on generative search technology. Arora was a senior software engineer at Google.

Deep Cogito, whose backers include South Park Commons, according to PitchBook, ambitiously aims to build “general superintelligence.” The company’s founders understand the phrase to mean AI that can perform tasks better than most humans and “uncover entirely new capabilities we have yet to imagine.”
