Fortune | FORTUNE 2024年12月05日
Exclusive: Reasoner, a startup from Crashlytics’ co-founder, claims a breakthrough in making AI reliable enough for the enterprise
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

Reasoner 是一款由连续创业者 Wayne Chang 开发的 AI 推理引擎,旨在提供比大型语言模型(如 OpenAI 的 o1 系列)更准确、更易解释的结果,同时成本更低。它通过结合神经网络和符号 AI 的方法,构建自适应动态知识图谱,能够自动处理非结构化文本,构建知识图谱并自动更新。Reasoner 已在专利领域取得成功,并有望应用于制药、材料科学、安全和情报等领域,其成本也远低于现有的 LLM 模型。

🤔 **Reasoner 是一种基于神经符号 AI 的推理引擎,旨在解决大型语言模型(LLM)存在的准确性和可解释性问题。**它通过结合神经网络和符号 AI 的优势,能够更准确地理解和推理文本信息,并提供可解释的结果,从而克服 LLM 的“幻觉”问题,并提高企业应用的信任度。

💡 **Reasoner 利用自适应动态知识图谱来处理非结构化文本,并自动构建和更新知识库。**与传统的知识图谱需要手动更新不同,Reasoner 可以自动从文本中提取关键信息,构建知识图谱,并根据新信息自动调整知识图谱,从而实现更灵活和高效的知识管理。

🚀 **Reasoner 在专利领域取得成功,并有望应用于更多领域。**Patented.ai 利用 Reasoner 成功解决了专利侵权检测和创新识别难题,证明了其在高风险场景下的有效性。Reasoner 还可应用于制药、材料科学、安全和情报等领域,为这些领域提供更强大的 AI 推理能力。

💰 **Reasoner 的成本远低于现有的 LLM 模型,具有显著的成本优势。**与 OpenAI 的 o1 系列模型相比,Reasoner 的输入和输出成本都大幅降低,这使得其在各种应用场景下都具有更高的性价比。

🗓️ **Reasoner 计划于 2025 年第一季度公开发布基准测试和演示,并提供软件开发工具包。**这将使开发者能够将 Reasoner 集成到他们的应用程序和 AI 代理中,并进一步推动 AI 推理技术的发展。

The serial entrepreneur Wayne Chang is taking the wraps off an AI reasoning engine called Reasoner, which he claims can produce much more accurate and explainable results than large language models such as OpenAI’s o1 series—and at much lower cost.The AI industry is pushing hard to build reasoning capabilities into the technology, partly to draw closer to the holy grail of human-level or superhuman artificial intelligence, and partly just to overcome the inaccuracies that plague today’s LLMs. Generative AI’s so-called hallucinations are a major factor in holding back enterprise deployments, as is the impossibility of explaining how an LLM arrives at its conclusions.Reasoner aims to solve these problems through the use of neurosymbolic AI—a melding of neural networks (the technology that underpins generative AI) and more traditional symbolic AI, which is based on fixed rules, logic, and human-derived mappings of the relationships between things.Chang has a long history in tech, starting with his creation of the i2hub file-sharing service in 2004. In 2011 he co-founded Crashlytics, a ubiquitous mobile crash-reporting tool that was bought by Twitter, where he became director of consumer product strategy. He went on to co-found the AI-powered accounting firm Digits (which Google bought along with Crashlytics in 2017) and then last year he founded Patented.ai—an intellectual-property-focused AI tool that, it now turns out, has also served as the pilot implementation of the Reasoner engine.High-stakes AIPatented.ai offers the ability to conduct automated searches of patent documentation and source code, to spot potential patent infringement cases and identify innovations that could be patentable. Given the high financial stakes of patent cases and the extremely laborious nature of figuring out whether an infringement has taken place, there are clear opportunities for anyone who can automate the process—but also massive risks if the system gets it wrong.In an exclusive interview with Fortune, Chang said Patented.ai’s early reliance on LLMs alone proved fruitless—the attorneys playing with the system immediately spotted the flaws in its results and rejected it. The company also tried other common techniques like retrieval-augmented generation, which draws on external data sources to enhance the output of LLMs (Google uses RAG for its AI search results), but that also didn’t provide the necessary level of reliability.This prompted the change in tack that resulted in the development of Reasoner. “We didn’t really start out to build a reasoning engine,” Chang says. “That wasn’t our mission at all.”Reasoner does use LLMs to help interpret the language in texts—Chang says it’s agnostic as to which model it uses—but the core concept in Reasoner is that of adaptive dynamic knowledge graphs.Knowledge graphs are widely used in tech. For over a decade, Facebook’s knowledge graph has provided the framework for establishing the relationships between people, while Google’s has given Search the ability to answer basic factual questions. These repositories of established knowledge are clearly useful for giving correct responses to queries—IBM’s Jeopardy-winning Watson AI was built on a knowledge graph—but they generally need to be manually updated to add new facts or edit relationships that have changed. The more complex the knowledge graph, the more work that entails.Chang claims that Reasoner removes the need for manual updating, instead offering the ability to automatically build accurate knowledge graphs based on the unstructured text fed into the system, and for those knowledge graphs to then automatically reconfigure themselves as information gets added or changed. (It’s worth noting that Microsoft earlier this year revealed GraphRAG, an attempt to use LLM-generated knowledge graphs to improve RAG results.)In other words, you can stick in a bunch of legal documents and Reasoner will then interpret them to build a knowledge graph containing the concepts in the documents and the relationships between them—with “full traceability” so that it’s easy for a human to check whether those facts are indeed an accurate representation of what’s in the documents. This is where the concept becomes useful far beyond the realms of patent litigation.In a demonstration to Fortune, Chang showed how Reasoner could ingest dozens of OpenAI’s various legal documents (from its user and developer agreements to its brand guidelines and cookie notices) and map their interdependencies. In the demo, this made it possible to provide both concise and detailed answers to a question about how a user might be able to exploit the differences between OpenAI’s U.S. and European terms of service to “avoid responsibility for harmful AI outputs.” Each step in the reasoning was explained—the logical steps were understandable even to a non-technical eye—and Reasoner then suggested follow-up questions about the problem’s impacts and how it could be mitigated.Chang says Reasoner could also be used in a variety of other applications, from pharmaceuticals and advanced materials to security and intelligence. As such, he claims it can outperform the offerings from various other AI startups, such as Hebbia (a document search firm that raised a $130 million Series B in July) and Sakana (an Nvidia-backed scientific discovery outfit that raised $214 million in a September Series A round).The cost of reasonBut in terms of reasoning abilities, the big beast at the moment is OpenAI and its o1 series of models, which take a very different approach to the problem. Rather than straying from the pure-LLM paradigm, the o1 models use “chain of thought” reasoning combined with search, methodically working through a series of steps to arrive at a more considered answer than OpenAI’s GPT models could previously manage.The o1 models generally provide more accurate answers than their predecessors, but Chang claims Reasoner’s output is more accurate still. There aren’t many reasoning benchmarks out there—Reasoner may release its own early next year—but, based on DocBench and Google’s recently-released Frames benchmark dataset, Chang said Reasoner achieved over 90% accuracy where o1 couldn’t break 80%. This result could not be independently verified at the time of publication.He also said Reasoner’s approach allowed for far lower costs. OpenAI charges $15 per million tokens (the base unit of AI data, equivalent to about 1.5 words) of input and $60 per million output tokens, whereas a million input tokens cost Reasoner 8 cents, and a million output tokens just 30 cents. “We haven’t finalized how we want to price this,” Chang said, adding that Reasoner’s “structural cost advantage” would allow it to charge users per result or per verified discovery.Chang’s claims are certainly big, but Reasoner’s team is small—there are around a dozen staffers, mostly in the U.S. So far, the company has only had a $4.5 million pre-seed round, which took place last year with investors including the likes of Baseline Ventures founder Steve Anderson, Y Combinator MD Ali Rowghani, and Operator Collective founder and CEO Mallun Yen. “I’ve been very fortunate to have a few successes in my history, so I’ve not been too worried about funding,” said Chang. But the entrepreneur expects to hire more staffers soon, as Reasoner scales up.Chang said Reasoner—which took $1.8 million in bookings in Q3 of this year—will publicly release its benchmarks and demo in the first quarter of 2025, allowing people to upload their own datasets and test the company’s claims. The firm will also release a software development kit, to allow others to embed the Reasoner engine into their applications and AI agents. (Chang says the engine is lightweight enough that it can even run on the latest iPhones and Android devices, without the need for internet connectivity.)“We want to make sure that we release it in a way where we immediately start building that trust and credibility,” Chang said.How many degrees of separation are you from the globe's most powerful business leaders? Explore who made our brand-new list of the 100 Most Powerful People in Business. Plus, learn about the metrics we used to make it.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

AI推理引擎 Reasoner 神经符号AI 知识图谱 大型语言模型
相关文章