All Content from Business Insider, 23 hours ago
Top AI researchers say language is limiting. Here's the new kind of model they are building instead.

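This article focuses on an emerging research direction in artificial intelligence: world models. Unlike large language models, which rely on language data, world models aim to build human-like mental models by understanding and predicting events in the real world. AI scientists such as Fei-Fei Li and Yann LeCun are actively exploring the field, and world models could play an important role in creative work, robotics, and other areas. Building them, however, is hampered by a scarcity of data and requires more sophisticated engineering to acquire and process it.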

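🧠 World models are a new direction in AI research: they mimic human mental models to predict real-world events, unlike large language models, which rely on language data.

💡 Scientists including Stanford's Fei-Fei Li and Meta's Yann LeCun are building world models. Li's World Labs aims to lift AI models from 2D pixels to 3D worlds, giving them spatial intelligence.

🚧 Building world models poses a data challenge: because humans have understood and documented spatial intelligence less thoroughly than language, it requires more sophisticated data engineering, acquisition, processing, and synthesis.

⚙️ LeCun's team at Meta trains models on video data and runs simulations that abstract the video at different levels, which simplifies prediction and supports building more intelligent AI.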

Fei-Fei Li, a pioneer in AI research, is working to develop a "world" model, which trains on data beyond just language.

As OpenAI, Anthropic, and Big Tech invest billions in developing state-of-the-art large language models, a small group of AI researchers is working on the next big thing.

Computer scientists like Fei-Fei Li, the Stanford professor famous for inventing ImageNet, and Yann LeCun, Meta's chief AI scientist, are building what they call "world models."

Unlike large language models, which determine outputs based on statistical relationships between the words and phrases in their training data, world models predict events based on the mental constructs that humans make of the world around them.

"Language doesn't exist in nature," Li said on a recent episode of Andreessen Horowitz's a16z podcast. "Humans," she said, "not only do we survive, live, and work, but we build civilization beyond language."

The computer scientist and MIT professor Jay Wright Forrester explained in his 1971 paper "Counterintuitive Behavior of Social Systems" why mental models are crucial to human behavior:

Each of us uses models constantly. Every person in private life and in business instinctively uses models for decision making. The mental images in one's head about one's surroundings are models. One's head does not contain real families, businesses, cities, governments, or countries. One uses selected concepts and relationships to represent real systems. A mental image is a model. All decisions are taken on the basis of models. All laws are passed on the basis of models. All executive actions are taken on the basis of models. The question is not to use or ignore models. The question is only a choice among alternative models.

If AI is to meet or surpass human intelligence, then the researchers behind it believe it should be able to make mental models, too.

Li has been working on this through World Labs, which she cofounded in 2024 with an initial backing of $230 million from venture firms like Andreessen Horowitz, New Enterprise Associates, and Radical Ventures. "We aim to lift AI models from the 2D plane of pixels to full 3D worlds — both virtual and real — endowing them with spatial intelligence as rich as our own," World Labs says on its website.

Li said on the No Priors podcast that spatial intelligence is "the ability to understand, reason, interact, and generate 3D worlds," given that the world is fundamentally three-dimensional.

Li said she sees applications for world models in creative fields, robotics, or any area that warrants infinite universes. As at Meta, Anduril, and other Silicon Valley heavyweights, that could mean advances in military applications, helping those on the battlefield better perceive their surroundings and anticipate their enemies' next moves.

The challenge of building world models is the scarcity of data. In contrast to language, which humans have refined and documented over centuries, spatial intelligence is less developed and less well documented.

"If I ask you to close your eyes right now and draw out or build a 3D model of the environment around you, it's not that easy," she said on the No Priors podcast. "We don't have that much capability to generate extremely complicated models till we get trained."

To gather the data necessary for these models, "we require more and more sophisticated data engineering, data acquisition, data processing, and data synthesis," she said.

That makes the challenge of building a believable world even greater.

At Meta, chief AI scientist Yann LeCun has a small team dedicated to a similar project. The team uses video data to train models and runs simulations that abstract the videos at different levels.

"The basic idea is that you don't predict at the pixel level. You train a system to run an abstract representation of the video so that you can make predictions in that abstract representation, and hopefully this representation will eliminate all the details that cannot be predicted," he said at the AI Action Summit in Paris earlier this year.

That creates a simpler set of building blocks for mapping out trajectories for how the world will change at a particular time.
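The idea LeCun describes, predicting in an abstract representation instead of at the pixel level, can be illustrated with a toy sketch. This is not Meta's method or code: the one-dimensional "video", the hand-written `encode` function (a learned encoder would take its place in a real system), and the constant-velocity predictor are all assumptions made for illustration.

```python
import random

WIDTH = 32  # pixels per one-dimensional "frame" (hypothetical toy size)

def render_frame(position, rng, noise=0.2):
    """Render a 1-D frame: one bright object pixel plus random background noise."""
    frame = [rng.uniform(0.0, noise) for _ in range(WIDTH)]
    frame[position % WIDTH] = 1.0
    return frame

def encode(frame):
    """Toy stand-in for an encoder: keep the object's position, drop the noisy pixels."""
    return max(range(len(frame)), key=lambda i: frame[i])

def predict_abstract(prev_state, curr_state):
    """Predict the next abstract state, assuming the object keeps a constant velocity."""
    velocity = (curr_state - prev_state) % WIDTH
    return (curr_state + velocity) % WIDTH

def predict_pixels(prev_frame, curr_frame):
    """Pixel-level baseline: linearly extrapolate every pixel value."""
    return [2 * c - p for p, c in zip(prev_frame, curr_frame)]

def mean_squared_error(a, b):
    """Average squared difference between two equally long sequences."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)

rng = random.Random(0)
# Three consecutive frames; the object moves 3 pixels per step.
frames = [render_frame(3 * t, rng) for t in range(3)]

# Prediction in the abstract representation: the unpredictable noise is ignored.
predicted_state = predict_abstract(encode(frames[0]), encode(frames[1]))
actual_state = encode(frames[2])

# Prediction at the pixel level: the model is also asked to predict the noise.
pixel_error = mean_squared_error(predict_pixels(frames[0], frames[1]), frames[2])

print("abstract prediction:", predicted_state, "actual:", actual_state)
print("pixel-level MSE:", round(pixel_error, 4))
```

In this toy setting the abstract prediction matches the next frame's object position exactly, while the pixel-level baseline always carries some error because it tries to predict background noise that cannot be predicted.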

LeCun, like Li, believes these models are the only way to create truly intelligent AI.

"We need AI systems that can learn new tasks really quickly," he said recently at the National University of Singapore. "They need to understand the physical world — not just text and language but the real world — have some level of common sense, and abilities to reason and plan, have persistent memory — all the stuff that we expect from intelligent entities."

Read the original article on Business Insider
