少点错误 2024年08月06日
Reasoning is not search - a chess example
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

文章探讨了人工智能系统中搜索与推理的关系,并以作者自身下棋的经历为例说明人类推理并非基于纯粹的搜索,而是通过设定约束条件来缩小搜索范围,从而更有效地找到最佳方案。文章认为,现有的AI系统主要依赖于暴力搜索,无法真正模拟人类的推理过程,这可能在解决开放式问题时会遇到瓶颈。

🤔 **人类推理并非纯粹的搜索**:作者以自身下棋的经历为例,展示了人类在推理决策时并非通过穷举所有可能性进行搜索,而是通过设定约束条件来缩小搜索范围。例如,在棋局中,作者考虑了时间限制、自身棋子安全、对手实力等因素,并根据这些因素设定了行动的约束条件,从而有效地找到最佳的棋步。

💡 **约束条件超越棋盘局势**:作者指出,人类在推理时所设定的约束条件并不局限于棋盘上的局势,还会考虑时间、对手、自身状态、环境等多种因素。这种灵活的约束条件能够有效地处理复杂问题,并找到最优解决方案。

💪 **约束推理的强大之处**:文章认为,约束推理能够有效地处理搜索空间过大、无法进行穷举搜索的问题。通过设定约束条件,人类可以有效地缩小搜索范围,从而快速找到最佳方案。这与现有的AI系统中采用暴力搜索的方式形成了鲜明的对比。

🤔 **AI推理的局限性**:作者指出,现有的AI系统主要依赖于暴力搜索,无法真正模拟人类的推理过程。这种方法在解决简单问题时可能有效,但对于开放式问题,这种方法可能会遇到瓶颈。作者认为,未来的AI系统应该更加重视约束推理,才能真正实现像人类一样思考和解决问题的能力。

Published on August 6, 2024 9:29 AM GMT

In the past AI systems have reached super human performance by adding search to neural networks while the network alone could not reach the level of the best humans. 

At least this seems to be the case for AlphaGo, AlphaZero/Leela, AlphaGeometry and probably more, while AlphaStar and OpenAI Five where adding search was not easily possible have failed to reach convincing super human status. 

This has lead some people to expect that adding a form of search to large language models would finally endow them with the ability to reason through complicated problems and allow them to surpass humans. 

The Q* system of OpenAI is rumored to somehow integrate search and following this rumor several papers were published that aim at integrating Monte Carlo Tree search in the sampling process of LLMs. 

However, human-like reasoning is not search. At least not search as implemented in these systems. To illustrate this point I want to go through a chess combination I played online a couple of months ago and describe my reasoning move by move. 

In the following position I am down material and have little time left and my king is relatively unsafe. Two of my pieces aim at the g2 pawn behind which the white king is hiding. 

However, after Bg3 the g-line would be plugged and it would become very difficult to get at the white king. 

Therefore I needed to strike the iron while it was hot by moving my d5-rook, freeing the bishop to attack the queen and g2-pawn behind it. 

Most of the rook moves allow a defensive maneuver: Qh3 check and the Bg3. Three rook moves avoid this: Rd7, Rd3 and Rh5. Rd7 still allows Qh3 to then give the queen for rook and bishop, therefore I sacrificed my rook with Rh5, which was promptly taken with the queen.

Now down a rook I have to play in a forcing manner: Rxg2 Kh1. Bxg2 instead would allow Bg3 closing the g-line. The king has to move to h1 and now I can move the rook to give a discovered check with the bishop. 

Most of these discovered checks are parried with f3, except Rxf2. So I play that. The king has to move back to g1 and I give a check on g2 again driving the king back to h1. Now the f3-parry against the discovered check has been removed from the position. But if I move the rook down the g-line Rf3 will parry the check instead supported by the queen on h5.

Only one move does not allow the queen to support the rook on f3. That's rook g4 - breaking the connection between the queen and f3.

Now, white can only give away the queen and the rook before being mated. He played Rf3 and I played Bxf3 mate. 

What is striking to me about my reasoning is how little "tree search" is going on. And this is chess! A game very amenable to search, where also for human players calculating ahead is an integral part of finding good moves. 

Instead my reasoning revolves entirely around finding constraints for my future moves and then finding moves that satisfy these constraints. 

These constraints flexibly transcend the position on the board. 

I consider how much time I have left and in a different game I might consider things like the tournament standing, the strength or style of my opponent, aesthetics,  self-improvement, air quality (I gotta get outta here), and much more. 

My impression is that much human reasoning is of that form and for a simple reason: It's very powerful. Constraining the search space until you don't actually have to search anymore allows you to handle search spaces that are not amenable to pure search. 

To quote Russel/Norvig: 

In most practical applications, however, general-purpose CSP [Constraint Satisfaction Problem] algorithms can solve problems orders of magnitude larger than those solvable via the general-purpose search algorithms[...].

Search in existing AI systems brute forces human reasoning and does not replicate it. This may often be enough. I suspect for open-ended problems it wont be. 



Discuss

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

人工智能 推理 搜索 约束
相关文章