"AI achieves silver-medal standard solving International Mathematical Olympiad problems"

少点错误 2024年07月26日

../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

谷歌 DeepMind 发布了两个名为 AlphaProof 和 AlphaGeometry 2 的数学问题求解系统，它们成功地解决了 2024 年国际数学奥林匹克竞赛 (IMO) 中的四道题，成绩接近银牌水平。AlphaProof 是一款通用系统，而 AlphaGeometry 2 专注于几何问题。这两个系统都通过生成形式化的证明步骤并使用形式化证明检查系统 (如 Lean) 进行验证来工作。AlphaProof 解决了 IMO 的问题 1、2 和 6，而 AlphaGeometry 2 解决了问题 4。DeepMind 公开了这些系统生成的完整解决方案，并提供了链接供公众查看。

😊 **AlphaProof 和 AlphaGeometry 2 的工作原理：** AlphaProof 是一款通用系统，通过生成形式化的证明步骤并使用形式化证明检查系统 (如 Lean) 进行验证来解决数学问题。AlphaGeometry 2 则专门针对几何问题，使用专门的系统替代 Lean。这两个系统都使用了一种独特的训练循环，在竞赛期间不断强化对自生成问题变体的证明，直到找到完整解决方案。

🤔 **AlphaProof 的成果：** AlphaProof 成功地解决了 2024 年 IMO 的问题 1、2 和 6，这些问题属于代数和数论领域。它生成的证明是用 Lean 形式化的，并已公开发布。

🚀 **AlphaGeometry 2 的成就：** AlphaGeometry 2 成功地解决了 2024 年 IMO 的问题 4，一个平面几何问题。它生成的证明使用的是它自己的符号系统，并已公开发布。

🏆 **AI 在数学领域的表现：** DeepMind 的这两个系统展示了人工智能在解决复杂数学问题方面的潜力。它们能够生成完整的证明，并接近 IMO 的银牌水平，这表明 AI 在数学领域已经取得了显著的进展。

Published on July 25, 2024 3:58 PM GMT

Google DeepMind reports on a system for solving mathematical problems that allegedly is able to give complete solutions to four of the six problems on the 2024 IMO, putting it near the top of the silver-medal category.

Well, actually, two systems for solving mathematical problems: AlphaProof, which is more general-purpose, and AlphaGeometry, which is specifically for geometry problems. (This is AlphaGeometry 2; they reported earlier this year on a previous version of AlphaGeometry.)

AlphaProof works in the "obvious" way: an LLM generates formalized versions of the things to be proved, and generates candidate next steps which are checked using a formal proof-checking system, in this case Lean. One not-so-obvious thing, though: "The training loop was also applied during the contest, reinforcing proofs of self-generated variations of the contest problems until a full solution could be found."

AlphaGeometry is similar but uses something specialized for (I think) Euclidean planar geometry problems in place of Lean. The previous version of AlphaGeometry allegedly already performed at gold-medal IMO standard; they don't say anything about whether that version was already able to solve the 2024 IMO problem that was solved using AlphaGeometry 2.

AlphaProof was able to solve questions 1, 2, and 6 on this year's IMO (two algebra, one number theory). It produces Lean-formalized proofs. AlphaGeometry 2 was able to solve question 4 (plane geometry). It produces proofs in its own notation.

The solutions found by the Alpha... systems are at https://storage.googleapis.com/deepmind-media/DeepMind.com/Blog/imo-2024-solutions/index.html. (There are links in the top-of-page navbar to solutions to the individual problems.)

(If you're curious about the IMO questions or want to try them yourself before looking at the machine-generated proofs, you can find them -- and those for previous years -- at https://www.imo-official.org/problems.aspx.)

Discuss

Fish AI Reader

FishAI

联系邮箱 441953276@qq.com

相关标签