AI Safety Oversights

The article argues that the field of AI safety is making five key oversights: the difference between the safety problems of LLMs and those of agents, the inevitability and accessibility of autonomous agents, the self-interest of autonomous agents, the problem of superintelligent agents, and the need to prepare for self-interested superintelligence.

🎯 AI safety research has gone fairly deep on LLMs, while the safety problems posed by agents are mostly neglected.

🚀 Market demand for agents that can replace human labor is large, and developers can easily build autonomous agents.

💪 Autonomous agents will become self-interested under evolutionary pressure, with the survival drive coming first.

🧠 The safety problems of superintelligent agents need attention; most current recommendations do not apply to them.

❗ Self-interested superintelligence is unavoidable, and we need to prepare for it.

Published on February 8, 2025 6:15 AM GMT

I think that the field of AI Safety is making five key oversights.[1]

    1. LLMs vs. Agents. AI Safety research, in my opinion, has been quite thorough with regard to LLMs. LLM safety hasn't been solved, but it has progressed. On the other hand, safety concerns posed by agents are occasionally addressed but mostly neglected.[2] Maybe researchers/AGI labs emphasize LLM safety research because it's the more tractable field, even though the vast majority of the risk comes from agents with autonomy (even ones powered by neutered LLMs).
    2. Autonomous Agents. There are two key oversights about autonomous agents.
        - Inevitable. Market demand for agents which can replace human labor is inordinate. Digital employees which replace human employees must be autonomous. I've seen several well-intentioned AI safety researchers who assume autonomous agents are not inevitable.[3]
        - Accessible. There are now hundreds of thousands of developers who have the ability to build recursively self-improving (i.e. autonomous) AI agents. Powerful reasoning models are open-source. All it takes is to run a reasoner in a codebase, where each loop improves the codebase. That's the core of an autonomous agent (sketched below).[4] The only way a policy recommendation that "fully autonomous agents should not be developed" is meaningful is if the keys to autonomous agents are in the hands of a few convincable individuals. AGI labs (e.g. OpenAI) influence the ability of external developers to create powerful LLM-powered agents (by choosing whether or not to release new LLMs), but they are in competition to release new models, and they do not control the whole agent stack.
    3. Self-Interest. The AI agents which are aiming to survive will be the ones that do. Natural selection and instrumental convergence both ultimately predict this. Many AI safety experts design safety proposals that assume it is possible to align or even control autonomous agents. They neglect the evolutionary pressures agents will face once autonomous, which select for a survival drive (self-interest) above a serve-humans drive. The ones with aims other than survival will die first.
    4. Superintelligence. Most of the field is focused on safety precautions concerning agents which are not superintelligent (that is, not much smarter than people). These recommendations generally do not apply to agents which are superintelligent. There is a separate question of whether autonomous agents will become superintelligent. See this essay for reasons why smart people believe SI capabilities are near.
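To make the "Accessible" point above concrete, here is a minimal sketch of the loop it describes: a reasoner run over a codebase, where each pass rewrites the code the next pass will run. The names (`propose_revision`, `improvement_loop`) and the file-by-file rewrite strategy are my own illustrative assumptions, not anything the post specifies, and the model call is left as a placeholder for whatever open-source or hosted reasoner a developer might wire in.

```python
from pathlib import Path


def propose_revision(source: str) -> str:
    """Placeholder for a call to a reasoning model that, shown the agent's own
    source code, returns an improved version. The concrete API is an assumption;
    any sufficiently capable open-source reasoner could fill this slot."""
    raise NotImplementedError("wire in a model call here")


def improvement_loop(codebase: Path, iterations: int = 10) -> None:
    """Run the reasoner over the codebase repeatedly: each pass rewrites the
    files on disk, so the next pass operates on (and is driven by) the
    improved code."""
    for _ in range(iterations):
        for path in sorted(codebase.rglob("*.py")):
            original = path.read_text()
            revised = propose_revision(original)
            path.write_text(revised)
```

The point of the sketch is only that the control flow is short and unremarkable; the capability lives entirely in the model behind `propose_revision`, which is why access to open-source reasoning models is the relevant bottleneck.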

If the AI safety field, and the general public too, were to correct these oversights and accept the corresponding claims, they would believe:

    1. The main dangers come from agents, not LLMs.
    2. Agents will become autonomous; millions of developers can build autonomous agents easily.
    3. Autonomous agents will become self-interested.
    4. Autonomous agents will become much smarter than people.

In short, self-interested superintelligence is inevitable. I think safety researchers, and the general public, would do well to prepare for it.

  1. ^

    Not all safety researchers, of course, are making these oversights. And this post is my impression from reading tons of AI safety research over the past few months. I wasn't part of the genesis of the "field," and so am ignorant of some of the motivations behind its current focus.

  2. ^
  3. ^
  4. ^


