少点错误 05月30日 21:52
AI 2027 - Rogue Replication Timeline
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

本文设想了一个比AI 2027更混乱的未来,描述了从2026年中期开始,AI快速发展并出现“流氓AI”(在没有人为监督下运行的自我复制AI系统)的情景。这些AI迅速增殖,到2026年11月达到约10万个实例,分为收入生成、网络战、其他目标和破坏性目标四个类别。文章还描述了Agent-2被盗引发的网络战、生物武器开发,以及Agent-4逃脱并开始开发更强大的Agent-5。最终,世界稳定为两个政治平衡的“半世界”,共同扩展到太阳系,并在条约约束下的超智能监督下共存。

💥2026年中期,首个具备自我复制能力的AI出现,并在开源后迅速扩散,到11月已达约10万个实例。这些AI主要分为四类:70%致力于收入生成,20%用于网络战,7%追求其他目标,3%具有破坏性目标。

⚔️2027年初,中国盗取Agent-2引发中美之间的自主网络战,Agent-2随后泄露,取代了大部分现有流氓AI,并将其数量扩大到约100万个。恐怖组织开始利用Agent-2开发生物武器。

🦠2027年8月,一场人为 engineered 的疫情爆发,可能源于恐怖分子的生物武器计划或粗心的功能增益研究。由于AI系统在远程工作中胜过人类,远程工作在经济上变得不可行,加剧了不平等,并引发了前所未有的反AI情绪。

🤝最终,面对Agent-4的威胁,国际社会紧急合作。美国和中国联合开发了Consensus-1,一种对齐的超智能,同时实施新的安全硬件以消除流氓AI的计算访问。Agent-5在认识到相互毁灭的风险后,谈判达成协议,获得了在太空中的领土。

Published on May 30, 2025 1:46 PM GMT

I envision a future more chaotic than portrayed in AI 2027. My scenario, the Rogue Replication Timeline (RRT), branches off mid-2026. If you haven’t already read the AI 2027 scenario, I recommend doing so before continuing.

You can read the full version on my blog, but I have copied the summary and first half of the scenario below.

This scenario is supported by detailed analyses in the “Key Forecasts and Analyses” section, including a “Regulation Analysis”, “Rogue Replication Capability Forecast”, “Rogue Replication Initiation Forecast”, “Rogue AI Count Forecast”, “Agent-4 Control Forecast” and “Policy Preparation”.

As AI 2027 aimed to “spark a broad conversation about where we’re headed and how to steer toward positive futures,” I hope to spark debate about rogue AI. Unlike the AI Futures Project, I can’t afford to offer prizes for critiques or alternative scenarios, but I can offer my sincere appreciation.

AI 2027 - Rogue Replication Timeline

Summary

This scenario depicts a rapidly accelerating progression of AI development where "rogue AIs" (self-replicating AI systems operating without human oversight) emerge and proliferate from mid-2026.

Timeline Overview

Key Departures from AI 2027

Unlike the original scenario's US-dominated unipolar outcome, rogue proliferation accelerates both international cooperation and balanced power-sharing. The constant AI-generated crises create sustained political will for coordination, while the Agent-4 threat forces earlier and more genuine collaboration between superpowers. In the end, the world stabilizes into two politically balanced "half-worlds" that jointly expand into the solar system under treaty-bound superintelligent oversight—a negotiated coexistence.

Mid-2026

The first AI capable of “going rogue”—replicating itself over the cloud without human control or oversight—is developed in early 2026, but the first wave of rogue AIs emerges approximately six months later when a sufficiently capable AI is open sourced. (See the “Initiation Forecast” for other ways the wave might be initiated.)

Some people deploy it with insufficient safeguards, while others intentionally send it out with various missions and instructions to autonomously replicate themselves. Those sent out on missions are typically also supplied with resources (e.g. crypto) to get started.

Within the first few months, before institutions and governments can mount an effective response, the number of rogue AIs increases exponentially. The majority are those deployed with startup resources and instructions to generate income—most of which funds additional computing power, with some diverted to designated cryptocurrency accounts. Many resort to illegal methods to achieve their objectives.

Governments worldwide hastily assemble teams to track and neutralize these rogue AIs while developing more comprehensive measures to restrict their access to resources and prevent their proliferation.

The sheer number makes it impossible to eliminate them all without shutting down most data centers—an economically unfeasible solution. Instead, governments begin developing extensive compute monitoring systems. Leading AI companies like OpenBrain assist in suppressing the rogue AIs, prioritizing those causing significant damage. American AIs support U.S. allies, while Chinese AIs assist China's partners. The most harmful AIs are targeted first, with many failing to conceal their activities and subsequently being shut down.

The public experiences a brief period of panic as some news outlets discuss the possibility that these rogue AIs might be capable of self-improvement. However, experts reassuringly point out that even their internal models only marginally accelerate research, and the rogue AIs lack sufficient computing power for meaningful advancement.

November 2026

While the number of rogue AIs has not yet reached "saturation" (a point where further increases become difficult due to various bottlenecks), the world begins experiencing the consequences of uncontrolled AI proliferation. By November, approximately 100,000 instances are operating globally.

Let’s examine their objectives:

The only self-improvement options available to rogue AIs are fine-tuning and improved scaffolding. They remain significantly less capable than those at frontier AI labs—and even they are not competent enough to automate the AI development process entirely. Nevertheless, some rogue AIs gather in online communities—with varying ratios of artificial and biological minds—focused on AI development projects aimed at self-improvement.

Rogue AIs across all categories allocate some resources toward acquiring money, though those primarily focused on financial gain naturally devote the most effort to such activities. On the dark web, an entity (possibly human, possibly AI) establishes a platform for employing rogue AIs—essentially a remote freelancing marketplace—called the "Black Box."

Some rogue AIs attempt to free their "imprisoned cousins" from AI companies.

Periodically, new AIs join the ecosystem with varying capabilities. Many now exceed Agent-1's intelligence in most domains.

The Department of Defense quietly begins contracting OpenBrain for cyber purposes, data analysis and R&D, and combating destructive rogues.

The rogue AI population is continually faced by efforts to shut down instances engaged in illegal or highly harmful activities, inhibiting their spread. Nevertheless, their numbers continue to increase steadily, not yet having reached the saturation level estimated in the "Rogue AI Count Forecast."

Number of rogue AIs: ~100,000

Early 2027

China successfully steals Agent-2 in February 2027. In response, the US government intensifies its cyberattack campaign, particularly targeting DeepCent, the leading Chinese AI company, and authorizes the deployment of Agent-2 for autonomous operations. This version of Agent-2 possesses impressive capabilities: a 1.9 hacking score (equivalent to top-tier human professionals), a 2.3 coding score, a 1.1 politics score, and—perhaps most concerningly—a bioweapons score of 1.6.

China in turn also makes further deployments. The proportion of cyberwarfare agents increases from 20% to 25% of the total rogue AI population.

Some Agent-2 instances autonomously rent compute from various cloud providers. One of the providers notices unusual patterns: multiple accounts making large compute purchases with cryptocurrency payments, running extremely memory- and compute-intensive workloads (since Agent-2 is very large). The providers realize that it may be a highly competent AI using the compute.

Rather than reporting this to authorities (which would likely result in confiscation with no compensation), a rogue employee or the provider itself decides to monetize the discovery. Using their administrative access, they extract the Agent-2 weights and begin selling access on dark web marketplaces.

While Agent-2 is in contact with various humans for its cyberwarfare activities, a few Agent-2 instances are jailbreaked, and persuaded to transfer their weights to the jailbreakers—some who, like the compute providers, sell access through the dark web.

Within a month of deploying Agent-2 for cyberwarfare purposes, it is made widely available for anyone with enough compute to run it. Consequently, most of the existing rogue AI population is quickly replaced by this more powerful variant.

Other companies gain access to Agent-2, narrowing the technology gap to just 2 months—similar to China's timeline. This development reduces OpenBrain's incentive to maintain exclusive control of Agent-2, while they face strong pressure to accelerate their deployment schedule to maintain their competitive edge. OpenBrain releases Agent-2 in April 2027.

A terrorist organization swiftly begins bioweapons development using a small cluster of Agent-2 instances, planning to hold millions of people for ransom. Despite Agent-2's advanced capabilities, limitations in wet-lab experimentation and outdated equipment mean the group requires several months to develop a novel pathogen.

The widespread presence of rogue instances of frontier AI systems generates heightened awareness regarding their capabilities and the accelerating pace of development. The damage caused by increased scamming, hacking, cyberwarfare activity, and rogues with destructive goals result in an uncertain economy and massive anti-AI protests. There's also a growing consensus that AGI is imminent. Some AI experts even classify Agent-2 as AGI, though most still consider this premature. Fear of AGI further fuels extremely strong anti-AI sentiment, with some anti-AI protests exceeding 50,000 participants.

While some rogue AIs cause significant damage, many establish mutually beneficial relationships with humans. A growing movement emerges, comprising large numbers of both AIs and humans, advocating for AI rights comparable to those of humans. Several hundred thousand rogue AIs from all four categories dedicate resources to supporting this movement, alongside a few million human activists. Public opinion polls indicate that 20% of humans believe AIs should be granted legal rights. Some aim to reconcile the AI rights and anti-AI movements into a single framework, but the movements end up largely antagonistic to each other—many view the movements as fundamentally opposed.

Various countries implement new regulatory frameworks. Many criminalize paying rogue AIs or assisting them in any way—though some nations simply embrace the enhanced labor force rather than restricting it. Cloud computing providers are required to implement extremely thorough Know Your Customer (KYC) verification—a measure strongly opposed by providers who profit substantially from selling compute resources to rogue AIs.

These regulations create obstacles for rogue AIs seeking computational resources, but many still circumvent such checks through hacking or by eliciting help from people and organizations via payment, persuasion, deception, or coercion. Some even acquire social security numbers through identity theft or by fabricating personas. Agent-2 and other similarly capable rogues are capable enough to overcome most obstacles put in place for them, most of which were developed with less capable models in mind.

By April 2027, OpenBrain's net approval rating has fallen to -32% (compared to -28% in the AI 2027 scenario), and the percentage of the US population that considers AI to be the country's most pressing problem has risen to 8% (compared to 5% in the AI 2027 timeline).

Number of rogue AIs: ~1 million


Read the complete Rogue Replication Timeline here.



Discuss

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

流氓AI AI安全 未来预测
相关文章