Scaling Wargaming for Global Catastrophic Risks with AI

Published on January 18, 2025 3:10 PM GMT

We at Sentinel are developing an AI-enabled wargaming-tool, grim, to significantly scale up the number of catastrophic scenarios that concerned organizations can explore and to improve emergency response capabilities of, at least, Sentinel.

How AI Improves on the State of the ArtImplementation Details, Limitations, and ImprovementsLearnings So FarGet Involved!

How AI Improves on the State of the Art

In a wargame, a group dives deep into a specific scenario in the hopes of understanding the dynamics of a complex system and understanding weaknesses in responding to hazards in the system. Reality has a surprising amount of detail, so thinking abstractly about the general shapes of issues is insufficient. However, wargames are quite resource intensive to run precisely because they require detail and coordination.

Eli Lifland shared with us some limitations about the exercises his team has run, like at The Curve conference:

It took about a month of total person-hours to iterate of iterating on the rules, printouts, etc.They don’t have experts to play important roles like the Chinese government and occasionally don’t have experts to play technical roles or the US government.Players forget about important possibilities or don’t know what actions would be reasonable.There are a bunch of background variables which would be nice to keep track of more systematically, such as what the best publicly deployed AIs from each company are, how big private deployments are and for what purpose they are deployed, compute usage at each AGI project, etc. For simplicity, at the moment they only make a graph of best internal AI at each project (and rogue AIs if they exist).It's effortful for them to vary things like the starting situation of the game, distribution of alignment outcomes, takeoff speeds, etc.

AI can significantly improve on all the limitations above, such that more people can go through more scenarios faster at the same quality. One can also prompt AIs for stranger scenarios much more easily than one can people. At the end of the day, AI will still be probing a space with priors, but people have more rigid priors and get tired during sampling.

In line with Sentinel’s thesis that we are collectively underappreciating unknown-unknowns and their interactions, running 5-100x more serious wargames in a wider range of scenarios means that many more chances to find buried grains of truth to coordinate around.

Implementation Details, Limitations, and Improvements

grim is a telegram bot. This has the advantage that all a new user has to do is join a chat.

Example of scenario setup, action submission, results.

There are three ways of interacting with the bot:

ACTION

INFO

FEED

On the backend, the output of the bot is the result of a pipeline of

“Forecaster” LLM calls that generate outcomes of actions and sample from them

The Forecaster LLM generating outcomes and their respective weights for Nuño’s research action.

We’re constantly finding ways to improve the usefulness of the tool. A good next step could be to build Eli’s suggestion of including expert agents to increase the realism of key live-players or important institutions.

Some Learnings So Far

Shelf-stable food and waterPersonal protective equipment for biological threats

A “go bag” with suppliesPredetermined destinations and meeting points

out-of-the-money VXX calls

bird flu risk

Permissionless analog systems like amateur radio require delicate reflection off oft-shifting atmospheric effects to reach targets past line-of-sight. Transmitting on frequencies that pass through the earth is more unwieldy in terms of power and antenna setup and are also de jure off-limits to civilians. In some emergencies there may be few with the capacity to enforce this ban, however.Uncorrelated networking stacks could be very useful if sufficient trunks of the physical layer of the internet are intact. Much like how AI benchmarks retain secret questions so as not to contaminate training runs, an unpublished operating system and networking stack could be very useful in a rogue AI scenario. To that end we’ve briefly discussed the notion of “SentinelOS” which would focus on communication, coordination, and continuity of humanity. We’re not sure how much to invest in the idea but if you have thoughts on this or think you’d be the right person to contribute to the design/implementation of this artifact, please reach out.

Get Involved!

If you are working in the GCR space, we'd love if you reached out at u>hello@sentinel-team.org</u with expressions of interest for participating in a wargame with Sentinel, running a wargame for a scenario and players of your choosing, or interest in contributing to our repository. If you’re involved in emergency response in particular, it’d be great to be able to stress test your responses.

Discuss

Table of Contents

How AI Improves on the State of the Art

Implementation Details, Limitations, and Improvements

Some Learnings So Far

Get Involved!

Fish AI Reader

FishAI

联系邮箱 441953276@qq.com

相关标签