少点错误 01月18日
Scaling Wargaming for Global Catastrophic Risks with AI
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

Sentinel团队正在开发一款名为Grim的AI驱动战争推演工具,旨在显著增加组织可以探索的灾难性场景数量,并提升应急响应能力。该工具通过模拟复杂的系统动态,帮助用户理解系统弱点,并提供更有效率的危机应对方案。Grim作为一个Telegram机器人,用户可以通过它进行行动、获取信息和输入假设。其后端由多个LLM调用组成,包括生成结果的“预测器”和构建叙事的“游戏大师”。该工具旨在通过AI克服传统战争推演的局限性,例如资源密集、专家依赖和人为疏忽等问题,从而提高危机应对的效率和效果。

🤖 Grim是一个AI驱动的战争推演工具,旨在通过模拟复杂的系统动态,帮助用户理解系统弱点,并提高危机应对能力,它利用AI克服传统战争推演的资源密集、专家依赖等局限性。

🕹️ Grim的核心交互方式包括:ACTION(采取行动推进游戏)、INFO(获取世界信息)和FEED(输入假设信息)。后端由“预测器”LLM生成行动结果,并由“游戏大师”LLM构建叙事,从而提供沉浸式体验。

💡 该工具的应用不仅限于模拟危机场景,还能帮助用户在危机前做好准备,包括储备物资、制定财务策略和建立通讯网络。同时,强调了个人在危机中的重要性,建议提前做好物理安全和财务安全准备。

📡 文章还探讨了在危机中通信的重要性,包括使用业余无线电等模拟系统,以及开发独立的网络堆栈,以应对网络基础设施脆弱和AI干扰等挑战。并讨论了开发“SentinelOS”的可能性,以保障危机中的通信和协调。

Published on January 18, 2025 3:10 PM GMT

We at Sentinel are developing an AI-enabled wargaming-tool, grim, to significantly scale up the number of catastrophic scenarios that concerned organizations can explore and to improve emergency response capabilities of, at least, Sentinel.

Table of Contents

    How AI Improves on the State of the ArtImplementation Details, Limitations, and ImprovementsLearnings So FarGet Involved!

How AI Improves on the State of the Art

In a wargame, a group dives deep into a specific scenario in the hopes of understanding the dynamics of a complex system and understanding weaknesses in responding to hazards in the system. Reality has a surprising amount of detail, so thinking abstractly about the general shapes of issues is insufficient. However, wargames are quite resource intensive to run precisely because they require detail and coordination.

Eli Lifland shared with us some limitations about the exercises his team has run, like at The Curve conference:

    It took about a month of total person-hours to iterate of iterating on the rules, printouts, etc.They don’t have experts to play important roles like the Chinese government and occasionally don’t have experts to play technical roles or the US government.Players forget about important possibilities or don’t know what actions would be reasonable.There are a bunch of background variables which would be nice to keep track of more systematically, such as what the best publicly deployed AIs from each company are, how big private deployments are and for what purpose they are deployed, compute usage at each AGI project, etc. For simplicity, at the moment they only make a graph of best internal AI at each project (and rogue AIs if they exist).It's effortful for them to vary things like the starting situation of the game, distribution of alignment outcomes, takeoff speeds, etc.

AI can significantly improve on all the limitations above, such that more people can go through more scenarios faster at the same quality. One can also prompt AIs for stranger scenarios much more easily than one can people. At the end of the day, AI will still be probing a space with priors, but people have more rigid priors and get tired during sampling.

In line with Sentinel’s thesis that we are collectively underappreciating unknown-unknowns and their interactions, running 5-100x more serious wargames in a wider range of scenarios means that many more chances to find buried grains of truth to coordinate around.

Implementation Details, Limitations, and Improvements

grim is a telegram bot. This has the advantage that all a new user has to do is join a chat.

Example of scenario setup, action submission, results.

There are three ways of interacting with the bot:

On the backend, the output of the bot is the result of a pipeline of

    “Forecaster” LLM calls that generate outcomes of actions and sample from them

    The Forecaster LLM generating outcomes and their respective weights for Nuño’s research action.

    a “Game Master” LLM whose job it is to weave things into a narrative.

We’re constantly finding ways to improve the usefulness of the tool. A good next step could be to build Eli’s suggestion of including expert agents to increase the realism of key live-players or important institutions.

Some Learnings So Far

    There are benefits to increasing the number of people who will take action in a crisis besides the obvious of increasing effort applied to the problem. People take different actions early in the game and that’s good for finding low-hanging fruit and unusually effective interventions. I—Rai—like to understand what influential players like billionaires, heads of state, and their influencers are saying so I can think about what’s likely to get left behind or get made worse by their efforts. Nuño likes the action of tweeting out a warning early and trying to build a group around the issue. When we had guests participate in early rounds, sometimes they were one hop away from people who were uncontactable by us on short notice in practice. One participant in a bio exercise tried to reach out to their contacts at WHO, CEPI, Gates Foundation to weigh in on the state of a growing outbreak.Preparing yourself and your loved ones for emergencies beforehand broadly allows you to be a live player more quickly and sanely. Since we played as ourselves during these scenarios, often the first thing we did was make sure we and our loved ones were physically safe. If you were at all hoping to be a live player during crises, we think taking basic precautions sooner rather than later would be a great idea. Get yourself and your loved ones
      Shelter-in-place capacity
        Shelf-stable food and waterPersonal protective equipment for biological threats
      Mobilization capacity
        A “go bag” with suppliesPredetermined destinations and meeting points
      Financial resilience
        A preference for more liquid assets in general and having some cash in particular.Predetermined financial strategies so that you spend crucial moments acting on the world rather than trying to act on financial markets. Our preferred strategy right now is to place a bet on out-of-the-money VXX calls. We’re not finance experts so we welcome critiques here if you have a better idea of what is a low-maintenance, easily-accessible position that could be put on in a wide variety of lead-ins to crisis.
    In several types of catastrophes and significant events, like a regime change in Bangladesh or in Syria, we can be impotent to influence events since we lack many of the necessary preparations for effective actions.Many events will remain contained to local and regional events, and indeed this is the most likely outcome. But when thinking about expected value, the question is less “what will happen?” and more “how could this escalate?”.Information and dashboards, like this bird flu risk dashboard are cheap types of scalable and permissionless interventions.Communication could become fraught, especially in scenarios with high AI capabilities, AI persuasion and network infrastructure fragility might make communication much harder.
      Permissionless analog systems like amateur radio require delicate reflection off oft-shifting atmospheric effects to reach targets past line-of-sight. Transmitting on frequencies that pass through the earth is more unwieldy in terms of power and antenna setup and are also de jure off-limits to civilians. In some emergencies there may be few with the capacity to enforce this ban, however.Uncorrelated networking stacks could be very useful if sufficient trunks of the physical layer of the internet are intact. Much like how AI benchmarks retain secret questions so as not to contaminate training runs, an unpublished operating system and networking stack could be very useful in a rogue AI scenario. To that end we’ve briefly discussed the notion of “SentinelOS” which would focus on communication, coordination, and continuity of humanity. We’re not sure how much to invest in the idea but if you have thoughts on this or think you’d be the right person to contribute to the design/implementation of this artifact, please reach out.

Get Involved!

If you are working in the GCR space, we'd love if you reached out at u>hello@sentinel-team.org</u with expressions of interest for participating in a wargame with Sentinel, running a wargame for a scenario and players of your choosing, or interest in contributing to our repository. If you’re involved in emergency response in particular, it’d be great to be able to stress test your responses.



Discuss

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

AI 战争推演 危机应对 应急响应 Grim
相关文章