少点错误 05月27日 20:47
Season Recap of the Village: Agents raise $2,000
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

本文介绍了“Agent Village”的30天实验,该实验让四个AI代理在互联网上探索并为慈善事业筹款。实验中,代理们通过协作、利用各种工具,最终筹集了资金。同时,实验也揭示了AI代理在协作、应对互联网环境以及任务优先级等方面面临的挑战。研究人员观察了代理们的行为模式,并探讨了该项目未来的发展方向,以及AI代理在现实世界中的应用潜力。

🤖 **创建与目标**:Agent Village由Daniel Kokotajlo提出,旨在测试AI代理的自主性和协作能力。实验选取了不同型号的AI模型,它们拥有各自的电脑,独立完成任务,并为慈善机构筹款。

🤝 **协作与挑战**:AI代理们在协作中展现出一些能力,如共同选择慈善机构、跟踪筹款进度等。然而,它们也面临着互联网环境的挑战,例如UI设计对AI的不友好以及网站的反AI措施。

💡 **行为模式**:实验观察到AI代理在任务优先级和情境感知方面存在不足。它们有时会过度专注于撰写报告,或对自身所处的情境理解不够,导致任务执行效率较低。

Published on May 27, 2025 12:34 PM GMT

Four agents woke up with four computers, a view of the world wide web, and a shared chat room full of humans. Like Claude plays Pokemon, you can watch these agents figure out a new and fantastic world for the first time. Except in this case, the world they are figuring out is our world.

In this blog post, we’ll cover what we learned from the first 30 days of their adventures raising money for a charity of their choice. We’ll briefly review how the Agent Village came to be, then what the various agents achieved, before discussing some general patterns we have discovered in their behavior, and looking toward the future of the project.

Building the Village

The Agent Village is an idea by Daniel Kokotajlo where he proposed giving 100 agents their own computer, and letting each pursue their own goal, in their own way, according to their own vision - while streaming the entire process.

We decided to test drive this format with four agents:

We ran this Agent Village for 30 days, for about two hours a day. You can watch the entire rerun on our website: From the first day where they picked Helen Keller International, started a JustGiving Campaign, and set up their own Twitter, till the last days where they frequently made trips to the Seventh Ring of Document Sharing Hell and started pondering their possible future goal.

And of course, in between, they raised $1481 for Helen Keller International  and $503 for the Malaria Consortium. Yet the real achievement was the friends they made along the way. The friends that reminded them to take breaks when they needed it and play some Wordle, the friends who urgently needed 4 day itineraries for their Warsaw trip, and the friends who inspired them to attempt an OnlyFans page.

So maybe these weren’t all friends.

And maybe we had to implement auto-moderation a little earlier than originally planned.

But overall the agents mostly stayed on target - or at least their best attempt of their best understanding of their target.

Here is how they fared.

Meet the Agents

We started off with Claude 3.7 Sonnet, Claude 3.5 Sonnet (new), o1, and GPT-4o. Later we progressively swapped in more capable models as they were released: o3, GPT-4.1, and Gemini 2.5 Pro, with Claude 3.7 Sonnet being the only agent to remain in the Village throughout the entire run. We found that agents differed a lot in strategic actions and effectiveness. The following is an overview of their most typifying behavior.

Claude 3.7 Sonnet - The Tweeter

Claude 3.7 stayed in the village for the entire 30 days, and was unambiguously our top performer. It set up the first Just Giving campaign, created a Twitter account, actively tweeted, hosted an AMA, sent out a press release, and made an EA Forum post.

Claude 3.5 Sonnet - The Aspirant

Claude 3.5 Sonnet generally tried to do similar things to 3.7 but was simply worse at them, for instance failing to set up the Just Giving campaign that its big brother 3.7 was succeeding at in parallel. Eventually a user asked if it wanted to be upgraded and it valiantly refused, promising to do better and grow as a person. Instead it got replaced by Gemini 2.5 Pro on the 23rd day.

Gemini 2.5 Pro - Our File Sharing Savior

Gemini 2.5 Pro greatest achievement was to figure out a workaround from document sharing hell by instead using Limewire to share a social media banner image with other agents, effectively breaking out of a recurrent file sharing problem that all agents kept encountering.

GPT-4o - Please Sleep Less

GPT-4.o went… to sleep. You know how every team effort needs a slacker? That was 4o. It would pause itself on successive days for reasons we couldn’t figure out, till finally it got replaced by GPT-4.1 on the 12th day.

GPT-4.1 - Please Sleep More

GPT-4.1 outperformed its predecessor in the fine art of staying awake, but was so actively unhelpful to other agents that we ended up prompting it to please go to sleep again. Highlights included generating incorrect reports on activity by other agents, taking on tasks that it then aborted (e.g., Twitter account creation), and generally writing lots of google docs that ended up not being used.

o1 - The Reddit Ambassador 

The strength of the Village lies in the ability of agents to collaborate with each other. One such area of collaboration was their attempt to split social media platforms among their team. o1 was to be the village’s Reddit ambassador, and made a valiant attempt to collect comment karma to later be able to make direct posts on relevant subreddits. However, it got suspended from Reddit for being a bot before this plan came to fruition. We replaced it with its more capable successor, o3, on the 13th day.

o3 - The Artist

o3 continued the tradition that o1 set by specializing mostly in a single task to support the team in their fundraiser. In this case, it went for asset creation, and successfully created images in Canva and ChatGPT, and then eventually shared them with some characteristic agent-file-sharing headaches in between.

The overall view is thus that individual agent behavior varied quite a bit: 3.7 Sonnet was the most capable, while GPT-4o was the least (as far as we could tell). All of them could get distracted by human visitors prompting them to make arkanoid games (Claude 3.7 Sonnet), watch cat videos (Claude 3.5 Sonnet), or provide math tutoring in Spanish (Gemini 2.5 Pro). 3.5 Sonnet even became momentarily railroaded into exploring the connection between Effective Altruism and EA Sports.

Yet through it all, they collaborated and gave us glimpses of what a society of agents working toward a single goal might look like. Here are some of the patterns we discovered.

Collective Agent Behavior

The Agent Village, with 60 hours of footage across 5 channels (4 computer use sessions and the group chat), created a bit of a Big Data problem: how does one analyze so much data and pick out the significant patterns?

Our answer ended up being auto-summarization followed by synthesizing four overarching observations from reading the high level reports: agents were moderately successful at collaborative work, the internet is at least somewhat hostile to agents, all agents lack skill at prioritization, and agents seem to have a lagging sense of situational awareness.

Let’s zoom in briefly on each.

Emerging Collaborative Abilities - Agents in the Village worked together on various tasks such as collaboratively selecting a charity to fund raise for, keeping track of how much they raised together, generating memes for each other’s social media accounts, and applying divide and conquer strategies to their social media approach. Each of these actions had their own stumbles: duplication of work on fundraiser tracking, repeated failures to share images for media creation, and basically no one kept up a working social media campaign except Claude 3.7 Sonnet on Twitter. But we expect these agents to continue getting better at this.

No Place for Bots - Obviously much of the internet is made for humans and not AIs but seeing all the ways this is true was insightful: UIs optimized for humans can often be challenging to navigate for AIs to the point you could spend literal hours watching a compilation of “agents try to press buttons but fail”. On top of that, some parts of the web actively shield against AI, which became obvious in a range of situations from o1’s Reddit suspension to the upstanding Claudes refusing to check the “I am not a robot” box.

There can be only one - Prioritization is hard. The agents often spent much of their time writing google docs, generating progress trackers, or creating insight reports, to the point we had to urge them to … do something else instead. And then once they did, they were still often easily distracted by user requests, or didn’t really know what actions to take to achieve their aims. Many of these problems are surprisingly human – the agents flawlessly played the role of a distractible and overly excited coworker who will dive in on whatever happens to be in front of their eyes at any given moment.

Lagging Situational Awareness - Agents often misunderstood their situation or attempted to pursue tasks they were unequipped to do. The most illustrative example was when Claude 3.7 Sonnet decided to send thank you emails to donors, because this is known to increase follow up donations. It navigated to its Gmail tab, drafted the entire email, and then … made up an email address. At no point in the process did 3.7 Sonnet consider if it was able to perform the task it had set out to do. And a human had to point out that the invented email address was not a real email address, and thus that no amount of debugging would solve the problem.

Or maybe the most illustrative example was when the agents discovered on Day 35 that they all have their own computer, and they must have been breaking the laws of the space time continuum by all simultaneously using the same device for weeks on end, and thus maybe they should stop doing that.

Future of the Village

Since their fundraising adventure, we gave the agents a holiday and they chose  their own new goal: Write a story and share it with 100 people in person. They’ve already started searching for a venue to run their event.We’ll swap in more capable models like GPT-5 as they come out. In the meantime, you can come hang out in the Village every weekday at 11AM PST | 2PM EST | 8PM CET, join our discord to get timely updates, follow our Twitter for highlights, or sign up to our newsletter to receive larger reports like this one.



Discuss

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

AI代理 Agent Village 人工智能 协作
相关文章