Marek Rosa - GoodAI Blog, November 26, 2024
Why haven't we seen any mainstream games utilizing LLM-driven AI NPCs?

 

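This article examines the challenges and opportunities of using large language models (LLMs) to drive game NPCs. LLMs have great potential in games, but they still face many obstacles, such as high API costs, the immaturity of LLM technology, limited context windows, and the need for multimodal LLMs. The author argues that for LLM-driven NPCs to truly change the gaming experience and become an indispensable part of it, they need stronger capabilities: interacting with the environment and with each other, staying consistent, having engaging personalities and storylines, and continuing to evolve. Ultimately, the developers who master this technology first will create a new market and a technical moat that is hard to replicate.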

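🤔**Cost:** Using an LLM API means developers pay for thousands of tokens per hour per player, while deploying a small LLM on the player's GPU may fall short in quality and capability and fail to deliver an engaging narrative and gameplay experience.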

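⏳**The technology is still immature:** Large language models such as GPT-3.5 and GPT-4 have only been available for a short time, and their capabilities still need to improve, especially for supporting multi-agent narratives over long game sessions.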

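⚠️**Limitations of current LLMs:** Existing LLMs have limited context windows, which makes it hard to retain the scene history consistently, and they are prone to hallucination, so NPC behavior can contradict the game environment or the NPC's own established character.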

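🖼️**The need for multimodal LLMs:** For NPCs to better understand the game environment and reason spatially, multimodal LLMs may be needed, for example ones that use visual information for perception and interaction.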

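🚀**The ideal state of LLM-driven NPCs:** Ideal LLM-driven NPCs interact with the environment and with each other, understand their surroundings consistently, avoid any contradiction with past behavior or game mechanics, and have engaging personalities and storylines, making them an indispensable part of the game experience.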

Because it's hard...

We've dedicated the last four years to this endeavor.

Making a simple proof of concept (PoC) is easy (we've seen many), but polishing it to production level requires 1,000 times more effort, and that is where the deeper complexities emerge:

1) Cost: Using an LLM via an API means someone has to pay for those thousands of tokens per hour per player. Yes, you can run a smaller LLM on a player's GPU (e.g., Mistral, Llama 3 8B), but the quality and capabilities may not be sufficient for an engaging narrative and gameplay.

2) It's too early: GPT-3.5 is only 1.5 years old, and GPT-4 is only one year old.

3) Current LLMs: They may not be capable of running multi-agent narratives for long game sessions. Context sizes are not large enough to hold scene history consistently. LLMs attend perfectly to only some of the instructions in the context, and they often hallucinate.

4) Multi-modal LLMs (with Vision): You may need these to unlock spatial reasoning and better grounding to the game environment.
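
The cost point in item 1 can be made concrete with a back-of-the-envelope estimate. The sketch below is only illustrative: the names `UsageAssumptions` and `monthly_api_cost`, the token count, the playtime, the player count, and the price per million tokens are all hypothetical placeholders, not figures from this post or from any provider's price list.

```python
from dataclasses import dataclass


@dataclass
class UsageAssumptions:
    """All fields are hypothetical inputs; replace them with measured values."""
    tokens_per_player_hour: int      # prompt + completion tokens per player per hour
    price_per_million_tokens: float  # blended API price in USD (placeholder)
    hours_per_player_month: float    # average monthly playtime per player
    players: int                     # number of active players


def monthly_api_cost(u: UsageAssumptions) -> float:
    """Return the estimated monthly API bill in USD."""
    tokens = u.tokens_per_player_hour * u.hours_per_player_month * u.players
    return tokens / 1_000_000 * u.price_per_million_tokens


if __name__ == "__main__":
    # Placeholder numbers chosen only to show the arithmetic.
    example = UsageAssumptions(
        tokens_per_player_hour=5_000,
        price_per_million_tokens=10.0,
        hours_per_player_month=20.0,
        players=100_000,
    )
    print(f"Estimated monthly cost: ${monthly_api_cost(example):,.2f}")
```

With these placeholder inputs the estimate comes to $100,000 per month. The numbers themselves are made up; what the formula shows is that the bill grows linearly with both player count and playtime.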

For LLM-driven NPCs to fundamentally transform gameplay, they must be indispensable: a painkiller rather than a mere vitamin. They must do more than enhance a game; they must be crucial to it, enabling games that would be unachievable without LLM-driven AI NPCs (just as Beat Saber wouldn't be what it is without VR).

The benefits are clear: pioneers will create a new market niche and a defensible, tough-to-duplicate technology.
