MarkTechPost@AI 2024年11月21日
This AI Paper Introduces Interview-Based Generative Agents: Accurate and Bias-Reduced Simulations of Human Behavior
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

研究团队开发了一种新型的生成式智能体架构,通过整合基于访谈的丰富数据集和大型语言模型,实现了对人类行为的更准确、更公平的模拟。该模型利用1052位美国居民的深度访谈数据,构建了能够预测个人态度和行为的智能体,并在多个基准测试中取得了显著优于传统方法的结果。研究表明,该模型不仅提高了预测准确性,还显著降低了基于人口统计学方法的偏差,为社会科学研究和政策制定提供了新的工具。

🤔该研究开发了一种新型的生成式智能体架构,通过整合基于访谈的丰富数据集和大型语言模型,实现了对人类行为的更准确、更公平的模拟。

📊该模型在通用社会调查(GSS)、五大性格特质清单和经济博弈等基准测试中表现出色,准确率显著高于传统方法,例如基于人口统计学和角色的模型。

📉与传统方法相比,该模型显著降低了不同人口亚群之间的偏差,例如在政治意识形态预测中,偏好和非偏好群体之间的性能差距大幅下降。

🎮该模型能够准确模拟人类参与者在多个行为研究中的表现,例如复制了五项行为研究中的四项,展现了其强大的预测能力。

💡该研究强调了利用深度访谈数据和先进人工智能架构模拟人类行为的潜力,为社会科学研究和政策制定提供了新的工具。

Generative agents are computational models replicating human behavior and attitudes across diverse contexts. These models aim to simulate individual responses to various stimuli, making them invaluable tools for exploring human interactions and testing hypotheses in sociology, psychology, and political science. By integrating artificial intelligence, these agents offer novel opportunities to enhance understanding of social phenomena and refine policy interventions through controlled, scalable simulations.

The challenge this research addresses lies in the limitations of traditional models for simulating human behavior. Existing approaches often rely on static or demographic-based attributes, which oversimplify the complexity of human decision-making and fail to account for individual variations. This lack of flexibility restricts their application in studies requiring nuanced representations of human attitudes and behaviors, creating demand for more dynamic and precise systems.

Historically, simulations of human behavior have been conducted through agent-based models and demographic profiling, which rely on predefined attributes and are constrained by interpretability. Recent advancements in artificial intelligence, particularly large language models, have demonstrated the ability to generalize human behavior across contexts. However, these systems face criticism for propagating stereotypes and failing to represent individual diversity accurately. Researchers have sought to overcome these challenges by integrating richer datasets and adaptive architectures into their models.

The research team from Stanford University, in collaboration with Google DeepMind, Northwestern University, and the University of Washington, developed a novel generative agent architecture. The system incorporates interview-based datasets, capturing in-depth individual data through semi-structured qualitative interviews. The participants included a stratified sample of 1,052 individuals from the United States, ensuring representation across age, race, gender, and political ideologies. The interviews were conducted using a custom AI interviewer, which dynamically adapted questions to the participants’ responses. By integrating this detailed data with a large language model, the researchers created simulations capable of accurately predicting individual attitudes and behaviors while reducing biases commonly associated with demographic-based approaches.

The architecture utilizes participants’ comprehensive interview transcripts as the foundation for the simulations. When prompted, the agents draw from the full interview data to respond contextually. To evaluate their effectiveness, the researchers benchmarked the agents against responses to the General Social Survey (GSS), Big Five personality traits inventory, and several economic games. The agents also participated in experimental replications of well-known behavioral studies. The research team ensured rigorous evaluation metrics by normalizing accuracy against participants’ consistency in retaking the same surveys two weeks later. Memory mechanisms further enhanced the agents’ ability to simulate multi-step interactions, allowing them to adapt and learn from prior responses.

The results of the study demonstrated significant improvements over existing methods. Generative agents achieved a normalized accuracy of 0.85 on the General Social Survey, reflecting an 85% match to participants’ responses. By comparison, demographic-based agents scored 0.71, and persona-based agents scored 0.70. In predicting the Big Five personality traits, the agents recorded a correlation of 0.80, outperforming baseline methods by a substantial margin. Economic game simulations also showed high accuracy, with normalized correlations of 0.66 for decision-making tasks. These agents consistently outperformed benchmarks, including when interview data was reduced by up to 80%, underscoring the robustness of the architecture.

Moreover, the research highlighted a significant reduction in bias across demographic subgroups. For political ideology, the performance disparity between favored and less favored groups dropped from 12.35% for demographic-based agents to 7.85% for interview-based agents. Similarly, the disparity decreased significantly in personality and economic game predictions, indicating the system’s ability to produce fairer and more inclusive simulations. The results of experimental replications further reinforced the agents’ predictive accuracy, as they replicated findings from four out of five behavioral studies with strong correlations to human participants’ responses.

In conclusion, this study presents a breakthrough in behavioral simulations by leveraging detailed qualitative data and advanced AI architectures. The generative agents developed by the Stanford University and Google DeepMind research team address longstanding limitations in traditional models, offering a scalable and ethically grounded solution for simulating human behavior. This advancement improves predictive accuracy and sets the stage for future social science and policy development applications. By reducing biases and incorporating rich datasets, the research underscores the potential of AI in creating tools that reflect the complexity of human interactions.


Check out the Paper. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter and join our Telegram Channel and LinkedIn Group. If you like our work, you will love our newsletter.. Don’t Forget to join our 55k+ ML SubReddit.

[FREE AI VIRTUAL CONFERENCE] SmallCon: Free Virtual GenAI Conference ft. Meta, Mistral, Salesforce, Harvey AI & more. Join us on Dec 11th for this free virtual event to learn what it takes to build big with small models from AI trailblazers like Meta, Mistral AI, Salesforce, Harvey AI, Upstage, Nubank, Nvidia, Hugging Face, and more.

The post This AI Paper Introduces Interview-Based Generative Agents: Accurate and Bias-Reduced Simulations of Human Behavior appeared first on MarkTechPost.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

生成式智能体 人类行为模拟 人工智能 偏差降低 社会科学
相关文章