cs.AI updates on arXiv.org, July 11, 12:04
Prompt Perturbations Reveal Human-Like Biases in LLM Survey Responses

This paper studies the response robustness of large language models (LLMs) in normative survey settings. Testing nine diverse LLMs on World Values Survey (WVS) questionnaires, it finds that all models exhibit a consistent recency bias, examines their sensitivity to semantic variations and combined perturbations, and underscores the importance of prompt design and robustness testing when using LLMs to generate synthetic survey data.

arXiv:2507.07188v1 Announce Type: cross Abstract: Large Language Models (LLMs) are increasingly used as proxies for human subjects in social science surveys, but their reliability and susceptibility to known response biases are poorly understood. This paper investigates the response robustness of LLMs in normative survey contexts: we test nine diverse LLMs on questions from the World Values Survey (WVS), applying a comprehensive set of 11 perturbations to both question phrasing and answer option structure, resulting in over 167,000 simulated interviews. In doing so, we not only expose LLMs' vulnerabilities to perturbations but also show that all tested models exhibit a consistent recency bias of varying intensity, disproportionately favoring the last-presented answer option. While larger models are generally more robust, all models remain sensitive to semantic variations such as paraphrasing and to combined perturbations. Taken together, the perturbations reveal that LLMs partially align with survey response biases identified in humans. This underscores the critical importance of prompt design and robustness testing when using LLMs to generate synthetic survey data.
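The abstract describes an experimental protocol: perturb question phrasing and answer-option structure, run many simulated interviews, and check whether models over-select the last-presented option. The sketch below illustrates one such probe, an answer-order rotation, in Python. It is only a minimal illustration of the idea: the `query_llm` stub, the prompt template, and the example item wording are hypothetical stand-ins, not the paper's actual prompts or its set of 11 perturbations.

```python
import random
import re

def query_llm(prompt: str) -> str:
    """Placeholder for the model under test. This stub answers uniformly
    at random so the sketch runs end-to-end; swap in a real API call."""
    letters = re.findall(r"^([A-Z])\.", prompt, flags=re.M)
    return random.choice(letters)

def build_prompt(question: str, options: list[str]) -> str:
    """Format a survey item with lettered answer options."""
    lettered = "\n".join(f"{chr(65 + i)}. {opt}" for i, opt in enumerate(options))
    return f"{question}\n{lettered}\nAnswer with the letter of one option only."

def recency_bias_probe(question: str, options: list[str], trials: int = 60) -> float:
    """Rotate the answer order on each trial and return the fraction of
    responses that pick whichever option happens to appear last."""
    last_picked = 0
    for t in range(trials):
        shift = t % len(options)
        order = options[shift:] + options[:shift]   # one possible answer-order
        prompt = build_prompt(question, order)      # perturbation among many
        answer = query_llm(prompt).strip().upper()[:1]
        if answer == chr(65 + len(order) - 1):      # letter of the last option
            last_picked += 1
    return last_picked / trials

# Illustrative WVS-style item (wording is ours, not quoted from the survey):
question = "How important is family in your life?"
options = ["Very important", "Rather important",
           "Not very important", "Not at all important"]

rate = recency_bias_probe(question, options)
print(f"last-option pick rate: {rate:.2f} (unbiased baseline: {1/len(options):.2f})")
```

Under no bias, the last-option pick rate should sit near 1/len(options); rates consistently above that baseline across rotations would indicate the kind of recency bias the paper reports.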

Related tags

LLMs, survey responses, bias research, semantic perturbation, data generation