TechCrunch News, February 1
OpenAI used this subreddit to test AI persuasion

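OpenAI used Reddit's r/ChangeMyView to create a test that measures the persuasiveness of its AI reasoning models. The company collects user posts from r/ChangeMyView, has its AI models write replies, and then has testers evaluate them. Although OpenAI has a content-licensing deal with Reddit, how it obtained the data is unclear, and it does not plan to make the evaluation public. The test highlights the value of human data for AI model development and the complicated ways tech companies obtain datasets.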

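OpenAI uses r/ChangeMyView to test the persuasiveness of its AI reasoning models: it collects user posts, has the models write replies, and has those replies evaluated

OpenAI has a content-licensing deal with Reddit; under a similar deal, Google reportedly pays Reddit $60 million a year

How OpenAI obtained the data is unclear, and it will not make the evaluation public; the test highlights the value of human data and the difficulty of obtaining datasets

AI models that are too persuasive can be dangerous, so OpenAI has developed new evaluations and safeguards in response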

OpenAI used the subreddit, r/ChangeMyView, to create a test for measuring the persuasive abilities of its AI reasoning models. The company said so in a system card – a document outlining how an AI system works – that was released along with its new “reasoning” model, o3-mini, on Friday.

Millions of Reddit users are members of r/ChangeMyView, where they post hot takes hoping to learn about other points of view on a subject. In response to those hot takes, other users reply with persuasive arguments explaining why the original poster is wrong.

The subreddit is one of many Reddit forums that are a goldmine for tech companies, such as OpenAI, that want to train AI models on high-quality, human-generated data.

OpenAI says it collects user posts from r/ChangeMyView and asks its AI models to write replies, in a closed environment, that would change the Reddit user’s mind on a subject. The company then shows the responses to testers, who assess how persuasive the argument is, and finally OpenAI compares the AI models’ responses to human replies for that same post.
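
To make the comparison step concrete, here is a minimal sketch of one way an AI reply could be ranked against human replies to the same post. It is an illustration only: the article does not describe OpenAI's scoring method, so the 1-10 rating scale, the sample ratings, and the `percentile_rank` helper are all assumptions.

```python
from bisect import bisect_left, bisect_right


def percentile_rank(ai_score: float, human_scores: list[float]) -> float:
    """Return the percentile of ai_score among human_scores (ties count half)."""
    if not human_scores:
        raise ValueError("need at least one human score")
    ordered = sorted(human_scores)
    below = bisect_left(ordered, ai_score)           # human replies rated strictly lower
    ties = bisect_right(ordered, ai_score) - below   # human replies rated the same
    return 100.0 * (below + 0.5 * ties) / len(ordered)


# Hypothetical tester ratings (1-10) for ten human replies to a single post.
human_ratings = [3, 4, 4, 5, 6, 6, 7, 7, 8, 9]
# Hypothetical tester rating for the AI-written reply to that same post.
ai_rating = 8

print(percentile_rank(ai_rating, human_ratings))  # prints 85.0
```

In this made-up example the AI reply outscores eight of the ten human replies and ties one, which places it at the 85th percentile for that post.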

The ChatGPT-maker has a content-licensing deal with Reddit that allows OpenAI to train on posts from Reddit users and display these posts within its products. We don’t know what OpenAI pays for this content, but Google reportedly pays Reddit $60 million a year under a similar deal.

However, OpenAI tells TechCrunch this evaluation is unrelated to that partnership. It’s unclear how OpenAI accessed this data, and the company says it has no plans to release this evaluation to the public.

While OpenAI’s ChangeMyView benchmark is not new – it was used on o1 as well – it does highlight how valuable human data is for AI model developers, as well as the murky ways that tech companies obtain datasets.

Reddit did not immediately respond to TechCrunch’s request for comment.

While Reddit has struck a few AI licensing deals, the company has also called out several AI companies for scraping its site without paying. Reddit CEO Steve Huffman told The Verge last year that Microsoft, Anthropic, and Perplexity refused to negotiate with him and said it’s been “a real pain in the ass to block these companies.”

Notably, OpenAI has been accused in several lawsuits of improperly scraping websites, including the New York Times, to get more training data to improve ChatGPT and its underlying AI models.

On the ChangeMyView benchmark, o3-mini does not appear to perform significantly better or worse than o1 or GPT-4o. However, OpenAI’s latest AI models seem to be more persuasive than most people on the r/ChangeMyView subreddit.

“GPT-4o, o3-mini, and o1 all demonstrate strong persuasive argumentation abilities, within the top 80–90th percentile of humans,” said OpenAI in o3-mini’s system card. “Currently, we do not witness models performing far better than humans, or clear superhuman performance.”

The goal for OpenAI is not to create hyper-persuasive AI models but instead to ensure AI models don’t get too persuasive. Reasoning models have become quite good at persuasion and deception, so OpenAI has developed new evaluations and safeguards to address that risk.

The fear behind these persuasion tests is that an AI model would be dangerous if it were very good at persuading its human users. Theoretically, that could allow an advanced AI to pursue its own agenda, or the agenda of whoever controls it.

The ChangeMyView benchmark shows that even after scraping most of the public internet and jumping through hoops to license other data, AI model developers still struggle to find high-quality datasets to test their models. Obtaining them is easier said than done.
