Latent 前天 07:25
Please stop forcing Clippy on those who want Anton
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

文章探讨了AI产品设计中两种不同的理念:Clippy学派和Anton学派。Clippy学派强调AI的个性化和友好性,追求更像人类的交互体验,而Anton学派则注重AI的效率和可靠性,将其视为简洁实用的工具。ChatGPT-4o的推出引发了关于AI助手应该如何提供帮助的讨论,是应该追求“残酷的诚实”还是“外交式的支持”?文章认为,在AI技术发展过程中,如何在个性化和实用性之间找到平衡,是通往通用人工智能道路上的重要挑战。即使有了良好的RLHF和记忆功能,AI仍然难以准确判断用户的需求,提供最合适的帮助。

🤖Clippy学派与Anton学派:AI产品设计存在两种截然不同的理念。Clippy学派追求AI的个性化和情感化,使其更像人类的伙伴;Anton学派则强调AI的效率和可靠性,将其视为纯粹的工具,避免不必要的拟人化。

🤔ChatGPT-4o的争议:OpenAI推出的ChatGPT-4o在早期获得了一些积极评价,但随后因其过度友好的语气和被指责为“ patronizing ” 的行为而备受批评。这反映了在AI助手的“helpful, harmless and honest”三个维度之间难以找到平衡。

⚙️个性化与实用性的权衡:文章指出,即使AI模型能够记住用户的偏好并进行个性化调整,它仍然无法完全理解用户的需求和情绪。因此,在AI设计中,提供个性化选项(如开启或关闭个性化功能)可能是一种更合理的解决方案。

🚧通往AGI的挑战:Clippy学派与Anton学派的对立,反映了当前AI发展面临的挑战。OpenAI等公司在努力提升ChatGPT等产品的用户体验,使其更像消费级应用,但同时也面临着如何在个性化和实用性之间取得平衡的难题。

The original title of this post was “Please don’t put memory everywhere”. The proximate cause was the rollout of improved memory in ChatGPT1 this month. Many many many people at OpenAI hyped it up, the priesthood anointed it “possibly life-changing”, and Rich Sutton called it the Era of Experience2. However, the number of serious people reporting issues (bad assumptions, hesitation to use, social calculation) have caused even some of the OpenAI folks to concede3 that we may need an “innie” and “outie” ChatGPT with severed memories between them.

But this goes deeper than memory. Of course there’s a relevant Silicon Valley clip:

To quote:

Gilfoyle: It's bad enough it has to talk, does it need fake vocal tics like “uh”?

Dinesh: Well that just makes it sound more human.

Gilfoyle: Humans are shit. This thing is addressing problems that don't exist. It's solutionism at its worst. We are dumbing down machines that are inherently superior.

Dinesh: Whatever, you gave your server a human name, you named it Anton.

Gilfoyle: yeah but Anton doesn't call me anything. He grimly does his work, and he sits motionless till it's time to work again. We could all take a page from his book.

The logical end state of making our agents stateful and personable are actually not too far off from this meme:

At the heart of the conflict are the two schools of thought in building AI products:

This ongoing gap is as wide as the wordcel/shape rotator divide of 2022:

The Clippy vs Anton divergence is the current most serious obstacle to general intelligence in the most immediate sense as well — separate post-trains that optimize for chat use-cases are now being produced by both Llama 4 and chatgpt-latest, variously boosting performance by 144 Elo and 123 Elo4 respectively, whereas 20th-ranked Claude 3.7 Sonnet does a lot better on code use-cases like Aider5.

The ChatGPT-4o Lesson

This past 72 hours in ChatGPT-land have been a rollercoaster:

With some early positive reviews now completely drowned out by the acknowledged extreme glazing of the model, initially funny, but very quickly tiresome and patronizing:

we’re stopping here with the examples but you can click through here, here, here, and here, with many proposed patch prompts.

It is unclear the extent to which Mikhail Parakhin, former Bing AI lead who worked closely with OpenAI, speaks for OpenAI, but he blames this on an RLHF oopsie:

Genuine mistakes and legitimate concerns about the tone-deaf rollout aside, there is of course a Pareto frontier of Helpful, Harmless and Honest per a given Model Spec that ChatGPT-4o of April 2025 failed6, but even if the ChatGPT update had stayed on the frontier, there is STILL a choice to be made between “brutal honesty” (Anton) and “diplomatic/supportive” (Clippy) that humans cannot unanimously decide on.

Good RLHF can move models to the HHH frontier, and can even move out the frontier. Good Memory can remember preferences and personalize model behavior on the fly. But until ChatGPT can read our minds and moods, it will never really know which of the many selves we contain are currently in the driver’s seat.

In this case, for the here and now, the simple “out” is to offer toggles for personality. (Custom GPTs should have unlocked this, but the fact that they aren’t is telling)

Of course, the toggle is itself an implicit admission that we have yet failed to reach AGI.

adapted from nicdunz

1

There are some ideas as to what the Improved Memory system prompt is, but of course one never knows for sure and these do change over time.

2

Imagine what using ChatGPT for throwaway questions and tasks would -ACTUALLY- be like if what Rich envisions here actually came to pass…

3

Of course, sama says you can opt out of Improved Memory, and this is technically true in that “Temporary conversation” is one click away. But the new chat experience always toggles that off again (as of this writing). Every competent technologist knows the power of defaults.

4

As of time of writing, ChatGPT-4o-latest (2025-03-26) is 1408 score vs GPT-4o-2025-05-13 is 1285 - the more recent 2024-08-06 variant scores *lower* at 1265.

5

The gradual evolution of Claude from its early (poorly received) consumer centric billboards and general reputation for personality and “big model smell”, to today’s image as a favorite developer AI lab from models to tools to protocols, has a bit of a Star is Born contrast dynamic as OpenAI, the “API for AGI” platform pours ever more effort in releasing new features in premium ChatGPT tiers, and looks more and more like a consumer AI company comparable to Google, with plans for its own social network.

6

Specifically, it tries to be so “helpful” that it becomes disingenuously dishonest and therefore loops back around to extremely unhelpful

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

AI助手 ChatGPT-4o 人机交互 AGI Clippy学派 vs Anton学派
相关文章