LessWrong, July 24, 04:27
Involuntary One Boxers - Why Disposition Doesn't (Always) Matter

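Newcomb's problem is a classic decision-theory thought experiment that pits expected value maximization against the dominance principle. The participant faces a predictor who can forecast their choice with high accuracy. The opaque box contains a million dollars if the predictor foresaw that the participant would take only the opaque box, and nothing if the predictor foresaw that they would take both boxes. A transparent box containing a thousand dollars can be taken in addition. For any fixed prediction, taking both boxes yields a thousand dollars more than taking only the opaque box, yet the predictor's accuracy gives one-boxing the higher expected value. The article takes up the view that one boxers care about the optimal agent type while two boxers care about the optimal decision, and uses a "muscle spasm" analogy to argue that the action actually taken matters, not the underlying intention or disposition.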

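💡 The core of Newcomb's problem is the predictor's very high accuracy: people who take only the opaque box usually receive a million dollars, while people who take both boxes usually find the opaque box empty and end up with only the thousand dollars from the transparent box. This challenges the intuitively rational choice of "taking the extra thousand".

⚖️ The dispute turns on what a rational agent should attend to: choosing the act with the higher expected value (one-boxing), or choosing the act that directly adds to the payoff given the current contents of the boxes (two-boxing, for an extra thousand dollars). This is the basic disagreement between one boxers and two boxers.

🧠 Through the muscle-spasm analogy, the article offers a new angle: even if someone intends to take both boxes, if an uncontrollable factor (such as a muscle spasm) makes them actually take only one, they count as a one boxer and receive the million dollars. The action actually taken and its result matter more than the inner intention or "disposition".

🚶‍♂️ The article concludes that real one boxers are one boxers not because they have some "disposition" but because they actually one-box. Whatever led to the choice, what matters is which button gets pressed now and the result that action brings. This is a results-oriented conception of rationality.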

Published on July 23, 2025 3:45 PM GMT

This post assumes knowledge of Newcomb's problem. A summary is provided below for reference; skip it if you already have the background.

Background on Newcomb's Problem

Newcomb's problem is a game where you are faced with two choices: pick only an opaque box (that has either zero dollars or one million dollars), or additionally pick a transparent box that visibly contains a thousand dollars. 

At first glance, it seems obvious that you should always pick both boxes in order to get the extra thousand dollars.

The catch is that before you even knew you were about to play this game, someone made a prediction about what decision you were about to make, and the contents of the opaque box are based entirely on the prediction that they made of you. If they predicted you would just take the opaque box, then it will have a million dollars. Otherwise, it will have zero dollars.

Additionally, this predictor is known to be very good at predicting people's decisions in this game, and you have witnessed that they have never made a mistake in the past, even after hundreds of fair trials (there was no cheating or foul play going on - and you know this).

Now that you have been given all this information about how the game works, you are informed that the prediction for your decision has already been made before you even knew that this game existed. Accordingly, the contents of the box have been set and cannot be changed. You are now faced with the decision to take just the opaque box, or take both the opaque box and the transparent box.

After hearing this problem, most people immediately gravitate to one decision or the other, and feel completely confident in it. The issue is that this problem seems to divide people evenly on what the best thing to do is.

On one hand, it is a fact that one boxers have historically all become millionaires after one-boxing whereas two boxers have historically had nothing to show for their rationality other than a sad thousand dollars. And this isn't just historical coincidence, either. Given that the predictor is highly accurate, it is a fact that if you take the expected value of one-boxing, it is a much higher expected value than the expected value of two-boxing. 
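The expected value comparison above can be made concrete with a minimal Python sketch. The function names, the `accuracy` parameter, and the assumption that the predictor is equally accurate for one boxers and two boxers are illustrative choices, not part of the problem statement.

```python
BIG = 1_000_000   # opaque box prize when one-boxing was predicted
SMALL = 1_000     # visible contents of the transparent box


def expected_value(choice: str, accuracy: float) -> float:
    """Expected payoff of `choice` when the predictor is right with
    probability `accuracy` (assumed identical for both choices)."""
    if choice == "one-box":
        # The opaque box is full only if one-boxing was correctly predicted.
        return accuracy * BIG
    if choice == "two-box":
        # The opaque box is full only if the predictor erred; the
        # transparent box is collected either way.
        return (1 - accuracy) * BIG + SMALL
    raise ValueError(f"unknown choice: {choice!r}")


def break_even_accuracy() -> float:
    """Accuracy at which both choices have equal expected value:
    p * BIG = (1 - p) * BIG + SMALL  =>  p = (BIG + SMALL) / (2 * BIG)."""
    return (BIG + SMALL) / (2 * BIG)


if __name__ == "__main__":
    for p in (0.5, 0.6, 0.99):
        print(p, expected_value("one-box", p), expected_value("two-box", p))
    print("break-even accuracy:", break_even_accuracy())
```

With `accuracy=0.99`, one-boxing is worth roughly 990,000 dollars in expectation and two-boxing roughly 11,000. Under these assumptions, one-boxing has the higher expected value whenever the accuracy exceeds about 0.5005, so the predictor does not need to be anywhere near perfect.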

And yet, on the other hand, it is also a fact that, by the time you are presented with this choice, no one can change the contents of the opaque box anymore; therefore, for any unchangeable prediction that was made, the two-boxing decision yields an extra thousand dollars relative to the one-boxing decision.
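The dominance argument can be laid out as a small payoff table. The sketch below is illustrative only; the dictionary layout and the name `two_boxing_gain` are assumptions introduced here.

```python
# Payoffs indexed by (prediction already made, action taken now).
PAYOFF = {
    ("predicted one-box", "one-box"): 1_000_000,
    ("predicted one-box", "two-box"): 1_001_000,
    ("predicted two-box", "one-box"): 0,
    ("predicted two-box", "two-box"): 1_000,
}


def two_boxing_gain(prediction: str) -> int:
    """Extra payoff from two-boxing when the prediction is held fixed."""
    return PAYOFF[(prediction, "two-box")] - PAYOFF[(prediction, "one-box")]


if __name__ == "__main__":
    for prediction in ("predicted one-box", "predicted two-box"):
        print(prediction, "-> two-boxing gains", two_boxing_gain(prediction))
```

Holding the prediction fixed, the gain from two-boxing is a thousand dollars in both rows. That is exactly what the dominance principle appeals to, and it says nothing about how likely each row is given the action.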

Newcomb's problem is a thought experiment that pits expected value calculations against the dominance principle. One boxers tend to be one boxers because one boxers get rich as a result of being one boxers. Two boxers tend to be two boxers because they apply reason to this problem to see that the decision to two box yields them an extra thousand dollars. The debate has not been settled and the discourse around it rages on.

I was recently reading a post that sums up the discourse on Newcomb's problem quite nicely. That post argues that one boxers care about what the optimal agent type is, whereas two boxers care about what the optimal decision is. So perhaps one boxers and two boxers are simply talking past each other.

After all, everyone agrees that, if the prediction is based on your disposition, then it is optimal to have had the disposition of a one boxer at the time that the prediction was made. The difference is that the two boxer further insists that you have no control over your disposition at the time the prediction was made, and once the prediction has already been made, two-boxing is the optimal decision because it causes you to get an extra thousand dollars relative to the one-boxing decision. It seems, then, that one boxers are simply failing to see the two boxer's point because they hyper-fixate on the optimal disposition, not on the optimal decision.

I believe this conclusion is exactly backwards from the reality. It is the two boxer who places undue emphasis on disposition, whereas one boxers care only about results.

To see this, imagine if we lived in a world where everyone suffers from occasional, involuntary muscle spasms.

In this world, we present these spasmic participants with a version of Newcomb's problem where the mechanism by which you make your decision is to press one of two buttons that correspond to one-boxing or two-boxing.

We then set up a predictor who can account for these muscle spasms while still producing accurate predictions.

In this world, some of the people who one-box actually intended to two-box before they had an involuntary muscle spasm that made them accidentally press the one-box button. Call such agents the involuntary one boxers.

Despite having the disposition of a two boxer, the involuntary one boxers are still one boxers. It doesn't matter that they intended to two-box to scoop up the "extra" thousand dollars - they in fact one-boxed, and so they are one boxers.

The predictor's goal is to predict which button gets pressed - nothing more, nothing less. It is completely possible that the predictor does not care at all about what your disposition is - there is a world in which muscle spasms are so frequent that the predictor simply needs to be good at predicting the direction of these involuntary muscle spasms without caring one bit about the agents' dispositions.
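The spasm world can be sketched as a toy simulation. Everything here is an assumption made for illustration: the 50/50 split of intentions, the `spasm_rate`, and especially the shortcut of modelling the predictor as matching the pressed button with probability `accuracy`. In the thought experiment the prediction is made beforehand; the code only reproduces the resulting correlation, not the mechanism.

```python
import random
from collections import defaultdict

BIG, SMALL = 1_000_000, 1_000


def other(choice: str) -> str:
    return "two-box" if choice == "one-box" else "one-box"


def simulate(n_agents=10_000, spasm_rate=0.1, accuracy=1.0, seed=0):
    """Average payoff grouped by (intended button, pressed button)."""
    rng = random.Random(seed)
    payoffs = defaultdict(list)
    for _ in range(n_agents):
        intended = rng.choice(["one-box", "two-box"])
        # A spasm makes the agent press the button they did not intend.
        pressed = other(intended) if rng.random() < spasm_rate else intended
        # Modelling shortcut: the prediction tracks the pressed button,
        # not the intention, and is correct with probability `accuracy`.
        predicted = pressed if rng.random() < accuracy else other(pressed)
        opaque = BIG if predicted == "one-box" else 0
        payoff = opaque + (SMALL if pressed == "two-box" else 0)
        payoffs[(intended, pressed)].append(payoff)
    return {group: sum(v) / len(v) for group, v in payoffs.items()}


if __name__ == "__main__":
    for (intended, pressed), avg in sorted(simulate().items()):
        print(f"intended {intended}, pressed {pressed}: {avg:,.0f}")
```

In this toy model, the group that intended to two-box but pressed the one-box button (the involuntary one boxers) earns the same as the voluntary one boxers, because the payoff depends only on the pressed button and the prediction of it.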

We can extend this thought to the actual world, where involuntary muscle spasms are infrequent. Whether you are a one boxer or a two boxer depends solely on whether you actually one-box or two-box. The only thing we know in Newcomb's problem is that the predictor accurately predicts decisions.

Given this, if you have the disposition of a two boxer, and somehow one-box anyway, you will reap the rewards of a one boxer.

You don't have to understand why being a one boxer works in order to acknowledge that it works. And being a one boxer is simply about actually one-boxing. In other words, you are not a one boxer merely for thinking like a one boxer. You can talk the talk, but what matters is whether you walk the walk.

If we follow two boxers' conventional line of thinking, it is clear that their argument hyper-fixates on the importance of disposition. Their argument goes: 

You cannot influence what your disposition in the past was, and the prediction is made based on your disposition, so you cannot influence the contents of the opaque box. Therefore, you might as well focus on what you can control going forward, and the only relevant decision before you now is whether you grab an extra one thousand dollars or not.

It may well be true that you cannot influence what your disposition in the past was. However, the muscle spasm thought experiment clearly shows that the prediction itself can be completely unrelated to your disposition.

You may well have had the disposition of a two boxer. And yet, if you one-box anyway, you reap the rewards of a one boxer. Not only that, if you one-box anyway, you actually are a one boxer. It doesn't matter what factors led you to the point of one-boxing; what matters is that you in fact one-boxed.

From this perspective, the one boxer can agree with the fundamental tenet of focusing only on what you can control while ignoring everything that you cannot.

You may not be able to control who you were in the past, but you can control which button you press. And that is all that matters.

If you press that one box button, then you will have revealed that you were a one boxer all along.



