少点错误 9小时前
Open weights == Closed source
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

文章深入探讨了“开放权重”(open weight)模型与“开源”(open source)的本质区别。作者指出,尽管一些公司将开放权重模型称为开源,但这会混淆概念并可能夸大其开放性。真正的开源要求提供源代码,且不限制特定人群或领域的使用。而开放权重模型通常只提供模型参数(权重),缺乏训练代码和数据,并且其使用许可常常包含对用户规模和使用场景的限制,这与开源的定义和精神相悖。因此,将开放权重模型等同于开源是一种误导,应更准确地理解和使用这些术语。

💡 **“开放权重”与“开源”的定义差异**:文章明确指出,“开放权重”模型仅发布模型参数(权重),而“开源”则要求提供完整的源代码,并允许自由分发和修改。这种混淆会造成概念不清,并可能误导公众对模型开放程度的认知。

💻 **开源的核心要求与开放权重模型的局限**:根据开源定义,源代码是关键,并且不得对使用者或使用领域进行歧视。然而,开放权重模型通常不包含训练代码和数据,且其许可证常限制特定用户(如公司规模)或特定用途(如军事),这违反了开源的“无歧视”原则。

🤔 **开放权重模型的使用动机与实际含义**:作者认为,一些公司将开放权重模型宣传为“开源”,更多是出于营销目的,意在塑造“我们是好人”、“我们关心用户”的形象。这种做法可能将原本的“闭源”行为包装成更具吸引力的“开放”概念,从而获得不当的优势。

⚖️ **许可证限制与开源精神的冲突**:开放权重模型对公司规模和使用场景的限制,直接违背了开源旨在促进广泛应用和创新的初衷。例如,禁止特定规模的公司或特定领域(如军事)使用,就与开源的“无歧视”精神相悖。

Published on August 7, 2025 1:04 AM GMT

Disagreements degenerate into debates about word usage way too often and such debates are usually pointless. However, sometimes words can mislead or provoke emotional reactions and in those cases word usage becomes important. This is one of those cases.

Many modern AI models, such as those typically used for text or image generation, consist mainly of a huge neural network. The AI models, for which parameters (a.k.a. weights) are freely distributed on the internet came to be known as open weight models. This is contrast to other models, whose parameters are kept secret on some servers and  are only accessible through APIs or web interfaces.

Some[1] have instead used the term open source to designate the same thing. I cannot speak about the motivation behind that word choice, but it ends up distorting what "open source" means, creates confusion and could make publishing weights seem more virtuous than it is.

What is open source?

I won't get into all the details of defining open source. Here are a couple of points from the definition by the Open Source Initiative (the organization who introduced the term in 1998): [2]

2. Source Code
The program must include source code, and must allow distribution in source code as well as compiled form. Where some form of a product is not distributed with source code, there must be a well-publicized means of obtaining the source code for no more than a reasonable reproduction cost, preferably downloading via the Internet without charge. The source code must be the preferred form in which a programmer would modify the program. Deliberately obfuscated source code is not allowed. Intermediate forms such as the output of a preprocessor or translator are not allowed.

What is the source code of a generative model?
Let's say I want to use the model for my company, while making sure it knows nothing about competitors. The most reliable way to do that would be to remove such references from the training data and retrain the model. This illustrates that the preferred form for modifying a generative model, such as Llama or DeepSeek would include:

This is not what is provided when a model's weights are published so we should not call it "open source". There are a number of reasons why providing the training data is unrealistic and probably never going to happen. Retraining the model would cost a fortune anyway so one could argue that open source is just an unsuitable distribution method for generative models.

5. No Discrimination Against Persons or Groups
The license must not discriminate against any person or group of persons.

6. No Discrimination Against Fields of Endeavor
The license must not restrict anyone from making use of the program in a specific field of endeavor. For example, it may not restrict the program from being used in a business, or from being used for genetic research.

Open weight generative models typically restrict usage based on company size and use case (e.g. Llama's license disallows use for military purposes and use by companies with over a given number of users).

It is unclear whether company size restriction breaks rule 5, but I would argue the restriction is against the spirit of the rule. The use case restriction certainly breaks rule 6.

Conclusion

While "open source" was defined before open weight generative models were a thing, the intuition behind the term points to something quite different from those models. Open weights very closely matches what was traditionally considered closed source - training corresponds to the compilation and the output is an inscrutable array of numbers, which can be used to perform some computation.

Publishing weights is often misrepresented as if it supports some open source ideology. In my opinion companies use it to signal things like "we're the good guys", "we care about our users".

The term "open source" is useful vocabulary and becomes meaningless if we use it when we actually mean "closed source".

  1. ^
  2. ^


Discuss

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

开放权重 开源 AI模型 模型许可证 技术术语
相关文章