钛媒体:引领未来商业与生活新知 16小时前
Huawei Denies Plagiarism in Pangu AI Model After Allegations of Copying Alibaba's Qwen
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

近期,华为因其开源的盘古Pro MoE大模型被指控抄袭阿里云的Qwen-2.5模型,引发了关于开源AI知识产权界限的激烈讨论。 一项GitHub研究指出,华为模型与阿里云模型在参数分布上高度相似,并发现了代码中遗留的阿里云版权信息。尽管华为否认抄袭,强调自主研发,但事件凸显了开源AI领域在许可、归属和商业化方面的模糊地带。 专家呼吁开发者明确遵守开源许可协议。 这起事件也反映了开源AI领域日益激烈的竞争和知识产权保护的挑战。

🤔 华为的盘古Pro MoE大模型因涉嫌抄袭阿里云的Qwen-2.5模型而引发争议,争议源于GitHub上的一份研究,该研究使用了模型指纹技术,发现两个模型在注意力参数分布上高度相关,并发现了代码中遗留的阿里云版权信息。

📢 华为回应称,盘古Pro MoE是基于华为昇腾硬件平台从头开始开发的,并严格遵守开源许可要求,但承认参考了开源行业实践。公司强调其创新的分组混合专家(MoGE)架构,旨在优化分布式训练环境中的负载平衡。

⚖️ 这起事件突显了开源AI开发中知识产权的模糊地带,包括许可义务、归属要求和商业化限制。专家强调,开发者必须明确承认原始来源并遵守许可条款,特别是在改编或商业化开源项目时。

🌍 开源AI领域竞争激烈,低技术门槛使得初创企业能够快速部署服务。此前,01.AI的Yi-34B和斯坦福的Llama3-V也曾因类似问题受到批评。这促使人们更加关注清晰的许可框架和执行机制。

AsianFin -- Huawei has issued a public rebuttal following mounting accusations that its newly open-sourced Pangu Pro MoE large language model repackages code and architecture from Alibaba Cloud’s Qwen-2.5 model, reigniting a debate over IP boundaries in the rapidly evolving open-source AI ecosystem.

The controversy emerged after a study published on GitHub on July 4 alleged that Huawei’s 72-billion-parameter Pangu Pro MoE model showed a striking resemblance to Alibaba’s 14B Qwen-2.5. Using a model fingerprinting technique, the unnamed author—claiming to be a Korean student at the University of Costa Rica—reported a 0.927 correlation between the two models’ attention parameter distributions, suggesting possible non-independent development. The post also cited leftover metadata in Huawei’s open-source code referencing “Copyright 2024 The Qwen team, Alibaba Group” as evidence.

Huawei responded on July 5 via a statement from its Noah’s Ark Lab, the company’s AI research division, denying any wrongdoing. The lab emphasized that the Pangu Pro MoE is a foundational model developed from scratch on Huawei’s Ascend hardware platform and is not based on incremental training from any other vendor’s models.

“We strictly adhere to open-source license requirements and clearly indicate copyright statements within the code,” Huawei said. “Referencing open-source industry practices is standard, and aligns with the collaborative spirit of the open-source community.”

The company highlighted key innovations behind Pangu Pro MoE, including a proprietary Grouped Mixture of Experts (MoGE) architecture designed to optimize load balancing in distributed training environments. Huawei said it welcomes constructive feedback and aims to promote open innovation, inclusivity, and sustainability in the open-source AI field.

Despite the GitHub repository being taken down, the episode has stirred widespread debate on social media platforms and Chinese developer forums such as Zhihu. Some commenters questioned the scientific validity of the fingerprinting methodology used to make the plagiarism claim, arguing that using parameter standard deviation is insufficient to determine model similarity.

The dispute underscores the legal and ethical gray areas in open-source AI development. While open-sourcing a model doesn’t waive intellectual property rights, confusion remains around licensing obligations, attribution requirements, and commercialization restrictions. Experts caution that developers must clearly acknowledge the original source and comply with license terms, particularly when adapting or commercializing open-source projects.

This isn’t the first time a major open-source AI model has faced plagiarism accusations. Earlier this year, 01.AI’s Yi-34B was criticized for borrowing architecture from Meta’s LLaMA model, and Stanford’s Llama3-V was revealed to have repackaged Moonshot AI’s MiniCPM-Llama3-V 2.5.

The open-source model space has become increasingly crowded, with low technical barriers allowing startups to quickly deploy services by wrapping models like GPT or DeepSeek in user-friendly apps. As generative AI adoption grows, the need for clearer licensing frameworks and enforcement mechanisms is becoming critical.

Huawei launched the Pangu model series in 2021, spanning NLP, computer vision, and scientific computing. On June 30, the company open-sourced its 7B dense model, the 72B Pangu Pro MoE, and inference tech based on Ascend, describing the move as a strategic push to strengthen its AI ecosystem.

Huawei Cloud said the Pangu model has been deployed in over 30 industries and 400 real-world scenarios—including government, finance, healthcare, autonomous driving, and industrial design—generating tangible business value.

While Alibaba has yet to comment on the matter, AI research teams from several major Chinese tech companies are closely watching the situation. For now, the incident highlights a growing tension in the AI community: how to encourage open-source innovation without eroding intellectual property protection or fueling unchecked model replication.

更多精彩内容,关注钛媒体微信号(ID:taimeiti),或者下载钛媒体App

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

华为 盘古大模型 开源 知识产权 AI
相关文章