TechCrunch News, January 15
Meta execs obsessed over beating OpenAI’s GPT-4 internally, court filings reveal

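Court filings reveal that Meta's executives and researchers were intent on beating OpenAI's GPT-4 while developing Llama 3. Internal messages show that Meta's AI leadership treated Anthropic's Claude and OpenAI's GPT-4 as the targets to match and was dismissive of competitors such as Mistral. Meta spared little effort in training Llama 3: its AI leads described their approach to obtaining data as "very aggressive" and, according to the plaintiffs, trained on copyrighted books. Although Meta releases open AI models, internal competitive pressure was intense, and the goal was to lead the industry. Llama 3 was released in April 2024 and was competitive with leading closed models, but its training data is under scrutiny in copyright lawsuits.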

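🎯 The Meta AI team's primary goal while developing Llama 3 was to beat OpenAI's GPT-4. Internal messages show that executives were intensely competitive about it and committed substantial resources, including 64,000 GPUs.

🥇 Inside Meta, Anthropic's Claude and OpenAI's GPT-4 were treated as the gold standard for Llama to work toward. Other competitors, such as Mistral, were dismissed.

📚 To make Llama 3 competitive quickly, Meta's AI leads were "very aggressive" in obtaining training data and discussed using a dataset containing copyrighted books, which is now at issue in copyright litigation.

🚀 Despite the copyright dispute, Llama 3 was released in April 2024. It was competitive with leading models from Google, OpenAI, and Anthropic, and outperformed open models from Mistral.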

Executives and researchers leading Meta’s AI efforts obsessed over beating OpenAI’s GPT-4 model while developing Llama 3, according to internal messages unsealed by a court on Tuesday in one of the company’s ongoing AI copyright cases, Kadrey v. Meta.

“Honestly… Our goal needs to be GPT-4,” said Meta’s VP of Generative AI, Ahmad Al-Dahle, in an October 2023 message to Meta researcher Hugo Touvron. “We have 64k GPUs coming! We need to learn how to build frontier and win this race.”

Though Meta releases open AI models, the company’s AI leaders were far more focused on beating competitors that don’t typically release their models’ weights, like Anthropic and OpenAI, and instead gate them behind an API. Meta’s execs and researchers held up Anthropic’s Claude and OpenAI’s GPT-4 as a gold standard to work toward.

The French AI startup Mistral, one of the biggest open competitors to Meta, was mentioned several times in the internal messages, but the tone was dismissive.

“Mistral is peanuts for us,” Al-Dahle said in a message. “We should be able to do better,” he said later.

Tech companies are racing to upstage each other with cutting-edge AI models these days, but these court filings reveal just how competitive Meta’s AI leaders truly were – and seemingly still are. At several points in the message exchanges, Meta’s AI leads talked about how they were “very aggressive” in obtaining the right data to train Llama; at one point, an exec even said that “Llama 3 is literally all I care about,” in a message to coworkers.

Plaintiffs in this case allege that Meta’s executives occasionally cut corners in their mad race to ship AI models, training on copyrighted books in the process.

Touvron noted in a message that the mix of datasets used for Llama 2 “was bad,” and talked about how Meta could use a better mix of data sources to improve Llama 3. Touvron and Al-Dahle then talked about clearing the path to use the LibGen dataset, which contains copyrighted works from Cengage Learning, Macmillan Learning, McGraw Hill, and Pearson Education.

“Do we have the right datasets in there[?]” said Al-Dahle. “Is there anything you wanted to use but couldn’t for some stupid reason?”

Meta CEO Mark Zuckerberg has previously said he’s trying to close the performance gap between the Llama models and closed models from OpenAI, Google, and others. The internal messages reveal the intense pressure within the company to do so.

“This year, Llama 3 is competitive with the most advanced models and leading in some areas,” said Zuckerberg in a letter from July 2024. “Starting next year, we expect future Llama models to become the most advanced in the industry.”

When Meta ultimately released Llama 3 in April 2024, the open AI model was competitive with leading closed models from Google, OpenAI, and Anthropic, and outperformed open options from Mistral. However, the data Meta used to train its models — data Zuckerberg reportedly gave the green light to use, despite its copyright status — are facing scrutiny in several ongoing lawsuits.
