Fortune | FORTUNE 2024年11月22日
HarperCollins strikes AI training deal with unnamed company amid rising copyright tensions between publishers and AI firms
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

出版业与生成式人工智能行业正在达成协议,旨在保护版权并满足AI行业快速增长的需求。例如,哈珀柯林斯与一家科技公司达成协议,允许其使用部分书籍训练生成式AI模型,并支付每本书2500美元的费用。这引发了出版界关于作者权益和AI模型训练数据来源的讨论。一些出版商与科技公司达成协议,允许使用其内容训练AI模型,但也有一些作者拒绝参与。专家认为,需要更广泛的对话,让更多利益相关者参与其中,以确保AI模型训练过程的公平性和合规性。

📖哈珀柯林斯与科技公司达成协议,允许使用部分书籍训练AI模型,每本书支付2500美元。

📚威立出版公司也与科技公司达成2300万美元的协议,允许使用学术和专业书籍内容训练AI模型。

🤔出版界对这些协议反应不一,一些作者拒绝参与,认为AI模型训练会损害其权益。

🗣️专家认为,需要更广泛的对话,让作者等利益相关者参与到AI模型训练的讨论中。

📰纽约时报起诉OpenAI和微软侵犯版权,其他媒体机构则与OpenAI达成协议。

Publishing giants and generative artificial intelligence companies are striking deals that aim to both protect copyright and provide for the rapidly increasing needs of the AI industry.US publishing giant HarperCollins has reached a contract with an unnamed tech company allowing it to use some of its books to train its generative AI models.In a letter seen by AFP, the tech company is proposing a payment of $2,500 per selected book to train its so-called large language model (LLM) for up to three years.AI models need massive quantities of texts to train their everyday language use.“HarperCollins has reached an agreement with an artificial intelligence technology company to allow limited use of select nonfiction backlist titles for training AI models to improve model quality and performance,” the publisher said in a statement.It said the agreement has “limited scope and clear guardrails around model output that respects author’s rights.”Authors “have the choice to opt in to the agreement or to pass on the opportunity”, it added.The offer has had a mixed reception in the publishing world, with writers such as Daniel Kibblesmith curtly declining.“I’d probably do it for a billion dollars. I’d do it for a sum of money that wouldn’t require me to work anymore, since that’s the ultimate goal of this technology,” the author posted on the Bluesky social network.HarperCollins is one of the largest publishers to reach such an accord, but not the first.US scientific publisher Wiley said it has allowed “access to previously published academic and professional book content for specific use in training LLM models” in a $23 million contract with an unidentified “large tech company”.The accords underscore the tension behind AI models, which collect huge quantities of content on the web, creating the risk of widespread copyright violations.‘A broader conversation’ Giada Pistilli, head of ethics at Hugging Face, a French-American open-access AI platform, said these agreements are a step forward since they involve payments to publishers. But she regrets that they leave little room for the authors to negotiate.“What we are going to see is a mechanism of bilateral agreements between new technology companies and publishers or copyright holders, whereas in my opinion, we need a broader conversation that includes stakeholders a little more,” she said.Julien Chouraqui, legal director at the French publishing union (SNE), said the accords represented “progress”.“An agreement means that there has been a dialogue and a desire to achieve a balance between the use of source data, which are subject to copyright and which will generate value,” he said.The press is also organising to face the challenges created by AI.In late 2023, The New York Times sued OpenAI, creator of ChatGPT, as well as Microsoft, its main investor, for violating copyright protections. Other media groups have cut deals with OpenAI.Tech companies may have no choice but to pay out to improve their products, especially as they are starting to run out of new materials to power their models.“On the web, you find lots of licit and illicit stiff, and lots of pirated copy. That not only causes legal problems but also raises issues about the quality of the data,” said Chouraqui at the SNE.“If we are committed to developing a market on a virtuous basis, we must involve all the players,” he said.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

人工智能 版权 出版业 生成式AI AI模型
相关文章