少点错误 01月22日
Training Data Attribution: Examining Its Adoption & Use Cases
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

本报告探讨训练数据归因及其对降低AI极端风险的潜在重要性和可行性。涉及TDA技术的应用、研究现状、面临的瓶颈及可能带来的社会效益等方面。

TDA技术可分三类,梯度法被认为是实用TDA的最可能路径。

进行TDA的成本较高,其在大型模型上的准确性尚不明确。

TDA对AI研究和LLM发展有积极影响,如减少幻觉等问题。

AI实验室不愿公开训练数据,限制了TDA工具的公共访问。

Published on January 22, 2025 3:41 PM GMT

Note: This report was conducted in June 2024 and is based on research originally commissioned by the Future of Life Foundation (FLF). The views and opinions expressed in this document are those of the authors and do not represent the positions of FLF.

This report investigates Training Data Attribution (TDA) and its potential importance to and tractability for reducing extreme risks from AI. TDA techniques aim to identify training data points that are especially influential on the behavior of specific model outputs. They are motivated by the question: how would the model's behavior change if one or more data points were removed from or added to the training dataset? 

Report structure:

Key takeaways from our report:



Discuss

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

训练数据归因 AI风险 LLM发展 数据隐私
相关文章