AI News 04月15日 17:02
Apple AI stresses privacy with synthetic and anonymised data
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

苹果公司正采取一种新方法来训练其AI模型,避免收集或复制iPhone或Mac用户的个人内容。通过使用合成数据和差分隐私技术,苹果旨在改进如邮件摘要等功能,同时保护用户隐私。该公司依靠设备分析计划,让设备将合成邮件与本地存储的真实用户数据进行比较,并将匹配信息发送回苹果,而非实际的用户数据。这种方法已应用于Genmoji等功能,并计划扩展到其他Apple Intelligence功能,如Image Playground和写作工具。苹果在iOS 18.5、iPadOS 18.5和macOS 15.5的beta版本中推出该系统,旨在平衡用户隐私和模型性能。

📱苹果采用合成数据和差分隐私技术,避免直接收集用户在iPhone或Mac上的内容,以此来训练其AI模型。

📧在设备分析计划中,设备会将合成邮件与本地存储的少量真实用户数据进行比较,并向苹果发送匹配信息,而非用户实际数据。

✍️苹果使用该技术来改进Genmoji等功能,并通过匿名投票确定常用提示词。未来,苹果计划将其应用于Image Playground、Image Wand、Memories Creation和写作工具等其他Apple Intelligence功能。

📊苹果通过生成数千个样本消息,并将其转换为数值表示,来改进邮件摘要等复杂任务。用户设备将这些数值与本地存储的样本进行比较,并将匹配信息分享给苹果,以优化训练数据。

🚀该系统已在iOS 18.5、iPadOS 18.5和macOS 15.5的beta版本中推出,旨在平衡用户隐私与模型性能,解决AI开发中的挑战。

Apple is taking a new approach to training its AI models – one that avoids collecting or copying user content from iPhones or Macs.

According to a recent blog post, the company plans to continue to rely on synthetic data (constructed data that is used to mimic user behaviour) and differential privacy to improve features like email summaries, without gaining access to personal emails or messages.

For users who opt in to Apple’s Device Analytics program, the company’s AI models will compare synthetic email-like messages against a small sample of a real user’s content stored locally on the device. The device then identifies which of the synthetic messages most closely matches its user sample, and sends information about the selected match back to Apple. No actual user data leaves the device, and Apple says it receives only aggregated information.

The technique will allow Apple to improve its models for longer-form text generation tasks without collecting real user content. It’s an extension of the company’s long-standing use of differential privacy, which introduces randomised data into broader datasets to help protect individual identities. Apple has used this method since 2016 to understand use patterns, in line with the company’s safeguarding policies.

Improving Genmoji and other Apple Intelligence features

The company already uses differential privacy to improve features like Genmoji, where it collects general trends about which prompts are most popular without linking any prompt with a specific user or device. In upcoming releases, Apple plans to apply similar methods to other Apple Intelligence features, including Image Playground, Image Wand, Memories Creation, and Writing Tools.

For Genmoji, the company anonymously polls participating devices to determine whether specific prompt fragments have been seen. Each device responds with a noisy signal – some responses reflect actual use, while others are randomised. The approach ensures that only widely-used terms become visible to Apple, and no individual response can be traced back to a user or device, the company says.

Curating synthetic data for better email summaries

While the above method has worked well with respect to short prompts, Apple needed a new approach for more complex tasks like summarising emails. For this, Apple generates thousands of sample messages, and these synthetic messages are converted into numerical representations, or ’embeddings,’ based on language, tone, and topic. Participating user devices then compare the embeddings to locally stored samples. Again, only the selected match is shared, not the content itself.

Apple collects the most frequently-selected synthetic embeddings from participating devices and uses them to refine its training data. Over time, this process allows the system to generate more relevant and realistic synthetic emails, helping Apple to improve its AI outputs for summarisation and text generation without apparent compromise of user privacy.

Available in beta

Apple is rolling out the system in beta versions of iOS 18.5, iPadOS 18.5, and macOS 15.5. According to Bloomberg’s Mark Gurman, Apple is attempting to address challenges with its AI development in this way, problems which have included delayed feature rollouts and the fallout from leadership changes in the Siri team.

Whether its approach will yield more useful AI outputs in practice remains to be seen, but it signals a clear public effort to balance user privacy with model performance.

(Photo by Unsplash)

See also: ChatGPT got another viral moment with ‘AI action figure’ trend

Want to learn more about AI and big data from industry leaders? Check out AI & Big Data Expo taking place in Amsterdam, California, and London. The comprehensive event is co-located with other leading events including Intelligent Automation Conference, BlockX, Digital Transformation Week, and Cyber Security & Cloud Expo.

Explore other upcoming enterprise technology events and webinars powered by TechForge here.

The post Apple AI stresses privacy with synthetic and anonymised data appeared first on AI News.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

苹果 人工智能 隐私保护 合成数据
相关文章