GreatAIPrompts 2024年12月19日
OpenAI Unveils Voice Engine That Can Copy Human Voices, But Won’t Share It Yet
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

OpenAI发布了名为Voice Engine的全新文本转语音AI模型,该模型仅需15秒的录音即可合成逼真的语音。尽管该技术在阅读辅助、内容创作和个性化语音方面具有巨大潜力,但由于担心潜在的滥用风险,OpenAI决定暂缓大规模发布,仅进行预览。目前,OpenAI已与部分合作伙伴进行测试,并要求他们遵守严格的使用条款,包括禁止未经许可的模仿和强制获得声音克隆者的知情同意。此外,所有生成的语音样本都带有水印,以便追溯来源。为了应对潜在风险,OpenAI提出了三项建议:逐步淘汰银行账户的语音验证、加强公众对AI欺骗性内容的教育,以及加速开发音频内容溯源技术。OpenAI强调,在广泛应用合成语音技术时,需要采取谨慎和知情的态度。

📢 OpenAI的Voice Engine仅需15秒音频即可克隆人声,实现文本转语音的惊人效果。

⚠️ 尽管技术潜力巨大,OpenAI因担忧滥用风险,选择暂缓大规模发布,仅限预览。

🔒 OpenAI与合作伙伴测试时,要求签署协议,禁止未经许可模仿,并强制声音克隆者知情同意。

💧 所有Voice Engine生成的语音均带有水印,便于追溯来源。

💡 OpenAI建议社会逐步淘汰语音验证,加强AI内容欺骗性的教育,并加速音频溯源技术发展。

April 2nd, 2024: OpenAI, the company behind the popular ChatGPT, has announced Voice Engine, a new text-to-speech AI model that can create synthetic voices based on a 15-second segment of recorded audio.

The technology, developed in late 2022, has the potential to provide numerous benefits, such as reading assistance, global reach for creators, and personalized speech options for non-verbal individuals.

Voice engine generated audio

However, despite the potential advantages, OpenAI has decided to preview the technology but not widely release it at this time due to concerns about potential misuse.

The company initially planned to launch a pilot program for developers to sign up for the Voice Engine API earlier this month but scaled back its ambitions after considering the ethical implications.

In a statement, OpenAI said, “We are choosing to preview but not widely release this technology at this time. We hope this preview of Voice Engine both underscores its potential and also motivates the need to bolster societal resilience against the challenges brought by ever more convincing generative models.”

The company has been testing the technology with select partner companies since last year, requiring them to agree to terms of use that prohibit impersonation without consent and mandate informed consent from individuals whose voices are being cloned.

OpenAI has also implemented a watermark in every voice sample to assist in tracing the origin of any voice generated by its Voice Engine model.

To address the potential risks associated with voice-cloning technology, OpenAI has provided three recommendations for society to adapt: phasing out voice-based authentication for bank accounts, educating the public about the possibility of deceptive AI content, and accelerating the development of techniques to track the origin of audio content.

The company emphasizes the need for a cautious and informed approach to the broader release of synthetic voice technology.

“We hope to start a dialogue on the responsible deployment of synthetic voices and how society can adapt to these new capabilities,” OpenAI stated. “Based on these conversations and the results of these small scale tests, we will make a more informed decision about whether and how to deploy this technology at scale.”

As the development of voice-cloning technology continues to advance, it is crucial for companies like OpenAI to consider the potential risks and ethical implications while working to harness the benefits for society.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

OpenAI Voice Engine AI语音合成 伦理风险 技术安全