Mashable 2024年11月20日
Microsoft will let you clone your voice for Teams calls, powered by AI
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

微软Teams即将推出AI驱动的Interpreter工具,该工具允许用户创建其语音的数字副本,并将其实时翻译成多种语言。这意味着用户可以在会议中以自己的声音进行跨语言交流。该功能最初支持英语、法语、德语等多种语言,旨在提高远程办公和数字化社交的便捷性。然而,该技术也引发了关于安全性和技术偏见等方面的担忧,例如AI翻译可能出现幻觉或错误,以及非自愿的深度伪造等问题。微软表示,该工具旨在忠实地复制说话者的信息,并需用户同意才能启用语音模拟功能。

🤔 **AI语音克隆技术应用于Teams:**微软Teams即将推出Interpreter工具,允许用户创建其语音的数字副本,并实时将其翻译成多种语言,实现会议中跨语言的流畅交流。

🗣️ **多语种支持与实时翻译:**该功能最初支持英语、法语、德语、意大利语、日语、韩语、葡萄牙语、普通话和西班牙语,并提供实时语音到语音的翻译。

⚠️ **潜在的安全和偏见问题:**AI语音翻译技术也存在潜在风险,例如AI可能出现幻觉或错误,以及可能被用于非自愿的深度伪造等。微软强调,该工具旨在忠实地复制说话者的信息,并需用户同意才能启用语音模拟功能。

🧑‍🤝‍🧑 **提升远程办公与数字化社交的便捷性:**该功能有望使远程工作和数字化社交更易于访问,尤其对非英语使用者而言,但同时也需要谨慎考虑其潜在的负面影响。

🌍 **语音克隆技术的发展趋势:**AI语音克隆技术正受到越来越多的关注,苹果和微软等公司均已推出相关功能,主要应用于辅助功能和提升用户体验。

Microsoft Teams users will soon be able to use cloned versions of their voices to speak and translate conversation in real time, as the company unveils its new, AI-powered Interpreter tool.

Announced at the annual Microsoft Ignite conference and reported by TechCrunch, the new feature allows users to create digital replicas of their voices that can then be used to translate their speech into various languages. "Imagine being able to sound just like you in a different language. Interpreter in Teams provides real-time speech-to-speech translation during meetings, and you can opt to have it simulate your speaking voice for a more personal and engaging experience," wrote Microsoft CMO Jared Spataro in a blog post shared with the publication.

The feature will only be available to Microsoft365 subscribers, and will launch initially for English, French, German, Italian, Japanese, Korean, Portuguese, Mandarin Chinese, and Spanish.

Microsoft's Interpreter has the potential to make the business of remote work and digital socialization more accessible to a wider array of non-English speakers, though it's not yet as dynamic as a live, human translator. And beyond its express application, the tool raises even more questions about security and technological bias.

A recent study found that popular AI-powered transcription tool Whisper — also used in Microsoft's cloud computing programs — were rife for hallucinations, including inventing content or phrases when translating patient information in the medical field. This was especially true for patients with speech disorders like aphasia. The previously hyped Humane AI pin, advertised for its live translation abilities, turned out to be an inconsistent digital alternative to human translation. Addressing similar concerns for Teams' Interpreter, Microsoft told TechCrunch: "Interpreter is designed to replicate the speaker’s message as faithfully as possible without adding assumptions or extraneous information. Voice simulation can only be enabled when users provide consent via a notification during the meeting or by enabling ‘Voice simulation consent’ in settings."

The technology could have immense implications in the accessibility space, with notable figures like U.S. representative Jennifer Wexton amplifying the use of personalized high-tech voice cloning for people with atypical speech. But it has also prompted concerns about nonconsensual deepfake uses and the potential for the tech to be a tool in the arsenal of scammers. Powerful AI speech cloning tech — Microsoft's is reportedly impressively human-like — has evoked ethical concerns, with Microsoft's own CEO calling for stronger guardrails and AI governance in the face of increasing celebrity deepfakes.

Still, the buzz around voice cloning, bolstered by the AI craze, has only grown among the industry's innovators, adding to previous investments in AI speech-to-text translation. Last year, Apple announced its Personal Voice feature, a machine learning tool that creates a synthesized version of a user's voice that can be used in live text-to-speech situations, like FaceTime, and was advertised as an accessibility. Microsoft unveiled its own Personal Voice feature around the same time, powered by its Azure AI and available in 90 languages.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

微软Teams AI语音翻译 语音克隆 远程办公 AI伦理
相关文章