Mashable 04月10日 03:19
Meet Nova Sonic, Amazons new AI voice model
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

亚马逊发布了名为Nova Sonic的新型AI语音模型,旨在提升Alexa等语音助手的自然度和交互体验。该模型通过整合语音理解和语音生成,实现了更接近人类的语音对话。与之前的AI语音模型相比,Nova Sonic在停顿、语调和语调方面表现更出色。此外,亚马逊声称Nova Sonic在成本效益上优于OpenAI的GPT-4o模型。目前,Nova Sonic已通过亚马逊的Bedrock平台向开发者开放,预示着AI语音助手领域的竞争将进一步加剧。

🗣️ Nova Sonic通过将语音识别、大型语言模型和文本转语音等多个模型整合到一个统一模型中,实现了更自然的语音表现。这使得它能够更好地理解语音中的细微差别,从而产生更流畅、更人性化的语音输出。

👂 Nova Sonic在语音的停顿、语调和语调方面有了显著改进,使其听起来更像人类的语音。亚马逊提供了示例,用户可以亲自体验这种改进。

💰 亚马逊宣称Nova Sonic在成本方面具有优势,相比OpenAI的GPT-4o模型更具成本效益。这使得Nova Sonic在商业应用中更具吸引力。

🚀 Nova Sonic已经应用于亚马逊的下一代AI语音助手Alexa+。目前,开发者可以通过亚马逊的Bedrock平台访问Nova Sonic。

AI companies have been working on voice models for a while now, but it seems things really ramped up after OpenAI unveiled ChatGPT Voice Mode.

Now, Amazon has just introduced its new "foundation" AI voice model called Nova Sonic. And it really makes Alexa sound like she's living way in the past.

According to Amazon, Nova Sonic "unifies speech understanding and speech generation into a single model, to enable more human-like voice conversations in AI applications." With the samples provided, it certainly does seem more human-like than the company's previous iterations of AI voice models. 

For example, there are proper pauses, tone, and inflections on words depending on where they are and what they mean in a sentence. Amazon provided some samples you can listen to here and here.

Again, "more human-like" is the key description here. There are still plenty of signs that it's an AI voice, but it also does sound like a big step over previous AI voice assistants like Alexa.

Amazon says that it achieved this by combining multiple models that would traditionally be used, like speech recognition, large language models, and text-to-speech, into one single unified model. According to Amazon, it not only understands the nuances in speech to produce it, but it also understands it when a human inputs their own speech with these nuances as well.

According to TechCrunch, Nova Sonic is already powering Amazon's next-generation AI voice assistant, Alexa+.

Based on recent developments, it does seem like the big AI companies are currently focusing on voice models. So, prepare for competition in that space to heat up. Amazon is already pointing to claims that Nova Sonic is roughly 80 percent cheaper than OpenAI's GPT-4o model and promoting it as “the most cost-efficient."

Nova Sonic is currently available to developers through Amazon's enterprise AI developer platform, Bedrock.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

Nova Sonic 亚马逊 AI语音模型 Alexa 语音助手
相关文章