The Verge - Artificial Intelligences, August 1, 2024
Watch ChatGPT’s new voice mode mimic accents and correct pronunciation

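OpenAI's new advanced voice mode for ChatGPT is available to some ChatGPT Plus subscribers. The mode can sing, imitate accents, and more; despite some shortcomings it performs well, and it will open to all subscribers this fall.

🎤 ChatGPT's new advanced voice mode offers a range of features, including singing, imitating accents, correcting language pronunciation, and narrative storytelling, and shows strong capabilities across them.

💬 The mode struggles with some complex requests, such as adding engine sounds, but the voice is clear and emotive, and it handles user interruptions well.

🌐 ChatGPT says it can handle input in dozens of languages. Its accent may not sound native in some of them, but it can complete story requests and react appropriately.

🎙️ Several different male- and female-sounding voices appear in the demos, but not the Scarlett Johansson-like "Sky" voice that was removed from the service in May.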

Image: The Verge

It’s been a couple of days since OpenAI rolled out ChatGPT’s new advanced voice mode, and the small group of ChatGPT Plus subscribers given access to it seem pretty impressed so far. Various clips of the feature in action have appeared online, demonstrating its ability to sing, imitate accents, correct language pronunciation, and perform narrative storytelling.

An example of the storytelling can be seen in videos in which X user @nickfloats asks ChatGPT to “tell me a story as if you’re an airline pilot telling it to passengers on a flight.” The chatbot jumps into action barely a second later, and even alters the audio to sound more like it’s coming from an intercom. ChatGPT struggled to accommodate more complex requests like layering on engine sounds, but the voice itself is clear and emotive, and ChatGPT handles user interruptions well.

In a conversation uploaded to YouTube, ChatGPT says it can handle inputs in “dozens of languages,” but the exact number can vary “depending on how you count dialects and regional variations.” One clip demonstrates the chatbot’s ability to correct the pronunciation of French words, giving specific pointers on adjusting inflection. Another language demo shows ChatGPT speaking Turkish after following a detailed request to tell an emotive story. While some Turkish X users noted that the accent didn’t sound native, it was able to complete the story request and react appropriately by laughing and crying at certain points.

The bot does a passable job with regional US accents, with one video running through a variety of examples that include New York, Boston, Wisconsin, and a stereotypical “valley girl.” Other videos also show ChatGPT’s advanced voice feature singing in different styles, producing a blues-style take on “Happy Birthday” and, amusingly, trying to imitate what animals like frogs and cats would sound like singing the same tune.

A few different male- and female-sounding voices were present across these demonstrations, though these notably don’t include the Scarlett Johansson-like “Sky” voice that was pulled from the service in May.

As for anyone who feels left out of these fun demonstrations, OpenAI spokesperson Taya Christianson told The Verge that advanced voice mode will be available to all subscribers to ChatGPT Plus (which costs $20 per month) sometime this fall.
