TechCrunch News 2024年12月13日
ChatGPT now understands real-time video, seven months after OpenAI first demoed it
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

OpenAI为ChatGPT推出近七个月前演示过的实时视频功能。用户可通过手机指向物体,ChatGPT近实时响应,还能理解屏幕内容。该功能今日开始推出,下周完成。此前曾多次延迟,且存在错误。OpenAI此前还专注于将语音模式推广到更多平台和欧盟用户。

🎥ChatGPT推出实时视频功能,用户可通过手机使用

👀该功能可理解物体及屏幕内容,也存在错误

⏳功能推出多次延迟,今起逐步推广

🌍OpenAI此前专注将语音模式推广到更多地区

OpenAI has finally released the real-time video capabilities for ChatGPT that it demoed nearly seven months ago.

On Thursday during a livestream, the company said that Advanced Voice Mode, its human-like conversational feature for ChatGPT, is getting vision. Using the ChatGPT app, users subscribed to ChatGPT Plus or Pro can point their smartphones at objects and have ChatGPT respond in near-real-time.

Advanced Voice Mode with vision can also understand what’s on a device’s screen, via screen sharing. It can explain various settings menus or give suggestions on a math problem.

The rollout of Advanced Voice Mode with vision will start today, OpenAI says, and wrap up in the next week.

In a recent demo on CNN’s 60 Minutes, OpenAI president Greg Brockman had Advanced Voice Mode with vision quiz Anderson Cooper on his anatomy skills. As Cooper drew body parts on a blackboard, ChatGPT could “understand” what he was drawing.

Image Credits:OpenAI

“The location is spot on,” the assistant said. “The brain is right there in the head. As for the shape, it’s a good start. The brain is more of an oval.”

In that same demo, Advanced Voice Mode with vision made a mistake on a geometry problem, however — suggesting that it’s prone to hallucinating.

Advanced Voice Mode with vision has been delayed multiple times — reportedly in part because OpenAI announced the feature far before it was production-ready. In April, OpenAI promised that Advanced Voice Mode would roll out to users “within a few weeks.” Months later, the company said it needed more time.

When Advanced Voice Mode finally arrived in early fall for some ChatGPT users, it lacked the visual analysis component. In the lead-up to today’s launch, OpenAI has focused most of its attention on bringing the voice-only Advanced Voice Mode experience to additional platforms and users in the EU.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

ChatGPT 实时视频 功能延迟 视觉分析
相关文章