Mashable, November 20, 2024
ChatGPT’s Advanced Voice Mode could get a new 'Live Camera' feature

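ChatGPT may soon gain vision capabilities, according to strings found in its code. OpenAI demonstrated visual processing at an earlier event and has since launched other features. If the vision mode ships, users will be able to test the new feature. OpenAI has also been busy with other work recently.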

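🦘 The code in the latest ChatGPT build mentions 'Live camera' along with a related warning

📷 The code appears to indicate that ChatGPT can use the camera to view and talk about the user's surroundings

🎥 OpenAI demonstrated GPT-4o's visual processing capabilities at an event

💻 OpenAI has launched several features, such as ChatGPT Search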

ChatGPT's highly anticipated vision capabilities might be coming soon, according to some eagle-eyed sleuths.

Android Authority spotted some lines of code in the Advanced Voice Mode part of the latest ChatGPT v1.2024.317 beta build, which point to something called "Live camera." The code appears to be a warning to users to not use Live camera "for live navigation or decisions that may impact your health or safety."

Another line in the code seems to give instructions for vision capabilities saying, "Tap the camera icon to let ChatGPT view and chat about your surroundings."

ChatGPT’s evolving capabilities: Vision, voice, and beyond

ChatGPT's ability to visually process information was a major feature debuted at the OpenAI event last May that launched GPT-4o. Demos from the event showed how GPT-4o could use a mobile or desktop camera to identify subjects and remember details about the visuals. One particular demo featured GPT-4o identifying a dog playing with a tennis ball and remembering that its name is "Bowser."

Since the OpenAI event and the early access subsequently granted to a few lucky alpha testers, not much has been said about GPT-4o with vision. Meanwhile, OpenAI shipped Advanced Voice Mode to ChatGPT Plus and Team users in September.

If ChatGPT's vision mode is imminent, as the code suggests, users will soon be able to test out both components of the new GPT-4o features teased last spring.

OpenAI has been busy lately, despite reports of diminishing returns with future models. Last month, it launched ChatGPT Search, which connects the AI model to the web, providing real-time information. It is also rumored to be working on some kind of agent that's capable of multi-step tasks on the user's behalf, like writing code and browsing the web, possibly slated for a January release.
