Latent, February 1
The Agent Reasoning Interface: o1/o3, Claude 3, ChatGPT Canvas, Tasks, and Operator — with Karina Nguyen of OpenAI

 

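The AI Engineer Summit is coming up, and Karina Nguyen will deliver OpenAI's closing keynote there. After working at Notion, Square, and several other companies, she joined Anthropic and contributed to Claude 1, 2, and 3. She now works on model behavior research at OpenAI, building new interaction paradigms such as ChatGPT Canvas and Tasks. She shares her process for productive AI research and product work: define requirements, secure resources, prototype, evaluate, train the model, test, and ship. Karina stresses the potential of tools like Canvas and Tasks, argues that they should have appeared earlier, and describes how GPT-4o can collaborate as a creative partner. She also discusses where AI agents are headed, from one-off actions to long-term delegation in complex environments.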

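💡 Karina Nguyen works on model behavior research at OpenAI, building new interaction paradigms such as ChatGPT Canvas and Tasks, with the aim of improving AI reasoning capabilities and the user experience.

🎨 Canvas is a new application of GPT-4o that infers user intent during writing and coding. Its core behaviors are triggering the canvas, generating diverse content, making targeted edits, rewriting documents, and providing inline comments, which lets it give precise feedback and suggestions.

🎯 Tasks marks one stage in the development of AI agents, which progress from one-off actions to collaboration and, ultimately, to long-term delegation in complex environments. Canvas and Tasks belong to the first two stages; the form the third stage will take is yet to be developed. OpenAI post-trains the model with novel synthetic data generation techniques to strengthen these core behaviors, and uses more than 20 automated internal evals to measure progress.

⚙️ Karina shares her process for productive AI research and product work: define requirements, secure resources, prototype, evaluate, train the model, test, and ship. The process emphasizes iterating from prototype to launch so that products reach users quickly.

🗣️ Karina believes that tools like Canvas and Tasks should have appeared earlier and stresses their potential to improve how people interact with AI. She also shares her view of where AI agents are headed: AI will gradually evolve into a smarter, more autonomous assistant.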

Sponsorships and tickets for the AI Engineer Summit are selling fast! See the new website with speakers and schedules live!

If you are building AI agents or leading teams of AI Engineers, this will be the single highest-signal conference of the year for you, taking place February 20-22 in NYC.

We’re pleased to share that Karina will be presenting OpenAI’s closing keynote at the AI Engineer Summit. We were fortunate to get some time with her today to introduce some of her work, and hope this serves as nice background for her talk!


There are very few early AI careers that have been as impactful as Karina Nguyen’s. After stints at Notion, Square, Dropbox, Primer, the New York Times, and UC Berkeley, she joined Anthropic as employee ~60 and worked in a wide range of research and product roles on Claude 1, 2, and 3. We’ll just let her LinkedIn speak for itself:

incredible

Now, as Research manager and Post-training lead in Model Behavior at OpenAI, she creates new interaction paradigms for reasoning interfaces and capabilities, like ChatGPT Canvas, Tasks, SimpleQA, streaming chain-of-thought for o1 models, and more via novel synthetic model training.

Ideal AI Research+Product Process

In the podcast we got a sense of what Karina has found works for her and her team to be as productive as they have been: define the requirements, secure resources, prototype, evaluate, train the model, test, and ship.

We could turn this into a snazzy viral graphic but really this is all it is. Simple to say, difficult to do well. Hopefully it helps you define your process if you do similar product-research work.

Show Notes

Timestamps
