AI News 2024年07月18日
SenseTime SenseNova 5.5: China’s first real-time multimodal AI model
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

商汤科技发布了其大型语言模型SenseNova 5.5的升级版,其中包括SenseNova 5o,被誉为中国首个实时多模态模型。SenseNova 5o在人工智能交互方面取得了重大进步,其功能与GPT-4o的流式交互功能相当。这一进步使用户能够像与真人交谈一样与模型互动,使其特别适合实时对话和语音识别应用。商汤科技表示,其最新模型在多个基准测试中超越了竞争对手。

🤩 **实时多模态交互:** SenseNova 5o作为中国首个实时多模态模型,实现了与GPT-4o的流式交互功能相当的实时交互能力,用户可以像与真人对话一样与模型进行互动。这使得SenseNova 5o在实时对话、语音识别等应用场景中具有独特优势。

🚀 **性能提升:** SenseNova 5.5在整体性能上相比前代产品SenseNova 5.0提升了30%,尤其是在数学推理、英语水平和指令遵循能力方面表现出色。

💰 **边缘侧大模型:** 商汤科技推出了经济实惠的边缘侧大型模型,将每台设备的成本降低至每年9.9元人民币(1.36美元),这将加速其在各种物联网设备上的广泛应用。

🤝 **免费迁移服务:** 商汤科技推出了“项目$0 Go”,为从OpenAI平台迁移的企业用户提供免费的入门套餐,包括5000万个代币和API迁移咨询服务,旨在降低企业利用SenseNova功能的门槛。

🤖 **AI应用扩展:** 商汤科技推出了Vimi,一个可控的AI头像视频生成器,该工具可以从一张照片中创建短视频片段,并对面部表情和上半身动作进行精确控制,为娱乐和互动应用开辟了新的可能性。商汤科技还升级了其SenseTime Raccoon系列,这是一套AI原生生产力工具。Code Raccoon的响应速度提高了五倍,编码精度提高了10%,而Office Raccoon已扩展到包括面向消费者的网页和微信小程序版本。

📈 **行业应用:** 商汤科技的大型模型技术已在各个行业掀起波澜。在金融领域,它提高了合规、营销和投资研究的效率。在农业领域,它帮助将材料使用量减少了20%,同时将作物产量提高了15%。文化旅游行业在旅行计划和预订效率方面看到了显著提升。

🏆 **行业地位:** 商汤科技拥有超过3000家政府和企业客户,涵盖科技、医疗保健、金融和编程领域,巩固了其作为人工智能领域关键参与者的地位。

SenseTime has unveiled SenseNova 5.5, an enhanced version of its LLM that includes SenseNova 5o—touted as China’s first real-time multimodal model.

SenseNova 5o represents a leap forward in AI interaction, providing capabilities on par with GPT-4o’s streaming interaction features. This advancement allows users to engage with the model in a manner akin to conversing with a real person, making it particularly suitable for real-time conversation and speech recognition applications.

According to SenseTime, its latest model outperforms rivals across several benchmarks:

Dr. Xu Li, Chairman of the Board and CEO of SenseTime, commented: “This is a critical year for large models as they evolve from unimodal to multimodal. In line with users’ needs, SenseTime is also focused on boosting interactivity.

“With applications driving the development of models and their capabilities, coupled with technological advancements in multimodal streaming interactions, we will witness unprecedented transformations in human-AI interactions.”

The upgraded SenseNova 5.5 boasts a 30% improvement in overall performance compared to its predecessor, SenseNova 5.0, which was released just two months earlier. Notable enhancements include improved mathematical reasoning, English proficiency, and command-following abilities.

In a move to democratise access to advanced AI capabilities, SenseTime has introduced a cost-effective edge-side large model. This development reduces the cost per device to as low as RMB 9.90 ($1.36) per year, potentially accelerating widespread adoption across various IoT devices.

The company has also launched “Project $0 Go,” a free onboarding package for enterprise users migrating from the OpenAI platform. This initiative includes a 50 million tokens package and API migration consulting services, aimed at lowering entry barriers for businesses looking to leverage SenseNova’s capabilities.

SenseTime’s commitment to edge-side AI is evident in the release of SenseChat Lite-5.5, which features a 40% reduction in inference time compared to its predecessor, now at just 0.19 seconds. The inference speed has also increased by 15%, reaching 90.2 words per second.

Expanding its suite of AI applications, SenseTime introduced Vimi, a controllable AI avatar video generator. This tool can create short video clips with precise control over facial expressions and upper body movements from a single photo, opening up new possibilities in entertainment and interactive applications.

The company has also upgraded its SenseTime Raccoon Series, a set of AI-native productivity tools. The Code Raccoon now boasts a five-fold improvement in response speed and a 10% increase in coding precision, while the Office Raccoon has expanded to include a consumer-facing webpage and a WeChat mini-app version.

SenseTime’s large model technology is already making waves across various industries. In the financial sector, it’s improving efficiency in compliance, marketing, and investment research. In agriculture, it’s helping to reduce the use of materials by 20% while increasing crop yields by 15%. The cultural tourism industry is seeing significant boosts in travel planning and booking efficiency.

With over 3,000 government and corporate customers already using SenseNova across technology, healthcare, finance, and programming sectors, SenseTime is cementing its position as a key AI player.

(Image Credit: SenseTime)

See also: AI revolution in US education: How Chinese apps are leading the way

Want to learn more about AI and big data from industry leaders? Check out AI & Big Data Expo taking place in Amsterdam, California, and London. The comprehensive event is co-located with other leading events including Intelligent Automation Conference, BlockX, Digital Transformation Week, and Cyber Security & Cloud Expo.

Explore other upcoming enterprise technology events and webinars powered by TechForge here.

The post SenseTime SenseNova 5.5: China’s first real-time multimodal AI model appeared first on AI News.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

SenseNova 商汤科技 人工智能 大模型 多模态 实时交互
相关文章