AI News 2024年07月09日
SenseTime SenseNova 5.5: China’s first real-time multimodal AI model
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

商汤科技发布了其大型语言模型SenseNova 5.5的升级版,其中包括SenseNova 5o,被誉为中国首个实时多模态模型。SenseNova 5o在AI交互方面取得了重大突破,其功能与GPT-4o的流式交互功能相当。该模型可以像与真人对话一样与用户互动,使其特别适用于实时对话和语音识别应用。商汤科技表示,其最新模型在多个基准测试中超越了竞争对手。

🎉 **实时多模态交互:** SenseNova 5o是商汤科技推出的中国首个实时多模态模型,其功能与GPT-4o的流式交互功能相当。该模型可以像与真人对话一样与用户互动,使其特别适用于实时对话和语音识别应用。

🚀 **性能提升:** SenseNova 5.5在整体性能方面比其前代产品SenseNova 5.0提高了30%。值得注意的改进包括数学推理、英语水平和指令遵循能力的提升。

💰 **边缘侧AI普及:** 商汤科技推出了一种经济高效的边缘侧大型模型,将每台设备的成本降低至每年9.90元人民币(1.36美元),这有可能加速其在各种物联网设备中的广泛应用。

🤝 **企业用户迁移:** 商汤科技推出了“Project $0 Go”,这是一个面向从OpenAI平台迁移的企业用户的免费入驻套餐。该计划包括5000万个代币套餐和API迁移咨询服务,旨在降低企业利用SenseNova功能的门槛。

🤖 **AI应用扩展:** 商汤科技推出了Vimi,一个可控的AI头像视频生成器。该工具可以从一张照片中创建包含精确的面部表情和上半身动作控制的短视频片段,为娱乐和交互式应用打开了新的可能性。

📈 **行业应用:** 商汤科技的大型模型技术已经在多个行业掀起波澜。在金融领域,它正在提高合规、营销和投资研究的效率。在农业领域,它正在帮助减少材料使用量20%,同时将作物产量提高15%。文化旅游行业正在看到旅行规划和预订效率的显著提升。

SenseTime has unveiled SenseNova 5.5, an enhanced version of its LLM that includes SenseNova 5o—touted as China’s first real-time multimodal model.

SenseNova 5o represents a leap forward in AI interaction, providing capabilities on par with GPT-4o’s streaming interaction features. This advancement allows users to engage with the model in a manner akin to conversing with a real person, making it particularly suitable for real-time conversation and speech recognition applications.

According to SenseTime, its latest model outperforms rivals across several benchmarks:

Dr. Xu Li, Chairman of the Board and CEO of SenseTime, commented: “This is a critical year for large models as they evolve from unimodal to multimodal. In line with users’ needs, SenseTime is also focused on boosting interactivity.

“With applications driving the development of models and their capabilities, coupled with technological advancements in multimodal streaming interactions, we will witness unprecedented transformations in human-AI interactions.”

The upgraded SenseNova 5.5 boasts a 30% improvement in overall performance compared to its predecessor, SenseNova 5.0, which was released just two months earlier. Notable enhancements include improved mathematical reasoning, English proficiency, and command-following abilities.

In a move to democratise access to advanced AI capabilities, SenseTime has introduced a cost-effective edge-side large model. This development reduces the cost per device to as low as RMB 9.90 ($1.36) per year, potentially accelerating widespread adoption across various IoT devices.

The company has also launched “Project $0 Go,” a free onboarding package for enterprise users migrating from the OpenAI platform. This initiative includes a 50 million tokens package and API migration consulting services, aimed at lowering entry barriers for businesses looking to leverage SenseNova’s capabilities.

SenseTime’s commitment to edge-side AI is evident in the release of SenseChat Lite-5.5, which features a 40% reduction in inference time compared to its predecessor, now at just 0.19 seconds. The inference speed has also increased by 15%, reaching 90.2 words per second.

Expanding its suite of AI applications, SenseTime introduced Vimi, a controllable AI avatar video generator. This tool can create short video clips with precise control over facial expressions and upper body movements from a single photo, opening up new possibilities in entertainment and interactive applications.

The company has also upgraded its SenseTime Raccoon Series, a set of AI-native productivity tools. The Code Raccoon now boasts a five-fold improvement in response speed and a 10% increase in coding precision, while the Office Raccoon has expanded to include a consumer-facing webpage and a WeChat mini-app version.

SenseTime’s large model technology is already making waves across various industries. In the financial sector, it’s improving efficiency in compliance, marketing, and investment research. In agriculture, it’s helping to reduce the use of materials by 20% while increasing crop yields by 15%. The cultural tourism industry is seeing significant boosts in travel planning and booking efficiency.

With over 3,000 government and corporate customers already using SenseNova across technology, healthcare, finance, and programming sectors, SenseTime is cementing its position as a key AI player.

(Image Credit: SenseTime)

See also: AI revolution in US education: How Chinese apps are leading the way

Want to learn more about AI and big data from industry leaders? Check out AI & Big Data Expo taking place in Amsterdam, California, and London. The comprehensive event is co-located with other leading events including Intelligent Automation Conference, BlockX, Digital Transformation Week, and Cyber Security & Cloud Expo.

Explore other upcoming enterprise technology events and webinars powered by TechForge here.

The post SenseTime SenseNova 5.5: China’s first real-time multimodal AI model appeared first on AI News.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

SenseNova 5.5 商汤科技 实时多模态模型 AI交互 边缘侧AI
相关文章