钛媒体 (TMTPost): Leading new knowledge in future business and life · two days ago, 15:21
ByteDance's Volcano Engine Supercharges AI Offerings With Major Model Upgrades and New Agent Ecosystem

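ByteDance’s Volcano Engine is accelerating its push into enterprise AI and digital agent solutions by upgrading its Doubao large model family. The upgrade covers Doubao Image Editing Model 3.0, Doubao Simultaneous Interpretation Model 2.0, and a fully upgraded Doubao Large Model 1.6 series. The new models bring notable gains in performance, efficiency, and cost: the image editing model supports complex visual adjustments through natural language instructions, while the interpretation model sharply reduces latency and supports zero-shot voice cloning. Doubao already leads China’s public cloud large model services market, and its daily usage has surged. Volcano Engine is further strengthening its AI ecosystem by open-sourcing the Coze platform and launching the HiAgent “digital employee” workspace, with the aim of helping enterprises raise productivity.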

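🚀 **Doubao model family fully upgraded, with notable gains in performance and efficiency**: Volcano Engine released several model updates, including Doubao Image Editing Model 3.0 (SeedEdit) and Doubao Simultaneous Interpretation Model 2.0. SeedEdit supports complex image edits such as background removal and lighting adjustment through natural language, with applications in advertising, content creation, and other fields. Simultaneous Interpretation Model 2.0 cuts latency from 8-10 seconds to 2-3 seconds and supports zero-shot voice cloning, serving scenarios such as international business, media, and education. The flagship Doubao-Seed-1.6-flash model performs better on code, math, and reasoning tasks, with latency as low as 10 ms per token, while substantially lowering enterprise trial costs.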

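📈 **Doubao leads in market share as commercialization accelerates**: The Doubao large model is growing strongly, with daily token usage reaching 16.4 trillion, a 137-fold increase since launch. According to an IDC report, Doubao holds a 46.4% share of China’s public cloud large model services market, a clear lead. Volcano Engine is actively pursuing commercialization: its 2024 revenue exceeded RMB 12 billion, and it has set a 2025 revenue target of more than RMB 25 billion, signaling strong competitiveness in the enterprise AI market.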

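💡 **AI is shifting from tool to agent, helping enterprises work more efficiently**: Volcano Engine President Tan Dai stressed that AI is evolving from a mere tool into an “agent” that executes tasks. By open-sourcing the Coze platform and launching the HiAgent “digital employee” workspace platform, Volcano Engine is working to build an ecosystem around AI agents. Through centralized task handling, personalized interfaces, and system integration, HiAgent helps enterprises address repetitive tasks, system-switching disruptions, and decision-making blind spots, raising employee productivity.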

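🌐 **Open ecosystem and multimodal capability go hand in hand, setting the direction for AI development**: Volcano Engine open-sourced the core capabilities of its Coze platform, drawing wide developer attention and accelerating the build-out of its AI ecosystem. Its Seed 1.6-Embedding model supports joint retrieval across text, image, and video and ranks among the top entries on the MMEB_v2 image leaderboard, demonstrating its leading position in multimodal AI. Through rapid model iteration and an open ecosystem strategy, Volcano Engine aims to meet customers’ diverse needs and consolidate its position as a leader in AI infrastructure.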

AsianFin -- ByteDance’s Volcano Engine is accelerating its AI ambitions with a sweeping upgrade of its Doubao large model suite, underscoring the company’s intensifying push into enterprise AI and digital agent solutions amid China’s increasingly competitive cloud landscape.

On July 30, Volcano Engine launched several new offerings, including the Doubao Image Editing Model 3.0, Doubao Simultaneous Interpretation Model 2.0, and a fully upgraded Doubao Large Model 1.6 series. The upgrades come alongside a broader effort to bolster its AI-native infrastructure and cement its lead in China’s rapidly growing cloud-based large model services market.

Doubao’s meteoric rise is backed by strong data: Daily token usage surged to 16.4 trillion as of May, representing a 137-fold increase since its debut in May 2024. According to an IDC report, Doubao now leads China’s public cloud large model service market by a wide margin, commanding a 46.4% market share—more than Baidu AI Cloud and Alibaba Cloud combined.
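As a rough sanity check on those figures, the launch-time usage implied by the 137-fold growth can be worked out directly. This is a back-of-the-envelope sketch that assumes the multiple compares the same daily-token metric at both points in time; the variable names are illustrative.

```python
# Figures quoted in the article.
DAILY_TOKENS_NOW = 16.4e12   # 16.4 trillion tokens per day
GROWTH_MULTIPLE = 137        # "137-fold increase" since debut

# Assumption: the multiple is current daily usage divided by daily usage at launch.
implied_launch_daily_tokens = DAILY_TOKENS_NOW / GROWTH_MULTIPLE

print(f"Implied daily usage at launch: {implied_launch_daily_tokens / 1e9:.0f} billion tokens")
```

Under that assumption, the implied starting point is on the order of 120 billion tokens per day.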

Volcano Engine, ByteDance’s enterprise tech arm, is aggressively monetizing that growth. In 2024, it generated over RMB 12 billion in revenue, and it is targeting more than RMB 25 billion in 2025, which would position it to potentially surpass Baidu AI Cloud’s full-year top line.

“AI is no longer just a tool—it’s becoming the agent,” said Tan Dai, President of Volcano Engine. “Software is now executing tasks, not just enabling them.”

At the center of the latest upgrade is Doubao·Image Editing Model 3.0 (SeedEdit), which allows complex visual manipulations—like background removal, lighting adjustments, and pose alterations—through natural language prompts. The model is designed for commercial use in advertising, content creation, and e-commerce, and is available to enterprise users via Volcano Ark and to consumers via ByteDance apps like Jimeng and Doubao.

The new Doubao·Simultaneous Interpretation Model 2.0 slashes latency from 8–10 seconds to 2–3 seconds, thanks to a full-duplex system. It also supports zero-shot voice cloning, allowing for foreign language speech generation in the user’s own voice without prior training data—opening up use cases in international business, media, and education.

Meanwhile, the flagship Doubao-Seed-1.6-flash model delivers stronger performance in code, math, and reasoning tasks with latency as low as 10ms per token. Token pricing has also been aggressively cut: RMB 0.15 per million input tokens, and RMB 1.5 per million output tokens, slashing costs by up to 70% in enterprise trials.
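To put the quoted prices and latency in concrete terms, here is a minimal sketch of a cost and generation-time estimate. It assumes flat per-token pricing at the quoted rates (actual billing may be tiered or differ by context length) and treats the "as low as 10ms per token" figure as a best case. The function names and the sample workload are hypothetical, not part of any Volcano Engine API.

```python
# Rates quoted in the article (RMB per million tokens).
INPUT_PRICE_PER_MILLION = 0.15
OUTPUT_PRICE_PER_MILLION = 1.5
# Best-case latency quoted in the article (milliseconds per output token).
LATENCY_MS_PER_TOKEN = 10


def estimate_cost_rmb(input_tokens: int, output_tokens: int) -> float:
    """Estimated charge in RMB, assuming flat per-token pricing."""
    return (input_tokens * INPUT_PRICE_PER_MILLION
            + output_tokens * OUTPUT_PRICE_PER_MILLION) / 1_000_000


def estimate_generation_seconds(output_tokens: int) -> float:
    """Lower-bound generation time in seconds at the best-case latency."""
    return output_tokens * LATENCY_MS_PER_TOKEN / 1000


# Hypothetical workload: 2 million input tokens and 500,000 output tokens.
cost = estimate_cost_rmb(2_000_000, 500_000)
# Hypothetical single reply of 500 output tokens.
seconds = estimate_generation_seconds(500)

print(f"Estimated cost: RMB {cost:.2f}")
print(f"Best-case time for a 500-token reply: {seconds:.1f} s")
```

Because output tokens cost ten times as much as input tokens at these rates, output-heavy workloads dominate the bill in this estimate.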

Also notable is the multimodal Seed 1.6-Embedding model, which enables joint retrieval across text, image, and video. It currently tops the MMEB_v2 image leaderboard, outperforming rival models including Alibaba’s Qwen2 7B by 5.6 points.

Volcano Engine is doubling down on open-source as part of its strategy to build a broader ecosystem around AI agents. The core capabilities of its Coze platform—including visual development tool Coze Studio and management suite Coze Loop—were recently open-sourced. Within three days, Coze Studio had amassed over 10,000 GitHub stars.

To support intelligent agent deployment, the company rolled out a new Responses API with native context management and multimodal support, cutting development time for AI assistants from two days to just one hour. Code requirements have been reduced by 87%, according to internal benchmarks.

Volcano Engine has also launched HiAgent, a “digital employee” workspace platform that acts as a centralized task hub. It enables personalized interfaces tailored to job roles—sales, HR, operations—integrating enterprise systems and streamlining workflows. The platform is already in deployment at clients including Guangjiao Digital Technology and Xiamen University.

Zhang Xin, Volcano Engine’s VP, highlighted how HiAgent addresses three key productivity bottlenecks: repetitive rule-based tasks, system switching disruptions, and decision-making blind spots. “The goal is not to replace people, but to help them do more of what matters,” he said.

Tan Dai sees the current AI wave as the third major computing platform shift, following the PC and mobile eras. He likens Volcano Engine’s journey to a marathon—and the company is only “500 meters in.”

Looking ahead, ByteDance’s enterprise arm is targeting RMB 100 billion in annual revenue by 2030, provided macroeconomic conditions remain favorable. That growth hinges on converting its massive scale, technical edge, and early-mover advantage into long-term, defensible commercial value.

“Every link in the chain has to be strong,” Tan said. “In cloud computing, customer needs vary drastically. But in AI, we must do everything better—from the large model, to native infrastructure, to agent deployment.”

Volcano Engine’s rapid model iteration and open ecosystem approach appear designed to do just that. Whether it can maintain this breakneck pace as competition heats up from rivals like Baidu, Alibaba, and Tencent remains to be seen.

But for now, ByteDance is making a strong claim to be China’s AI infrastructure leader—not just building large models, but translating them into agents that work.
