POE Blog 前天 01:44
报告:2025年春季人工智能模型使用趋势
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

本报告分析了2025年1月至5月Poe平台用户在文本、推理、图像、视频和音频领域的AI模型使用模式。数据显示,推理模型的使用持续增长,尤其是在DeepSeek viral moment之后,Gemini 2.5 Pro和OpenAI的多种推理模型表现强劲。图像和视频生成领域竞争日益激烈,GPT Image Generation和Kling 2.0迅速崛起,而Google Imagen3和Runway也占据重要份额。音频生成领域,ElevenLabs仍保持领先,但新兴竞争者正在涌现。报告揭示了AI生态系统的快速演变和模型多样化的趋势。

🚀 **推理模型使用显著增长**:在DeepSeek viral moment之后,推理模型在Poe平台上的消息分享比例从约2%增长到10%,表明用户对能够处理复杂任务的AI模型需求旺盛。Google的Gemini 2.5 Pro和OpenAI发布的多个推理模型(如o1-preview系列)增长迅速,用户也倾向于采用最新版本,如Claude-3.7-Sonnet取代Claude-3.5-Sonnet。

🖼️ **图像生成竞争白热化**:GPT Image Generation(GPT-Image-1)在发布后两周内迅速占据了17%的图像生成使用份额。Google的Imagen3系列用户份额稳步增长至30%,与领先的Black Forest Labs的FLUX系列(约35%)不相上下,显示出图像生成技术的快速迭代和市场竞争的激烈。

🎬 **视频生成领域新秀涌现**:Kuaishou发布的Kling家族视频生成模型,尤其是Kling-2.0-Master,在发布三周内迅速获得了Poe平台21%的视频生成使用份额,成为主要竞争者。Google的Veo 2也保持了约20%的强劲使用份额,而Runway的使用份额则有所下降。

🎤 **音频生成领域竞争渐显**:ElevenLabs在文本转语音(TTS)领域继续保持领先地位,满足了约80%的用户请求。然而,Cartesia、Unreal Speech、PlayAI和Orpheus等新兴竞争者开始崭露头角,它们通过独特的语音选项、音效和不同的性能价格配置,预示着音频生成领域的多元化发展。

💡 **AI生态快速演变,平台价值凸显**:Poe平台汇集了大量前沿AI模型,为用户提供了跨提供商的访问能力。本报告的数据反映了AI生态系统的快速演变,模型多样性和提供商竞争的加剧,进一步凸显了Poe作为模型探索、比较和利用平台的价值,尤其是在推理模型和多媒体生成领域,新趋势和模式正在不断涌现。

The AI landscape is evolving at an unprecedented pace, yet understanding demand and usage patterns, beyond standardized benchmarks or leaderboard platforms, remains a challenge. Meanwhile, the preferred model one week can easily shift with the introduction of a powerful upgrade from a frontier provider or an unexpected disruptor.

Our goal is to make Poe the best place to explore, compare, and harness the outputs of AI models. Since Poe users have provider-agnostic access to the latest frontier models in a single interface, underlying trends among them may herald broader shifts in the AI ecosystem.

Building off our previous report, this analysis displays weekly aggregated usage data from January 2025 to May 2025 among Poe users in several key, but expanded, domains: text, reasoning, image, video and audio. This includes the sustained growth of reasoning models following DeepSeek's viral moment, how image and video generation are becoming increasingly competitive, and early signs of diversification in audio.

We hope our latest findings offer researchers and the public a helpful glimpse into the rapidly expanding AI ecosystem. [1] [2]

Frontier labs are rapidly releasing smarter general-purpose text models [3]

  • OpenAI’s GPT-4.1 family and Google’s Gemini 2.5 Pro, both of which offer improved performance on real-world coding tasks, rapidly increased to message shares of ~10% and ~5%, respectively, within weeks of launch.

  • Anthropic’s Claude family (e.g. Claude 3.5 Sonnet and Claude 3.7 Sonnet) saw a ~10% absolute decline in share over the same period.

  • DeepSeek's viral moment appears to have waned, as other affordable, verbose reasoning models have been released, with DeepSeek R1’s message share declining from a peak of 7% in mid-February to 3% by the end of April.

  • Similar to the findings in the previous report, new flagship models within an individual provider’s offering appear to cannibalize their predecessors. In this case, Poe subscribers rapidly adopted Claude-3.7-Sonnet over Claude-3.5-Sonnet, although the latter retained a notable ~12% overall usage among LLMs.

Reasoning models sustain usage following DeepSeek’s viral entry earlier this year

Since the start of 2025, frontier labs have been rapidly iterating on their reasoning model offerings. This has resulted in an increase in models that are capable of spending more time and compute to solve complex tasks with more precision and reliability. Notably, the share of all text messages sent to reasoning models on Poe grew from ~2% to ~10% during the report period, peaking during the height of DeepSeek’s viral moment.

The below breaks out the message share among models with reasoning capabilities as a subcategory of text.

  • Usage of Gemini 2.5 Pro has been growing quickly among Poe subscribers, with the model obtaining a reasoning message share of ~30% within only ~6 weeks of its launch.

  • OpenAI, after releasing the category-defining reasoning model o1-preview in late 2024, continues to release more capable and affordable reasoning models at a pace unmatched by other labs, with the launch of o1-pro, o3-mini, o3-mini-high, o3, and o4-mini in just the first four months of 2025. Within the OpenAI set of reasoning models, it appears subscribers are quickly adopting the latest (e.g. o3-mini → o4-mini, o1 → o3) quite fluidly.

  • While xAI’s Grok 3 topped various problem-solving benchmarks in its February 2025 public release, Grok-3-mini continues to be the only model in the family that supports reasoning in the xAI API, which is perhaps why it accounts for less than 1% of reasoning model usage.

  • We note the early emergence of hybrid reasoning models, such as Gemini 2.5 Flash Preview and Qwen3, which can decide (or can be controlled) to vary their reasoning level conversationally (i.e., not just via API parameters). However, their collective usage remains small at ~1% in the subcategory.

Image generation is becoming increasingly competitive as quality and adherence improve

  • GPT Image Generation (GPT-Image-1) launched in the API in late April and rapidly attained 17% of image generation usage in only two weeks, mirroring its viral launch throughout March and early April in the ChatGPT app.

  • Google’s Imagen3 family has continued its steady usage growth from ~10% to ~30% share over the course of 2025, putting it on par with category-leader Black Forest Labs’ FLUX family of image generation models, which held ~35% share collectively as of last week of April.

  • The FLUX family of image generation models maintained its overall plurality share of image generation on Poe, but it declined slightly from ~45% to ~35% during the report period.

Kling 2.0 quickly emerged as a top contender in video generation in just three weeks. [4]

  • The newly released Kling family of video generation models from Chinese lab Kuaishou have rapidly garnered a collective ~30% usage share, most notably Kling-2.0-Master, which yielded 21% of all video generation on Poe at the end of April 2025, only three weeks after its release.

  • Google’s Veo 2 continues to maintain strong usage share at ~20% in the months following its February launch.

  • Category-defining video generation incumbent Runway has seen its usage share in video generation decline by ~40% to ~20% throughout the report period. [5]

ElevenLabs maintains its lead in audio generation amidst early signs of rising competition [6]

  • In audio generation (specifically text-to-speech, or “TTS”), ElevenLabs appears to be preferred by users with it fulfilling ~80% of all subscribers’ TTS requests during the report period.

  • However, competition is brewing in this space with the emergence of Cartesia, Unreal Speech, PlayAI, and Orpheus, which offer unique voice options, voice effects, and different performance and price profiles.

Conclusion

We hope sharing data from Poe's diverse user base and official integrations offers a valuable, real-world perspective on the dynamic and ever-evolving AI landscape. The increasing model diversity and provider competition helps underscore the value of our platform, for both users and creators. We’re currently observing break-out usage among reasoning models, and expect to see this continue as a top competitive driver among leading frontier labs. Multimedia is heating up and following OpenAI’s breakout offering in its new image generation capabilities, it may not be long before we see something similar among video models.

We look forward to continuing to share these important insights as we capture signs of new patterns and emerging trends. Finally, if you would like to experience access to our library of 100+ official model integrations, you can sign up on Poe today at https://poe.com/.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

AI模型 Poe平台 推理模型 图像生成 视频生成 音频生成
相关文章