a16z 8小时前
The API Battleground: A New Era of Platform WarsNew
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

随着B2B AI创业的兴起,许多AI产品依赖于Slack、Salesforce等平台的API接口获取数据。然而,Salesforce收紧Slack数据访问权限的举动预示着平台方可能开始限制第三方数据访问,以保护自身战略或推动自身AI发展。这一趋势将迫使AI初创公司调整商业模式,例如通过RPA、嵌入式服务或自建数据栈来应对,同时也为开源技术提供了重新掌控数据主权的机遇。企业用户应关注数据所有权和可移植性,而平台方则需平衡创新与用户信任。

🔑 **平台数据接口的收紧是B2B AI创业的新挑战。** 许多AI原生应用依赖Slack、Salesforce等平台的API获取数据来自动化工作流程。Salesforce限制Slack数据访问,以及JPMorgan拟对金融科技聚合器收费等事件,表明平台方正加强对数据的控制,这将影响依赖这些数据的企业级AI产品,如企业助手和知识图谱。

💡 **平台方限制数据访问的动机多样。** 主要驱动因素包括:加强数据隐私与合规性(如GDPR);保护自身战略资产和AI发展野心;以及通过锁定数据访问来限制市场竞争。 incumbents(现有平台)已处于用户关系的主导地位,限制API访问是其巩固优势的一种方式。

🚀 **AI初创公司面临多种应对策略。** 面对API访问限制,初创公司可考虑:采用RPA(机器人流程自动化)2.0模拟用户行为抓取数据;通过平台自带的市场接入(但需支付高额费用);嵌入到企业内部并与客户协商数据访问;或重建数据提取层,帮助客户重新拥有数据。这些策略均旨在规避API变动带来的风险,但可能压缩利润空间。

🌐 **开源技术为数据主权提供了解决方案。** 尽管开源LLM(如Mistral、LLaMA)和相关框架(如LangChain)本身不直接提供专有数据访问,但它们赋能企业提取、处理、存储和独立利用自身数据。向量数据库尤为关键,它们支持在平台生态之外进行AI驱动的相似性搜索和检索,帮助企业构建自主知识图谱和AI助手,打破供应商锁定。

🤝 **生态各方需调整策略以适应变化。** 初创公司应设计更具弹性的数据提取层,积极谈判早期合作,并尝试拥有部分数据栈。企业买家应坚持数据所有权,警惕平台原生AI锁定,并偏好提供数据可移植性的合作伙伴。平台方则需谨慎行事,避免过度限制导致创新放缓和用户流失,应提供透明的访问条款和数据导出路径。

As an investor who’s met hundreds of B2B AI startups, I can tell you with confidence that one trend is crystal clear: modern AI products almost always depend on incumbents’ data, accessed via APIs — Slack, Salesforce, Zendesk, QuickBooks, Jira, LinkedIn and beyond. The typical AI-native app needs the valuable information stored in various systems of record to automate a given set of workflows (thus solving the Messy Inbox Problem). But what happens when these platforms start to hoard their data instead of allowing permissioned third parties to access it?

Salesforce’s recent tightening of access to Slack data is the canary in the coal mine. As of mid-2025, third-party tools can no longer bulk index Slack messages. Non-Salesforce-Marketplace apps are rate limited and cannot store historical data long-term. This move disrupts enterprise copilots, knowledge graphs, and AI agents that were built on this foundational access.

But this isn’t just about Slack! It would appear permissioned data access is fully under siege. JPMorgan has threatened to charge leading fintech account aggregators $300 million per year for data access. Microsoft is asserting stricter control over platforms like Bing and Github. Other incumbents are likely to follow suit. If data access becomes a closed-loop system, what breaks — and what is lost?

As my partner Seema recently wrote, “The platform wars are just beginning — and they won’t be fought with product, they’ll be fought at the API level.” I agree. And while API access may not be cut off completely, the growing restrictions, gating, and monetization pressures will radically reshape how startups build, grow, and price their AI products.

Why are incumbents doing this?

The driving motivations for these restrictions typically include:

  • Strengthening data privacy and compliance amid tightening regulations (e.g., GDPR, DORA)
  • Protecting strategic assets and paving the way for incumbents’ own AI and platform ambitions
  • Restricting market competition by locking out rivals from valuable platform and data access

By definition, incumbents are already in the driver’s seat with their customers — why make it easier to get booted to the back?

Will customers rebel — or adapt?

Enterprises crave integration. They’ve built internal copilots that ingest Slack, Salesforce, and Jira to answer questions, draft emails, summarize threads, and coordinate incidents. If those taps run dry, productivity drops and internal teams scramble. There are entire ops teams whose responsibility is to prevent data and decision-making from becoming siloed. So we expect these blockages will not occur without a fight.

The truth is, while many customers technically own their data, incumbents still control the pipes. As my partner Alex likes to say, “the best companies [i.e., systems of record] have hostages, not customers.” However, in most cases, even “hostages” won’t tolerate a full-on API blockade. They’ll revolt if they can’t use tools they love. That’s why, as Seema put it, “access stays open — but becomes more expensive.” Incumbents are unlikely to flip the kill switch. They’ll squeeze the ecosystem with rate limits, clunky sandboxes, higher access fees, opaque review processes, and stricter gating behind marketplace policies. This is API war by attrition, not revocation.

We can look to open banking for precedent here. Plaid was founded to let users securely link their bank accounts to prominent fintech applications. Despite Dodd Frank Section 1033, which clearly established that consumers have access to their own financial data, many banks attempted to block Plaid’s access, ultimately fearing that they were being disintermediated from their customers. But something important happened: in many cases, customers had grown so loyal to their fintech apps that they were more likely to leave the bank blocking Plaid than they were the fintech app that could no longer connect. 

The implications here are twofold: 1) it might be prudent for regulators to more clearly delineate ownership of data between customer and software provider (ideally affirming the customer’s right to access it wherever they’d like), and 2) it’s possible that the early magic and clear utility offered by B2B AI applications will inspire customers to seriously rebel against any incumbent attempts to restrict access.

If incumbents do block access, what happens to startups that rely on the data?

The startups most exposed are those offering unified search, enterprise copilots, automated summarization, and knowledge graph aggregation — basically, anything that needs to pull from one or multiple systems of record to deliver on its product promise. If your product assumes broad, persistent, multi-platform indexing, your model may be at risk. The good news is, there are options:

  • Lean into RPA 2.0 — computer use to circumnavigate APIs. Simulate human behavior to actually go into the system of record and pull the necessary data. Clunkier for sure, but an option.
  • GTM via marketplaces owned by the incumbents (though that comes with a huge tax)
  • Pivot to embed within the enterprise and negotiate access on a per-customer basis. We expect to see many consultative AI automation providers.
  • Rebuild from the ingestion layer up, helping customers re-own their own data.

None of these strategies are bulletproof, but they all provide protection against any sudden API access changes. The most immediate risk? Margins will compress. Platform fees + rate limits = weaker unit economics + longer sales cycles. 

The open-source opportunity: reclaiming data sovereignty

Yes, open source offers a powerful counter-narrative, but it’s not a silver bullet. While open-source LLMs (Mistral, LLaMA), orchestration frameworks (LangChain, LlamaIndex), and vector databases (Pinecone) don’t magically grant access to proprietary incumbent data, they provide a critical mechanism for enterprises to regain control over their own data assets. 

When an enterprise can extract its data (or parts of it) from a proprietary platform, open-source tools allow them to process, store, and leverage that data independently. Vector databases, in particular, are vital here: they efficiently store and index the numerical representations (embeddings) of complex, unstructured data (like text from Slack or documents from Salesforce) enabling fast, AI-powered similarity search and retrieval outside the incumbent’s ecosystem. 

This means an enterprise can build its own knowledge graphs and AI copilots on its data, deployed on its infrastructure, with open-source models, ultimately breaking free from existing vendor lock-in and dictating its own data destiny. Where open-source truly shines is in portability: it allows you to deploy your solutions wherever the data lives, offering a crucial bulwark against future data clampdowns. Absent any regulatory action — like the aforementioned hypothetical equivalent of Dodd Frank 1033 for enterprise data — open source may serve as the key unlock to enterprises reclaiming control.

Will startups build the entire stack?

Absolutely. We’re already seeing early signals:

  • Horizontal infrastructure providers like Databricks, Reducto, and Pinecone are powering ingestion-to-inference pipelines.
  • Vertical players like Harvey embed tightly with partners, customizing stack and model in lockstep with customer workflows.
  • More and more companies are becoming mini–consulting/SaaS hybrids: services upfront to secure the data, productized layers behind the scenes to scale.

In an access-restricted world, “full-stack AI startup” stops being a trope and starts becoming the most defensible go-to-market strategy.

How can each participant in the ecosystem plan for this potential future?

Founders and startups:

  • Abstract your ingestion layers: design for API failure, include scrapers, embeds, forward-deployed agents, computer use models and RPA
  • Negotiate early partnerships: join marketplaces, secure data-sharing contracts, become embedded infrastructure.
  • Own at least part of the data stack: build import tools, host within the customer’s firewall, or shift to BYO-data deployments.

Open-source strategically: contribute or maintain ingestion infrastructure as insurance against lockouts.

Enterprise software buyers:

  • Control your data destiny: insist on software where you own the index.
  • Be cautious of native AI lock-in: if an LLM can only see what one vendor shows it, it’s not your copilot — it’s theirs.
  • Favor partners who offer portability: can you swap infra? Move models? Preserve auditability?

Incumbents:

  • Be careful not to overreach — platforms that gate too aggressively may slow innovation and trigger quiet customer churn.
  • Offer tiered access, transparent terms, and “data export” pathways to avoid regulatory backlash or mass defection.
  • Understand that you’re no longer competing with startups, you’re competing with your own ecosystem’s trust.

Final word

We’re entering an era where data access is the single most important strategic input for AI. If the data gets locked down, the future bifurcates: 

Path 1: Closed-stack incumbents, whose AI features are lackluster — but they own the customer’s workflows end-to-end. A true hostage situation!

Path 2: Full-stack challengers, who go deeper, integrate tighter, and offer customers freedom, control, and superior outcomes.

Founders building in B2B AI must ask themselves now, not later:

“If every API I use disappeared tomorrow, would I still have a business?”

If the answer is no, it’s time to rebuild. Because in this next chapter, data could very well be the moat, the product, and the leverage.

                                                                    </div>

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

B2B AI API限制 数据主权 开源技术 平台战略
相关文章