TechCrunch News 01月15日
Rockfish is helping enterprises leverage synthetic data
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

Rockfish是一家利用生成式AI创建合成数据的初创公司,旨在帮助企业打破数据孤岛。该公司由Vyas Sekar和Giulia Fanti创立,起初是为了解决学术界的数据再现性危机。通过与企业沟通,他们发现商业领域也存在同样的问题。Rockfish的产品能够与AWS和Azure等数据库集成,并根据公司政策或数据用途,为用户选择最佳的数据配置方案。Rockfish专注于运营数据,如金融交易、网络安全和供应链等领域,这些数据持续生成且不断变化。该公司最近完成了400万美元的种子轮融资,总融资额达到600万美元。尽管合成数据市场竞争激烈,Rockfish仍希望通过其独特的方法和技术脱颖而出。

💡Rockfish的诞生源于解决学术界数据再现性危机,后发现企业也面临类似挑战,从而转向商业应用,利用生成式AI创建合成数据。

⚙️Rockfish专注于运营数据,如金融交易、网络安全和供应链,这些领域数据持续产生且变化迅速,与其他竞争对手形成差异化竞争优势。

💰Rockfish已完成400万美元种子轮融资,总融资额达600万美元,表明投资者对其技术和市场前景的认可。

📊Rockfish的产品可以与AWS和Azure等数据库集成,并根据公司政策或数据用途,为用户选择最佳的数据配置方案, 帮助企业打破数据孤岛。

For years, Vyas Sekar would call up Muckai Girish, an old friend from undergrad, to talk through potential startup ideas and get Girish’s opinion. The two usually talked through an idea and ended the conversation at that. When Sekar called Girish with an idea involving synthetic data in early 2022, the conversation didn’t just end when they hung up the phone.

Sekar and fellow Carnegie Mellon University colleague Giulia Fanti had been working on building synthetic data to fix the reproducibility crisis, or inability to reproduce data, within academia. While Sekar was seeing the need for a solution in academia, Girish knew his customers at the time were facing the same problem. After talking to a few enterprises, the thesis was further validated.

“At that time, it felt that this was very real and there was an opportunity,” Girish, CEO, told TechCrunch. “So that’s what got us started and over the next couple of months we spoke to some investors, people we knew, and more importantly enterprises and realized this was a significant problem and it is worth putting, you know, an entire life behind it.”

The result was Rockfish, a startup that uses generative AI to create synthetic data for operational workflows to help enterprises break down their data silos. Rockfish integrates with database providers including AWS and Azure, among others, and helps users choose the best configuration for their data based on company policies or uses for the data.

Synthetic data has increasingly become a hot topic in the world of AI, but there was already growing momentum for it when the company got started in June 2022. Girish said that Rockfish wanted to make sure that it was building a product that was differentiated from its peers and also a solution enterprises would be using daily, not just every once in a while.

That’s why the company’s product is designed to ingest data constantly and is focused on operational data, which includes data on things like financial transactions, cybersecurity, and supply chains. These areas are constantly producing data for companies and are also constantly changing. Girish thinks focusing here helps Rockfish stand apart from other competitors.

Now the company works with a handful of enterprise clients, Girish said, including streaming analytics platform Conviva, in addition to government departments including the U.S. Army and the U.S. Department of Defense.

Rockfish is announcing a $4 million seed round led by Emergent Ventures with participation from Foster Ventures, TEN13, and Dallas VC, among others. This brings the company’s total funding up to about $6 million.

Anupam Rastogi, a managing partner at Emergent Ventures, told TechCrunch that he had been tracking Sekar long before the founding of Rockfish. He said that what caused the firm to invest was “team, market, and product, in that order.” Plus, Rockfish’s focus on building for enterprises made it a better fit for Emergent than some of the other players in the space.

“The team is super high-quality data scientists, multiple PhDs,” Rastogi said. “This is a space that we think is very technically sophisticated and having that technical strength around the table is really critical. They have done a lot of the foundational work in the space, not just in the company, but the whole industry.”

While Rockfish hopes its focus helps give it a moat amongst competitors, it doesn’t change the fact that synthetic data will likely be an increasingly crowded market. AI companies are turning toward synthetic data as multiple players think the market has exhausted other AI training data.

There are already numerous startups looking to tackle the market, including Tonic AI, which has raised more than $45 million in venture funding; Mostly AI, which has raised $31 million in VC funding; and Hazy, which raised $14.5 million before being acquired by SAS in 2024, just to name a few.

Girish said the company looks to add on to its approach to synthetic data by incorporating other types of models like state space models, mathematical models that use state variables . The company also looks to improve its end-to-end features.

“It’s not like you take random data for the internet and generate synthetic data,” Girish said. “There is no guarantee that it’ll do well. But if you put all of this together for enterprises, it actually is very relevant and realistic. So that’s the key to this, and then being able to do that on a constant basis is what we find to be useful.”

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

合成数据 生成式AI 数据孤岛 企业数据 Rockfish
相关文章