TechCrunch News 2024年11月18日
SuperAnnotate wants to help companies manage their AI data sets
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

SuperAnnotate 是一款由兄弟二人创立的AI训练数据管理平台,旨在解决AI模型训练中数据准备和管理的难题。它提供了一个易于使用的平台,帮助用户创建、管理和评估大型AI训练数据集,涵盖数据标注、模型微调、迭代和评估等功能。SuperAnnotate 拥有众多客户,包括Databricks和Canva,并通过市场连接众包工作者进行数据标注。尽管在数据标注人员待遇方面存在一些争议,但SuperAnnotate 近期获得了 3600 万美元的 B 轮融资,将用于团队扩充、产品研发和客户拓展,力求成为适应企业不断变化需求的AI数据管理平台。

😊 **SuperAnnotate 旨在解决AI模型训练中的数据管理难题**: 兄弟二人创立SuperAnnotate,源于在大学期间训练算法时面临大量数据管理的挑战,意识到市场对于类似工具的需求。

🚀 **提供全面的AI训练数据管理功能**: SuperAnnotate 提供数据标注、模型微调、迭代和评估等工具,帮助用户创建、管理和评估大型AI训练数据集,并支持连接本地和云端数据源。

📊 **提供模型性能比较和部署功能**: 用户可以通过SuperAnnotate的平台比较不同训练数据的模型性能,并在模型准备就绪后将其部署到各种环境中。

🤝 **连接众包工作者进行数据标注**: SuperAnnotate提供了一个数据标注人员市场,帮助企业获取高质量的标注数据,但其标注人员待遇问题也引发了一些争议。

💰 **获得3600万美元B轮融资**: SuperAnnotate近期获得了3600万美元的B轮融资,将用于团队扩充、产品研发和客户拓展,以满足更多企业的需求。

High-quality data may be the key to high-quality AI. With studies finding that data set curation, rather than size, is what really affects an AI model’s performance, it’s unsurprising that there’s a growing emphasis on data set management practices. According to some surveys, AI researchers today spend much of their time on data prep and organization tasks.

Brothers Vahan Petrosyan and Tigran Petrosyan felt the pain of having to manage lots of data while training algorithms in college. Vahan went so far as to create a data management tool during his Ph.D. research on image segmentation.

A few years later, Vahan realized that developers — and even corporations — would happily pay for similar tooling. So the brothers founded a company, SuperAnnotate, to build it.

“During the explosion of innovation in 2023 surrounding models and multimodal AI, the need for high-quality datasets became more stringent, with each organization having multiple use cases requiring specialized data,” Vahan said in a statement. “We saw an opportunity to build an easy-to-use, low-code platform, like a Swiss Army Knife for modern AI training data.”

SuperAnnotate, whose clients include Databricks and Canva, helps users create and keep track of large AI training data sets. The startup initially focused on labeling software, but now provides tools for fine-tuning, iterating and evaluating data sets.

Image Credits:SuperAnnotate

With SuperAnnotate’s platform, users can connect data from local sources and the cloud to create data projects on which they can collaborate with teammates. From a dashboard, users can compare the performance of models by the data that was used to train them, and then deploy those models to various environments once they’re ready.

SuperAnnotate also provides companies access to a marketplace of crowd-sourced workers for data annotation tasks. Annotations are usually pieces of text labeling the meaning or parts of data that models train on, and serve as guideposts for models, “teaching” them to distinguish things, places and ideas.

To be frank, there are several Reddit threads about SuperAnnotate’s treatment of the data annotators it uses, and they aren’t flattering. Annotators complain about communication issues, unclear expectations, and low pay.

For its part, SuperAnnotate claims it pays fair market rates and that its demands on annotators aren’t outside the norm for the industry. We’ve asked the company to provide more detailed information about its practices and will update this piece if we hear back.

There are several competitors in the AI data management space, including startups like Scale AI, Weka and Dataloop. San Francisco-based SuperAnnotate has managed to hold its own, however, recently raising $36 million in a Series B round led by Socium Ventures, with participation from Nvidia, Databricks Ventures, Play Time Ventures and Defy.vc.

The fresh capital, which brings SuperAnnotate’s total raised to just over $53 million, will be used for augmenting its current team of around 100, for product R&D, and for growing SuperAnnotate’s customer base of roughly 100 companies.

“We aim to build a platform capable of fully adapting to enterprises’ evolving needs and offering extensive customization in data fine-tuning,” Vahan said.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

AI训练数据 SuperAnnotate 数据管理 模型训练 数据标注
相关文章