AI News, 16 October 2024
Scoring AI models: Endor Labs unveils evaluation tool

Endor Labs has begun scoring AI models based on security, popularity, quality, and activity. This unique capability aims to simplify the process of identifying the most secure open-source AI models. The tool targets models currently hosted on Hugging Face and helps improve AI governance: it evaluates models across multiple dimensions and provides clear scores, while developers should remain aware of the risks involved in using AI models.

🌐 Endor Labs has launched a tool that scores AI models, aiming to simplify the identification of secure open-source AI models; by providing intuitive scores, it helps developers choose suitable models on Hugging Face.

⚠️ Using AI models carries risks: the current landscape resembles the “wild west”, with people grabbing models that fit their needs while overlooking potential vulnerabilities. The tool focuses on key risk areas such as security vulnerabilities, legal licensing, and operational risk.

📊 Endor Labs’ evaluation tool applies 50 out-of-the-box checks to AI models on Hugging Face, generating an “Endor Score” from a range of factors, weighing both positive and negative signals, and is designed to be user-friendly.

Endor Labs has begun scoring AI models based on their security, popularity, quality, and activity.

Dubbed ‘Endor Scores for AI Models,’ this unique capability aims to simplify the process of identifying the most secure open-source AI models currently available on Hugging Face – a platform for sharing Large Language Models (LLMs), machine learning models, and other open-source AI models and datasets – by providing straightforward scores.

The announcement comes as developers increasingly turn to platforms like Hugging Face for ready-made AI models, mirroring the early days of readily available open-source software (OSS). This new release improves AI governance by enabling developers to “start clean” with AI models, a goal that has so far proved elusive.

Varun Badhwar, Co-Founder and CEO of Endor Labs, said: “It’s always been our mission to secure everything your code depends on, and AI models are the next great frontier in that critical task.

“Every organisation is experimenting with AI models, whether to power particular applications or build entire AI-based businesses. Security has to keep pace, and there’s a rare opportunity here to start clean and avoid risks and high maintenance costs down the road.”

George Apostolopoulos, Founding Engineer at Endor Labs, added: “Everybody is experimenting with AI models right now. Some teams are building brand new AI-based businesses while others are looking for ways to slap a ‘powered by AI’ sticker on their product. One thing is for sure, your developers are playing with AI models.”

However, this convenience does not come without risks. Apostolopoulos warns that the current landscape resembles “the wild west,” with people grabbing models that fit their needs without considering potential vulnerabilities.

Endor Labs’ approach treats AI models as dependencies within the software supply chain

“Our mission at Endor Labs is to ‘secure everything your code depends on’,” Apostolopoulos states. This perspective allows organisations to apply similar risk evaluation methodologies to AI models as they do to other open-source components.

Endor’s tool for scoring AI models focuses on several key risk areas, including security vulnerabilities, legal and licensing issues, and operational risks.

To combat these issues, Endor Labs’ evaluation tool applies 50 out-of-the-box checks to AI models on Hugging Face. The system generates an “Endor Score” based on factors such as the number of maintainers, corporate sponsorship, release frequency, and known vulnerabilities.

Positive factors in the system for scoring AI models include the use of safe weight formats, the presence of licensing information, and high download and engagement metrics. Negative factors encompass incomplete documentation, lack of performance data, and the use of unsafe weight formats.
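
Endor Labs has not published the exact formula behind the Endor Score, so the following is a minimal, purely illustrative Python sketch, assuming made-up signal names and weights, of how positive and negative factors like those above might be folded into a single number. It is not Endor Labs’ implementation.

from dataclasses import dataclass

@dataclass
class ModelSignals:
    # Hypothetical per-model signals, mirroring the factors described above.
    safe_weight_format: bool        # e.g. safetensors rather than pickled weights
    has_license: bool
    downloads: int
    maintainers: int
    known_vulnerabilities: int
    incomplete_documentation: bool
    missing_performance_data: bool

def illustrative_score(s: ModelSignals) -> float:
    # NOT the Endor Score formula: weights are invented purely to show
    # positive factors raising and negative factors lowering a score.
    score = 50.0
    score += 15 if s.safe_weight_format else -15
    score += 10 if s.has_license else -5
    score += min(s.downloads / 100_000, 10)   # cap the adoption bonus
    score += min(s.maintainers, 5)
    score -= 10 * s.known_vulnerabilities
    score -= 5 if s.incomplete_documentation else 0
    score -= 5 if s.missing_performance_data else 0
    return max(0.0, min(100.0, score))

# Example: a well-maintained, widely used model with safe weights and a licence.
print(illustrative_score(ModelSignals(True, True, 2_000_000, 4, 0, False, False)))

The only point of the sketch is the behaviour Endor Labs describes: safe weight formats, licensing, and adoption push a score up, while known vulnerabilities and missing documentation pull it down.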

A key feature of Endor Scores is its user-friendly approach. Developers don’t need to know specific model names; they can start their search with general questions like “What models can I use to classify sentiments?” or “What are the most popular models from Meta?” The tool then provides clear scores ranking both positive and negative aspects of each model, allowing developers to select the most appropriate options for their needs.
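
Endor Scores itself is queried through Endor Labs’ product, but the kind of question it answers can be loosely approximated against the public Hugging Face Hub. The sketch below uses the huggingface_hub Python library to list popular text-classification (sentiment-style) models; it ranks only by popularity, performs none of Endor Labs’ security checks, and parameter names may differ slightly between library versions.

# Rough, popularity-only approximation of “what models can I use to
# classify sentiments?” against the public Hugging Face Hub API.
# This is not Endor Labs’ tool and performs no security evaluation.
from huggingface_hub import HfApi

api = HfApi()
models = api.list_models(
    filter="text-classification",  # task tag used for sentiment-style models
    sort="downloads",              # rank by adoption
    direction=-1,                  # descending
    limit=5,
)

for m in models:
    # ModelInfo exposes the repo id as `id` in recent huggingface_hub versions.
    print(m.id, getattr(m, "downloads", "n/a"))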

“Your teams are being asked about AI every single day, and they’ll look for the models they can use to accelerate innovation,” Apostolopoulos notes. “Evaluating Open Source AI models with Endor Labs helps you make sure the models you’re using do what you expect them to do, and are safe to use.”

(Photo by Element5 Digital)

See also: China Telecom trains AI model with 1 trillion parameters on domestic chips

