红队测试_Fishai

热点

"红队测试" 相关文章

Anthropic deploys AI agents to audit models for safety

AI News 2025-07-25T13:47:52.000000Z

涉嫌欺诈性数据提取，法国警方将对马斯克和X平台展开调查；谷歌用户追踪技术可突破隐私保护工具，用户数据安全性引担忧 | 牛览

安全牛 2025-07-16T00:40:30.000000Z

A Frontier AI Risk Management Framework: Bridging the Gap Between Current AI Practices and Established Risk Management

少点错误 2025-03-13T18:37:21.000000Z

攻破AI最强守卫，赏金2万刀！Anthropic新方法可阻止95% Claude「越狱」行为

新智元 2025-02-20T16:28:23.000000Z

攻破AI最强守卫，赏金2万刀！Anthropic新方法可阻止95% Claude「越狱」行为

智源社区 2025-02-18T05:07:22.000000Z

Scale AI Research Introduces J2 Attackers: Leveraging Human Expertise to Transform Advanced LLMs into Effective Red Teamers

MarkTechPost@AI 2025-02-17T20:19:10.000000Z

红队必看：生成式AI安全的八大实战教训

互联网安全内参 2025-02-13T11:53:16.000000Z

微软：100款 GenAI 产品安全测试背后的 8 条教训与 5 个案例总结！

PaperAgent 2025-01-28T16:13:02.000000Z

Dr. Peter Garraghan, CEO, CTO & Co-Founder at Mindgard – Interview Series

Unite.AI 2024-12-30T17:31:28.000000Z

British university spinoff Mindgard protects companies from AI threats

TechCrunch News 2024-12-20T08:07:24.000000Z

首个被人类骗钱骗感情的 AI 出现了，一段话转走几十万，马斯克点赞

APPSO 2024-12-14T05:14:56.000000Z

Best-of-N Jailbreaking: A Multi-Modal AI Approach to Identifying Vulnerabilities in Large Language Models

MarkTechPost@AI 2024-12-13T12:02:41.000000Z

速递｜AI界的恋爱赏金大赛！让AI机器人爱上你就能赚钱

Z Potentials 2024-12-07T07:46:24.000000Z

How OpenAI stress-tests its large language models

MIT Technology Review » Artificial Intelligence 2024-11-26T06:17:23.000000Z

OpenAI enhances AI safety with new red teaming methods

AI News 2024-11-22T15:47:15.000000Z

Promptfoo: An AI Tool For Testing, Evaluating and Red-Teaming LLM apps

MarkTechPost@AI 2024-11-02T07:20:35.000000Z

14款被严重低估的安全红队测试工具推荐

CISO洞察 2024-10-21T15:23:43.000000Z

日本发布《人工智能红队测试方法指南》1.0

决策研究 2024-10-10T02:24:05.000000Z

用“自动化红队测试”解决AI越狱问题，Haize Labs创业7个月估值一亿美元

36kr 2024-09-11T10:34:05.000000Z

大型语言模型（LLM）的红队测试

qz安全情报分析 2024-09-11T03:38:26.000000Z

Copyright © 2019 FISHAI.All Rights Reserved