热点
"LLM鲁棒性" 相关文章
Latent Adversarial Training Improves Robustness to Persistent Harmful Behaviors in LLMs
cs.AI updates on arXiv.org 2025-07-30T04:46:06.000000Z
Microsoft Researchers Propose MedFuzz: A New AI Method for Evaluating the Robustness of Medical Question-Answering LLMs to Adversarial Perturbations
MarkTechPost@AI 2024-09-14T05:05:32.000000Z