热点
关于我们
xx
xx
"
幻觉评估
" 相关文章
EH-Benchmark Ophthalmic Hallucination Benchmark and Agent-Driven Top-Down Traceable Reasoning Workflow
cs.AI updates on arXiv.org
2025-08-05T11:10:26.000000Z
MIRAGE-Bench: LLM Agent is Hallucinating and Where to Find Them
cs.AI updates on arXiv.org
2025-07-29T04:21:43.000000Z
Unified Triplet-Level Hallucination Evaluation for Large Vision-Language Models
cs.AI updates on arXiv.org
2025-07-03T04:07:17.000000Z