Articles related to "LVLMs"
MAP: Mitigating Hallucinations in Large Vision-Language Models with Map-Level Attention Processing
cs.AI updates on arXiv.org
2025-08-05T17:08:29.000000Z
Self-Aware Safety Augmentation: Leveraging Internal Semantic Understanding to Enhance Safety in Vision-Language Models
cs.AI updates on arXiv.org
2025-07-30T04:11:58.000000Z
Zidong Taichu (紫东太初) open-sources a visual neural enhancement method, a plug-and-play way to end multimodal hallucination | ACL 2025
智源社区
2025-06-28T14:02:55.000000Z
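Accepted to ICML 2025! Harvard Medical School and collaborators introduce the world's first clinical thinking graph model for HIE, improving performance on neurocognitive outcome prediction by 15%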
掘金 人工智能
2025-06-23T05:48:53.000000Z
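Accepted to ICML 2025! Harvard Medical School and collaborators introduce the world's first clinical thinking graph model for HIE, improving performance on neurocognitive outcome prediction by 15%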
智源社区
2025-06-23T04:17:45.000000Z
Letting vision-language models search and write code hands-on, like o3! Visual ARFT enables multimodal agent capabilities
机器之心
2025-05-27T07:20:30.000000Z
Teaching AI to Give Better Video Critiques
Unite.AI
2025-04-01T14:17:19.000000Z
The DeepSeek-R1 wave reaches multimodal models: Visual-RFT is released, with visual task performance jumping 20%
PaperAgent
2025-03-13T12:01:47.000000Z
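Fine-grained alignment no longer needs careful annotation! Taotian (淘天) proposes visual anchored rewards, achieving multimodal alignment through self-calibration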
机器之心
2025-01-19T07:26:47.000000Z
VisOnlyQA: A New Dataset for Evaluating the Visual Perception of LVLMs (Large Vision Language Models)
MarkTechPost@AI
2024-12-10T05:19:54.000000Z
Self-Training on Image Comprehension (STIC): A Novel Self-Training Approach Designed to Enhance the Image Comprehension Capabilities of Large Vision Language Models (LVLMs)
MarkTechPost@AI
2024-10-01T23:51:12.000000Z