LVLMs_Fishai


Articles related to "LVLMs"

MAP: Mitigating Hallucinations in Large Vision-Language Models with Map-Level Attention Processing

cs.AI updates on arXiv.org 2025-08-05T17:08:29.000000Z

Self-Aware Safety Augmentation: Leveraging Internal Semantic Understanding to Enhance Safety in Vision-Language Models

cs.AI updates on arXiv.org 2025-07-30T04:11:58.000000Z

Zidong Taichu open-sources a visual neural enhancement method: a plug-and-play way to end multimodal hallucinations | ACL 2025

智源社区 2025-06-28T14:02:55.000000Z

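Accepted at ICML 2025! Harvard Medical School and collaborators release the world's first clinical graph-of-thought model for HIE, improving performance on neurocognitive outcome prediction by 15%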

掘金人工智能 2025-06-23T05:48:53.000000Z

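Accepted at ICML 2025! Harvard Medical School and collaborators release the world's first clinical graph-of-thought model for HIE, improving performance on neurocognitive outcome prediction by 15%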

智源社区 2025-06-23T04:17:45.000000Z

Letting vision-language models search and write code hands-on like o3! Visual ARFT brings multimodal agent capabilities

机器之心 2025-05-27T07:20:30.000000Z

Teaching AI to Give Better Video Critiques

Unite.AI 2025-04-01T14:17:19.000000Z

The DeepSeek-R1 wave reaches multimodal: Visual-RFT released, boosting visual task performance by 20%

PaperAgent 2025-03-13T12:01:47.000000Z

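Fine-grained alignment no longer needs careful annotation! Taotian proposes a vision-anchored reward, achieving multimodal alignment through self-calibration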

机器之心 2025-01-19T07:26:47.000000Z

VisOnlyQA: A New Dataset for Evaluating the Visual Perception of LVLMs (Large Vision Language Models)

MarkTechPost@AI 2024-12-10T05:19:54.000000Z

Self-Training on Image Comprehension (STIC): A Novel Self-Training Approach Designed to Enhance the Image Comprehension Capabilities of Large Vision Language Models (LVLMs)

MarkTechPost@AI 2024-10-01T23:51:12.000000Z
