热点
"LVLMs" 相关文章
让视觉语言模型像o3一样动手搜索、写代码!Visual ARFT实现多模态智能体能力
机器之心 2025-05-27T07:20:30.000000Z
Teaching AI to Give Better Video Critiques
Unite.AI 2025-04-01T14:17:19.000000Z
DeepSeek-R1的风吹到了多模态,Visual-RFT发布,视觉任务性能飙升20%
PaperAgent 2025-03-13T12:01:47.000000Z
细粒度对齐无需仔细标注了!淘天提出视觉锚定奖励,自我校准实现多模态对齐
机器之心 2025-01-19T07:26:47.000000Z
VisOnlyQA: A New Dataset for Evaluating the Visual Perception of LVLMs (Large Vision Language Models)
MarkTechPost@AI 2024-12-10T05:19:54.000000Z
Self-Training on Image Comprehension (STIC): A Novel Self-Training Approach Designed to Enhance the Image Comprehension Capabilities of Large Vision Language Models (LVLMs)
MarkTechPost@AI 2024-10-01T23:51:12.000000Z