Trending
Articles related to "VQA"
VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning
cs.AI updates on arXiv.org 2025-07-18T04:13:47.000000Z
Prompt4Trust: A Reinforcement Learning Prompt Augmentation Framework for Clinically-Aligned Confidence Calibration in Multimodal Large Language Models
cs.AI updates on arXiv.org 2025-07-15T04:24:38.000000Z
Multimodal Extension: An Integration Scheme for the DeepSeek Vision Module
Juejin (掘金) Artificial Intelligence 2025-06-30T09:58:14.000000Z
Is CLIP Obsolete? New Work from LeCun and Saining Xie: Multimodal Training Is Stronger Without Language Supervision!
Xinzhiyuan (新智元) 2025-04-09T11:22:28.000000Z
Is CLIP Obsolete? New Work from LeCun and Saining Xie: Multimodal Training Is Stronger Without Language Supervision!
BAAI Community (智源社区) 2025-04-08T08:02:48.000000Z
Is CLIP Obsolete? New Work from LeCun and Saining Xie: Multimodal Training Is Stronger Without Language Supervision
36kr-Technology 2025-04-07T09:42:13.000000Z