Trending
Articles related to "Video Text Question Answering"
Gather and Trace: Rethinking Video TextVQA from an Instance-oriented Perspective
cs.AI updates on arXiv.org 2025-08-07T04:49:20.000000Z