Trending
Articles related to "long video description"
StoryTeller: Improving Long Video Description through Global Audio-Visual Character Identification
cs.AI updates on arXiv.org 2025-07-31T04:48:24.000000Z