Trending
Articles related to "Visual Document Understanding"
DOGR: Towards Versatile Visual Document Grounding and Referring
cs.AI updates on arXiv.org 2025-07-22T04:34:00.000000Z
VDInstruct: Zero-Shot Key Information Extraction via Content-Aware Vision Tokenization
cs.AI updates on arXiv.org 2025-07-15T04:26:48.000000Z