热点
关于我们
xx
xx
"
视觉文档理解
" 相关文章
DOGR: Towards Versatile Visual Document Grounding and Referring
cs.AI updates on arXiv.org
2025-07-22T04:34:00.000000Z
VDInstruct: Zero-Shot Key Information Extraction via Content-Aware Vision Tokenization
cs.AI updates on arXiv.org
2025-07-15T04:26:48.000000Z