热点
关于我们
xx
xx
"
文档理解
" 相关文章
35%准确率蒸发!字节&华科WildDoc揭示多模态文档理解鲁棒性短板
PaperWeekly
2025-06-08T06:37:42.000000Z
英伟达发布 Llama Nemotron Nano VL AI:高效精准,攻克复杂文档解析难题
IT之家
2025-06-05T00:13:15.000000Z
NVIDIA AI Releases Llama Nemotron Nano VL: A Compact Vision-Language Model Optimized for Document Understanding
MarkTechPost@AI
2025-06-04T06:55:52.000000Z
多模态文档理解新挑战!字节跳动、华中科技大学联合发布WildDoc基准,揭示真实场景下MLLMs的文档理解鲁棒性短板
我爱计算机视觉
2025-05-26T13:07:16.000000Z
Reducto AI Released RolmOCR: A SoTA OCR Model Built on Qwen 2.5 VL, Fully Open-Source and Apache 2.0 Licensed for Advanced Document Understanding
MarkTechPost@AI
2025-04-06T05:10:28.000000Z
千页只需7块钱,Mistral发布世界最强文件扫描API,实测仍有缺陷
机器之心
2025-03-07T07:39:28.000000Z
IBM AI Releases Granite-Vision-3.1-2B: A Small Vision Language Model with Super Impressive Performance on Various Tasks
MarkTechPost@AI
2025-02-08T05:05:06.000000Z
Anthropic Introduces Claude 3.5 Sonnet: The AI That Understands Text, Images, and More in PDFs
MarkTechPost@AI
2024-11-06T06:19:56.000000Z
Optimizing Document Understanding with DocOwl2: A Novel High-Resolution Compression Architecture
MarkTechPost@AI
2024-09-11T11:35:50.000000Z
2B多模态新SOTA!华科、华南理工发布Mini-Monkey,专治「切分增大分辨率」后遗症
智源社区
2024-08-13T17:07:30.000000Z
GPT-4o弱点暴露了,PDF长文档阅读理解仅45分
智源社区
2024-08-04T16:36:58.000000Z
Streamline insurance underwriting with generative AI using Amazon Bedrock – Part 1
AWS Machine Learning Blog
2024-08-01T16:31:57.000000Z
This AI Paper from Snowflake Evaluates GPT-4 Models Integrated with OCR and Vision for Enhanced Text and Image Analysis: Advancing Document Understanding
MarkTechPost@AI
2024-06-12T15:31:34.000000Z