热点
关于我们
xx
xx
"
多模态数据集
" 相关文章
Hydra-Bench: A Benchmark for Multi-Modal Leaf Wetness Sensing
cs.AI updates on arXiv.org
2025-07-31T04:48:16.000000Z
GAITEX: Human motion dataset from impaired gait and rehabilitation exercises of inertial and optical sensor data
cs.AI updates on arXiv.org
2025-07-30T04:12:09.000000Z
JWB-DH-V1: Benchmark for Joint Whole-Body Talking Avatar and Speech Generation Version 1
cs.AI updates on arXiv.org
2025-07-29T04:21:30.000000Z
VideoMind: An Omni-Modal Video Dataset with Intent Grounding for Deep-Cognitive Video Understanding
cs.AI updates on arXiv.org
2025-07-25T04:28:29.000000Z
Time-RA: Towards Time Series Reasoning for Anomaly with LLM Feedback
cs.AI updates on arXiv.org
2025-07-22T04:44:52.000000Z
ERR@HRI 2.0 Challenge: Multimodal Detection of Errors and Failures in Human-Robot Conversations
cs.AI updates on arXiv.org
2025-07-21T04:06:43.000000Z
VideoConviction: A Multimodal Benchmark for Human Conviction and Stock Market Recommendations
cs.AI updates on arXiv.org
2025-07-14T04:08:27.000000Z
Grounded Gesture Generation: Language, Motion, and Space
cs.AI updates on arXiv.org
2025-07-08T05:54:03.000000Z
EarthScape: A Multimodal Dataset for Surficial Geologic Mapping and Earth Surface Analysis
A Geodyssey – Enterprise Search Discovery, Text Mining, Machine Learning
2025-04-01T12:02:37.000000Z
This AI Paper by The Data Provenance Initiative Team Highlights Challenges in Multimodal Dataset Provenance, Licensing, Representation, and Transparency for Responsible Development
MarkTechPost@AI
2024-12-25T01:34:56.000000Z
腾讯建全球最大甲骨文单字数据库:一个“牛”有3500种写法
Cnbeta
2024-12-10T07:07:18.000000Z
腾讯建全球最大甲骨文单字数据库:一个“牛”有3500种写法
快科技资讯
2024-12-10T07:01:28.000000Z
ByteDance Researchers Release InfiMM-WebMath-40: An Open Multimodal Dataset Designed for Complex Mathematical Reasoning
MarkTechPost@AI
2024-09-22T03:20:33.000000Z
解开分子结构:用于化学的多模态光谱数据集
智源社区
2024-08-29T03:07:33.000000Z
Nature曝惊人内幕:论文被天价卖出喂AI!出版商狂赚上亿,作者0收入
智源社区
2024-08-19T05:52:36.000000Z
MINT-1T Dataset Released: A Multimodal Dataset with One Trillion Tokens to Build Large Multimodal Models
MarkTechPost@AI
2024-07-26T12:04:20.000000Z
MINT-1T: An Open-Source Trillion Token Multimodal Interleaved Dataset and a Key Component for Training Large Multimodal Models LMMs
MarkTechPost@AI
2024-06-20T07:01:47.000000Z