Articles related to "Large Multimodal Models"
MMSearch-R1: End-to-End Reinforcement Learning for Active Image Search in LMMs
MarkTechPost@AI
2025-04-07T04:08:41.000000Z
Salesforce AI Research Introduce xGen-MM (BLIP-3): A Scalable AI Framework for Advancing Large Multimodal Models with Enhanced Training and Performance Capabilities
MarkTechPost@AI
2024-08-19T22:04:54.000000Z
MINT-1T Dataset Released: A Multimodal Dataset with One Trillion Tokens to Build Large Multimodal Models
MarkTechPost@AI
2024-07-26T12:04:20.000000Z
Visual Haystacks Benchmark: The First “Visual-Centric” Needle-In-A-Haystack (NIAH) Benchmark to Assess LMMs’ Capability in Long-Context Visual Retrieval and Reasoning
MarkTechPost@AI
2024-07-24T07:19:20.000000Z
LLaVA-NeXT-Interleave: A Versatile Large Multimodal Model LMM that can Handle Settings like Multi-image, Multi-frame, and Multi-view
MarkTechPost@AI
2024-07-13T16:46:13.000000Z
LongVA and the Impact of Long Context Transfer in Visual Processing: Enhancing Large Multimodal Models for Long Video Sequences
MarkTechPost@AI
2024-06-29T07:01:45.000000Z