VLM_Fishai

Trending

Articles related to "VLM"

A Graph-based Approach for Multi-Modal Question Answering from Flowcharts in Telecom Documents

cs.AI updates on arXiv.org 2025-08-01T04:08:24.000000Z

Visual Language Models as Zero-Shot Deepfake Detectors

cs.AI updates on arXiv.org 2025-07-31T04:48:08.000000Z

Is your AI butler "wrecking the house"? New research reveals security vulnerabilities in household embodied agents

机器之心 2025-07-27T15:00:12.000000Z

Worth noting: how 10 industry insiders view VLA

理想 TOP2 2025-07-22T05:06:31.000000Z

[Ideas] Is there much demand for AI/VLM-based OCR?

V2EX 2025-07-19T11:56:36.000000Z

[Ideas] Is there much demand for AI/VLM-based OCR?

V2EX 2025-07-19T10:02:48.000000Z

Li Auto intelligent driving solutions series, part 1: an introduction to the end-to-end + VLM solution

掘金人工智能 2025-07-17T15:47:09.000000Z

How to support new VLMs into SGLang: A Case Study with NVILA

Large Model Systems Organization 2025-07-16T16:49:48.000000Z

CogDDN: A Cognitive Demand-Driven Navigation with Decision Optimization and Dual-Process Thinking

cs.AI updates on arXiv.org 2025-07-16T04:28:39.000000Z

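Turning unidirectional VLMs bidirectional! Renmin University, Stanford, and others propose the MoCa framework: a bidirectional multimodal encoder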

智源社区 2025-07-11T08:54:02.000000Z

MoCa: the first large-scale bidirectional multimodal representation model

PaperAgent 2025-07-08T05:59:26.000000Z

DriveMRP: Enhancing Vision-Language Models with Synthetic Motion Data for Motion Risk Prediction

cs.AI updates on arXiv.org 2025-07-08T05:53:48.000000Z

Breaking the long-video understanding bottleneck: HoPE hybrid position encoding improves VLM length generalization

机器之心 2025-06-29T10:25:47.000000Z

Four Key Features to Successfully Deploying Proprietary and Open-Source Vision Language Models

C3.AI 2025-06-25T07:44:55.000000Z

nanoVLM: a simple, lightweight pure-PyTorch codebase for training vision-language models

Hugging Face 2025-06-18T16:30:33.000000Z

nanoVLM: the simplest, most lightweight pure-PyTorch codebase for training vision-language models

掘金人工智能 2025-06-18T10:13:43.000000Z

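The first slow-thinking framework built specifically for multimodal models! Beats GPT-o1 by nearly 7 percentage points; reinforcement learning teaches VLMs to "think twice before acting"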

智源社区 2025-06-07T17:02:54.000000Z

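Teaching AI to match its effort to the task! New framework from CUHK and others cuts reasoning length by 90% while raising accuracy by 17%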

智源社区 2025-05-31T05:59:17.000000Z

Deploying large models locally

掘金人工智能 2025-04-29T09:27:54.000000Z

NVIDIA releases the Eagle 2.5 vision-language AI model: 8B parameters rivaling GPT-4o

IT之家 2025-04-23T06:33:51.000000Z
