MarkTechPost@AI 2024年09月16日
Integrating Neural Systems for Visual Perception: The Role of Ventral Temporal Cortex VTC and Medial Temporal Cortex MTC in Rapid and Complex Object Recognition
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

文章探讨人类和灵长类动物在多时间尺度下的视觉感知,包括VTC和MTC在其中的作用,以及通过多种实验进行的研究和分析。

🧠人类和灵长类动物的视觉感知在多时间尺度下进行,VTC可在200毫秒内识别一些视觉属性,腹侧颞叶皮质(VTC)对快速视觉处理起作用,但对于整合视觉序列的了解较少。

🔍斯坦福研究者通过比较人类视觉表现和猕猴VTC记录,发现MTC在对象感知中起关键作用,MTC受损的人类表现类似VTC模型,眼动追踪实验揭示人类利用顺序注视模式进行复杂视觉推断。

📊研究使用多种实验设置和数据集,包括不同方向和设置的对象图像,通过交叉验证策略训练线性分类器检测异常对象,并使用CNN模型评估VTC模型性能,对比模型视觉处理与人的相似推断。

🕒研究比较了人类在时间受限和不受限的两种视觉模式下的表现,时间受限任务中参与者依赖即时视觉处理,而时间不受限时人类表现超过VTC支持的性能,表明人类在长时间观察时有超越VTC的能力。

Human and primate perception occurs across multiple timescales, with some visual attributes identified in under 200ms, supported by the ventral temporal cortex (VTC). However, more complex visual inferences, such as recognizing novel objects, require additional time and multiple glances. The high-acuity fovea and frequent gaze shifts help compose object representations. While much is understood about rapid visual processing, less about integrating visual sequences is known. The medial temporal cortex (MTC), particularly the perirhinal cortex (PRC), may aid in this process, enabling visual inferences beyond VTC capabilities by integrating sequential visual inputs.

Stanford researchers evaluated the MTC’s role in object perception by comparing human visual performance to macaque VTC recordings. While humans and VTC perform similarly with brief viewing times (<200ms), human performance significantly surpasses VTC with extended viewing. MTC plays a key role in this improvement, as MTC-lesioned humans perform like VTC models. Eye-tracking experiments revealed that humans use sequential gaze patterns for complex visual inferences. These findings suggest that MTC integrates visuospatial sequences into compositional representations, enhancing object perception beyond VTC capabilities.

Researchers used a dataset of various object images presented in different orientations and settings to estimate performance based on VTC responses and compare it with human visual processing. They implemented a cross-validation strategy where trials featured two typical objects and one outlier in randomized configurations. Neural responses from the brain’s high-level visual areas were then used to train a linear classifier to detect the odd object. This process was repeated multiple times, with results averaged to determine a performance score for distinguishing each pair of objects.

For comparison, a CNN model, pre-trained for object classification, was used to evaluate VTC model performance. The images were preprocessed for the CNN, and a similar experimental setup was followed, where a classifier was trained to detect odd objects in various trials. The model’s accuracy was tested and compared to neural response-based predictions, offering insights into how closely the model’s visual processing mirrored human-like inference.

The study compares human performance in two visual regimes: time-restricted (less than 200ms) and time-unrestricted (self-paced). In time-restricted tasks, participants rely on immediate visual processing since there’s no opportunity for sequential sampling through eye movements. A 3-way visual discrimination task and a match-to-sample paradigm were used to assess this. Results showed a strong correlation between time-restricted human performance and the performance predicted by the high-level VTC of macaques. However, with unlimited viewing time, human participants significantly outperformed VTC-supported performance and computational models based on VTC. This demonstrates that humans exceed VTC capabilities when given extended viewing times, suggesting reliance on different neural mechanisms.

The study reveals complementary neural systems in visual object perception, where the VTC enables rapid visual inferences within 100ms, while the MTC supports more complex inferences through sequential saccades. Time-restricted tasks align with VTC performance, but with more time, humans surpass VTC capabilities, reflecting MTC’s integration of visuospatial sequences. The findings emphasize MTC’s role in compositional operations, extending beyond memory to perception. Models of human vision, like convolutional neural networks, approximate VTC but fail to capture MTC’s contributions, suggesting the need for biologically plausible models that integrate both systems.


Check out the Paper. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter and join our Telegram Channel and LinkedIn Group. If you like our work, you will love our newsletter..

Don’t Forget to join our 50k+ ML SubReddit

FREE AI WEBINAR: ‘SAM 2 for Video: How to Fine-tune On Your Data’ (Wed, Sep 25, 4:00 AM – 4:45 AM EST)

The post Integrating Neural Systems for Visual Perception: The Role of Ventral Temporal Cortex VTC and Medial Temporal Cortex MTC in Rapid and Complex Object Recognition appeared first on MarkTechPost.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

视觉感知 VTC MTC 对象识别
相关文章