cs.AI updates on arXiv.org 前天 12:13
Self-Supervised Monocular Visual Drone Model Identification through Improved Occlusion Handling
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

本文提出一种自监督学习方案,仅使用机载单目视频和飞行控制器数据训练基于神经网络的无人机模型。该方案首先训练一个自监督相对姿态估计模型,然后将其作为教师模型来训练无人机模型。为了在高速度和靠近障碍物的情况下工作,提出了一种改进的遮挡处理方法来训练自监督姿态估计模型。该方法可将最终里程计估计的均方根误差平均降低15%。

🚀 提出了一种自监督学习方案,用于训练基于神经网络的无人机模型,该模型仅使用机载单目视频和飞行控制器数据(IMU和电机反馈)。

👁️‍🗨️ 通过首先训练一个自监督相对姿态估计模型,然后将其作为教师模型来训练无人机模型,从而实现自监督学习。

🚧 为了使该方案能够在高速度和靠近障碍物的情况下工作,本文提出了一种改进的遮挡处理方法,用于训练自监督姿态估计模型,该方法可将最终里程计估计的均方根误差平均降低15%。

🏆 将神经无人机模型集成到传统的基于滤波器的VIO系统(ROVIO)中,在靠近障碍物的激进3D赛道轨迹上实现了卓越的里程计精度,证明了神经无人机模型的价值。

arXiv:2504.21695v1 Announce Type: cross Abstract: Ego-motion estimation is vital for drones when flying in GPS-denied environments. Vision-based methods struggle when flight speed increases and close-by objects lead to difficult visual conditions with considerable motion blur and large occlusions. To tackle this, vision is typically complemented by state estimation filters that combine a drone model with inertial measurements. However, these drone models are currently learned in a supervised manner with ground-truth data from external motion capture systems, limiting scalability to different environments and drones. In this work, we propose a self-supervised learning scheme to train a neural-network-based drone model using only onboard monocular video and flight controller data (IMU and motor feedback). We achieve this by first training a self-supervised relative pose estimation model, which then serves as a teacher for the drone model. To allow this to work at high speed close to obstacles, we propose an improved occlusion handling method for training self-supervised pose estimation models. Due to this method, the root mean squared error of resulting odometry estimates is reduced by an average of 15%. Moreover, the student neural drone model can be successfully obtained from the onboard data. It even becomes more accurate at higher speeds compared to its teacher, the self-supervised vision-based model. We demonstrate the value of the neural drone model by integrating it into a traditional filter-based VIO system (ROVIO), resulting in superior odometry accuracy on aggressive 3D racing trajectories near obstacles. Self-supervised learning of ego-motion estimation represents a significant step toward bridging the gap between flying in controlled, expensive lab environments and real-world drone applications. The fusion of vision and drone models will enable higher-speed flight and improve state estimation, on any drone in any environment.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

自监督学习 无人机 运动估计 神经网络 VIO
相关文章