Data-driven 6D Pose Tracking by Calibrating Image Residuals in Synthetic Domains

by Bowen Wen, Chaitanya Mitash, Kostas Bekris

Released as an article.

2022  

Abstract

Tracking the 6D pose of objects in video sequences is important for robot manipulation. This work presents se(3)-TrackNet, a data-driven optimization approach for long-term 6D pose tracking. It aims to identify the optimal relative pose given the current RGB-D observation and a synthetic image conditioned on the previous best estimate and the object's model. The key contributions in this context are a novel neural network architecture, which appropriately disentangles the feature encoding to help reduce domain shift, and an effective 3D orientation representation via Lie algebra. Consequently, even when the network is trained solely with synthetic data, it can work effectively over real images. Comprehensive experiments over multiple benchmarks show that se(3)-TrackNet achieves consistently robust estimates and outperforms alternatives, even though they have been trained with real images. The approach runs in real time at 90.9 Hz. Code, data and supplementary video for this project are available at https://github.com/wenbowen123/iros20-6d-pose-tracking
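
The pose update described in the abstract (a relative pose applied to the previous best estimate, with orientation expressed via Lie algebra) can be illustrated with a minimal sketch. The abstract does not specify the parameterization, so everything below is an assumption: the rotation is taken as a rotation vector w in so(3) mapped to a matrix by the Rodrigues formula, the translation t is used directly, and the relative transform is left-multiplied onto the previous pose. The function names are hypothetical and are not taken from the authors' code.

```python
import numpy as np

def hat(w):
    """Skew-symmetric matrix of a 3-vector, so that hat(w) @ x == cross(w, x)."""
    wx, wy, wz = w
    return np.array([[0.0, -wz,  wy],
                     [wz,  0.0, -wx],
                     [-wy, wx,  0.0]])

def so3_exp(w):
    """Exponential map so(3) -> SO(3) via the Rodrigues formula."""
    w = np.asarray(w, dtype=float)
    theta = np.linalg.norm(w)
    K = hat(w)
    if theta < 1e-8:
        # First-order approximation for very small rotations.
        return np.eye(3) + K
    return (np.eye(3)
            + (np.sin(theta) / theta) * K
            + ((1.0 - np.cos(theta)) / theta**2) * (K @ K))

def relative_pose(w, t):
    """Assemble a 4x4 homogeneous transform from rotation vector w and translation t.

    Assumption: t is used directly as the translation, not passed through
    the full se(3) exponential.
    """
    T = np.eye(4)
    T[:3, :3] = so3_exp(w)
    T[:3, 3] = np.asarray(t, dtype=float)
    return T

def update_pose(T_prev, w, t):
    """Apply the relative pose to the previous estimate (left-multiplication assumed)."""
    return relative_pose(w, t) @ T_prev

# Example: previous pose is the identity; the (hypothetical) predicted
# residual is a 90-degree rotation about z plus a 10 cm shift along x.
T_prev = np.eye(4)
T_new = update_pose(T_prev, w=[0.0, 0.0, np.pi / 2], t=[0.1, 0.0, 0.0])
```

In a tracker of this kind, a step like `update_pose` would run once per frame, with `w` and `t` coming from the network's prediction for the current observation and the image rendered at `T_prev`.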

Archived Files and Locations

application/pdf  2.1 MB
file_ypfqslqlezb5vm3s6icvlpqpjy
arxiv.org (repository)
web.archive.org (webarchive)
Type  article
Stage   submitted
Date   2022-02-09
Version   v2
Language   en
arXiv  2105.14391v2