Keep it Unreal: Bridging the Realism Gap for 2.5 D Recognition with Geometry Priors Only

Zakharov, S.; Planche, B.; Wu, Z.; Hutter, A.; Kosch, H.; Ilic, S.

Informatik 16 - Lehrstuhl für Anwendungen in der Medizin (Prof. Navab)

Back
Back to start of result list
Permanent link for displayed object

Title:: Keep it Unreal: Bridging the Realism Gap for 2.5 D Recognition with Geometry Priors Only
Document type:: Zeitschriftenaufsatz
Author(s):: Zakharov, S.; Planche, B.; Wu, Z.; Hutter, A.; Kosch, H.; Ilic, S.
Abstract:: With the increasing availability of large databases of 3D CAD models, depth-based recognition methods can be trained on an uncountable number of synthetically rendered images. However, discrepancies with the real data acquired from various depth sensors still noticeably impede progress. Previous works adopted unsupervised approaches to generate more realistic depth data, but they all require real scans for training, even if unlabeled. This still represents a strong requirement, especially when considering real-life/industrial settings where real training images are hard or impossible to acquire, but texture-less 3D models are available. We thus propose a novel approach leveraging only CAD models to bridge the realism gap. Purely trained on synthetic data, playing against an extensive augmentation pipeline in an unsupervised manner, our generative adversarial network learns to effectively segment depth images and recover the clean synthetic-looking depth information even from partial occlusions. As our solution is not only fully decoupled from the real domains but also from the task-specific analytics, the pre-processed scans can be handed to any kind and number of recognition methods also trained on synthetic data. Through various experiments, we demonstrate how this simplifies their training and consistently enhances their performance, with results on par with the same methods trained on real data, and better than usual approaches doing the reverse mapping. «
With the increasing availability of large databases of 3D CAD models, depth-based recognition methods can be trained on an uncountable number of synthetically rendered images. However, discrepancies with the real data acquired from various depth sensors still noticeably impede progress. Previous works adopted unsupervised approaches to generate more realistic depth data, but they all require real scans for training, even if unlabeled. This still represents a strong requirement, especially when c... »
Keywords:: CAMP,CAMPComputerVision,ComputerVision,3DV,DomainAdaptation,3DPoseEstimation
Journal title:: arXiv preprint arXiv:1804.09113
Year:: 2018
BibTeX

Occurrences:

mediaTUM Gesamtbestand Hochschulbibliographie 2018 Fakultäten Informatik Informatik 16 - Lehrstuhl für Anwendungen in der Medizin (Prof. Navab)

mediaTUM Gesamtbestand Einrichtungen Schools TUM School of Computation, Information and Technology Departments Computer Science Informatik 16 - Lehrstuhl für Anwendungen in der Medizin (Prof. Navab)Import