Vehicle Pose Estimation Using G-Net: Multi-Class Localization and Depth Estimation

García López, Javier; Agudo, Antonio; Moreno-Noguer, Francesc

doi:10.3233/978-1-61499-918-8-355

Authors

Javier García López, Antonio Agudo, Francesc Moreno-Noguer

Pages

355 - 364

DOI

10.3233/978-1-61499-918-8-355

Series

Frontiers in Artificial Intelligence and Applications

Ebook

Volume 308: Artificial Intelligence Research and Development

Abstract

In this paper we present a new network architecture, called G-Net, for 3D pose estimation from RGB images, which is trained in a weakly supervised manner. We introduce a two-step pipeline based on region-based convolutional neural networks (CNNs) for feature localization, bounding box refinement based on non-maximum suppression, and depth estimation. G-Net estimates depth from single monocular images using a self-tuned loss function. Combining this predicted depth with the presented two-step localization allows the 3D pose of the object to be extracted. Our experiments show that the method achieves good results compared to other state-of-the-art approaches that are trained in a fully supervised manner.
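
To make two ingredients named in the abstract concrete, the following is a minimal sketch of greedy non-maximum suppression over candidate boxes, followed by lifting a box centre to a 3D point with a predicted depth. It is not the G-Net implementation: the function names (iou, nms, back_project), the 0.5 IoU threshold, and the pinhole camera model with intrinsics fx, fy, cx, cy are all assumptions made for illustration. The sketch recovers only a 3D position, not the orientation part of the pose.

```python
from typing import List, Sequence, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2) in pixels


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def nms(boxes: Sequence[Box], scores: Sequence[float],
        iou_thr: float = 0.5) -> List[int]:
    """Greedy NMS: visit boxes by descending score and keep a box only
    if it does not overlap an already kept box by more than iou_thr."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep: List[int] = []
    for i in order:
        if all(iou(boxes[i], boxes[j]) <= iou_thr for j in keep):
            keep.append(i)
    return keep


def back_project(u: float, v: float, depth: float,
                 fx: float, fy: float, cx: float, cy: float
                 ) -> Tuple[float, float, float]:
    """Lift pixel (u, v) with a depth value to camera coordinates,
    assuming an ideal pinhole camera (an assumption of this sketch)."""
    x = (u - cx) * depth / fx
    y = (v - cy) * depth / fy
    return (x, y, depth)


# Hypothetical detections: two overlapping boxes and one separate box.
boxes = [(10.0, 10.0, 50.0, 50.0),
         (12.0, 12.0, 52.0, 52.0),
         (100.0, 100.0, 140.0, 140.0)]
scores = [0.9, 0.8, 0.7]
kept = nms(boxes, scores)  # the second box is suppressed by the first

# Centre of the best box, lifted with a made-up depth and intrinsics.
x1, y1, x2, y2 = boxes[kept[0]]
point = back_project((x1 + x2) / 2, (y1 + y2) / 2, depth=10.0,
                     fx=100.0, fy=100.0, cx=30.0, cy=20.0)
```

Here the depth value and intrinsics are placeholders; in a pipeline like the one described, the depth would come from the network's monocular depth prediction and the intrinsics from camera calibration.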

IOS Press Copyright 2024
