eCite Digital Repository

A comprehensive review on 3D object detection and 6D pose estimation with deep learning

Citation

Hoque, S and Arafat, M and Xu, S and Maiti, A and Wei, Y, A comprehensive review on 3D object detection and 6D pose estimation with deep learning, IEEE Access, 9 pp. 143746-143770. ISSN 2169-3536 (2021) [Refereed Article]


Preview
PDF
2Mb
  

Copyright Statement

Copyright 2021 The Author(s) Licensed under Creative Commons Attribution 4.0 International (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/

DOI: doi:10.1109/ACCESS.2021.3114399

Abstract

Nowadays, computer vision with 3D (dimension) object detection and 6D (degree of freedom) pose assumptions are widely discussed and studied in the field. In the 3D object detection process, classifications are centered on the object's size, position, and direction. And in 6D pose assumptions, networks emphasize 3D translation and rotation vectors. Successful application of these strategies can have a huge impact on various machine learning-based applications, including the autonomous vehicles, the robotics industry, and the augmented reality sector. Although extensive work has been done on 3D object detection with a pose assumption from RGB images, the challenges have not been fully resolved. Our analysis provides a comprehensive review of the proposed contemporary techniques for complete 3D object detection and the recovery of 6D pose assumptions of an object. In this review research paper, we have discussed several proposed sophisticated methods in 3D object detection and 6D pose estimation, including some popular data sets, evaluation matrix, and proposed method challenges. Most importantly, this study makes an effort to offer some possible future directions in 3D object detection and 6D pose estimation. We accept the autonomous vehicle as the sample case for this detailed review. Finally, this review provides a complete overview of the latest in-depth learning-based research studies related to 3D object detection and 6D pose estimation systems and points out a comparison between some popular frameworks. To be more concise, we propose a detailed summary of the state-of-the-art techniques of modern deep learning-based object detection and pose estimation models.

Item Details

Item Type:Refereed Article
Keywords:machine learning, deep neural network, computer vision, image processing, convolutional neural network, 3D object detection, 6D pose estimation
Research Division:Information and Computing Sciences
Research Group:Computer vision and multimedia computation
Research Field:Computer vision
Objective Division:Information and Communication Services
Objective Group:Information systems, technologies and services
Objective Field:Artificial intelligence
UTAS Author:Hoque, S (Mrs Sabera Hoque)
UTAS Author:Xu, S (Dr Shuxiang Xu)
UTAS Author:Maiti, A (Dr Ananda Maiti)
UTAS Author:Wei, Y (Dr Yuchen Wei)
ID Code:148064
Year Published:2021
Web of Science® Times Cited:1
Deposited By:Information and Communication Technology
Deposited On:2021-12-01
Last Modified:2022-04-22
Downloads:0

Repository Staff Only: item control page