SECOND: Sparsely Embedded Convolutional Detection

doi:10.3390/S18103337

Open Access Journal Article

Yan Yan, +2 more

06 Oct 2018, Sensors, Vol. 18, Iss. 10, article 3337

TLDR

This paper investigates an improved sparse convolution method for voxel-based 3D convolutional networks that significantly increases the speed of both training and inference, and it introduces a new form of angle loss regression to improve orientation estimation performance.

Abstract:

LiDAR-based or RGB-D-based object detection is used in numerous applications, ranging from autonomous driving to robot vision. Voxel-based 3D convolutional networks have been used for some time to enhance the retention of information when processing point cloud LiDAR data. However, problems remain, including a slow inference speed and low orientation estimation performance. We therefore investigate an improved sparse convolution method for such networks, which significantly increases the speed of both training and inference. We also introduce a new form of angle loss regression to improve the orientation estimation performance and a new data augmentation approach that can enhance the convergence speed and performance. The proposed network produces state-of-the-art results on the KITTI 3D object detection benchmarks while maintaining a fast inference speed.
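
The abstract names the new angle loss without giving its form. As a minimal sketch, assuming the sine-error form described in the full paper (a smooth L1 penalty applied to the sine of the difference between predicted and ground-truth yaw), the regression term could look like the following. The function names and the `beta` threshold are illustrative assumptions, not identifiers from the authors' code.

```python
import math


def smooth_l1(x, beta=1.0):
    """Smooth L1 penalty on a scalar residual (quadratic near zero, linear beyond beta)."""
    ax = abs(x)
    if ax < beta:
        return 0.5 * ax * ax / beta
    return ax - 0.5 * beta


def sine_angle_loss(theta_pred, theta_true, beta=1.0):
    """Assumed sine-error angle loss: smooth L1 applied to sin(theta_pred - theta_true).

    Angles are yaw values in radians. This is a sketch of the loss form only,
    not a reproduction of any particular training pipeline.
    """
    return smooth_l1(math.sin(theta_pred - theta_true), beta)


if __name__ == "__main__":
    # Identical headings give zero loss; a small error gives a small loss.
    print(sine_angle_loss(0.3, 0.3))
    print(sine_angle_loss(0.1, 0.0))
    # Headings that differ by pi also give a loss very close to zero.
    print(sine_angle_loss(0.0, math.pi))
```

Because the sine of an angle difference vanishes at both 0 and π, a loss of this form treats a box and its 180-degree flip as nearly equivalent. It therefore needs some separate signal, such as a direction classifier, to tell the two apart.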

Citations

Vehicle detection in images of suburban highways based on the Single Shot MultiBox Detector method

Р Ю Чуйков, +1 more

Posted Content

PointPillars: Fast Encoders for Object Detection from Point Clouds

Alex H. Lang, +5 more

- 14 Dec 2018 -

arXiv: Learning

TL;DR: PointPillars uses PointNets to learn a representation of point clouds organized in vertical columns (pillars), which can be used with any standard 2D convolutional detection architecture.
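
To illustrate what "organized in vertical columns" can mean in practice, the sketch below groups raw points into pillars by their x-y grid cell and ignores height when binning. It covers only the grouping step; the learned PointNet encoding mentioned in the summary is not shown. The function name and the `pillar_size` parameter are hypothetical and chosen for illustration.

```python
import math
from collections import defaultdict


def group_into_pillars(points, pillar_size):
    """Bin (x, y, z) points into vertical columns keyed by their x-y grid cell.

    pillar_size is the side length of one square cell in the ground plane
    (an assumed parameter). z is kept with each point but not used for binning,
    so every cell spans the full height of the scene.
    """
    pillars = defaultdict(list)
    for x, y, z in points:
        key = (math.floor(x / pillar_size), math.floor(y / pillar_size))
        pillars[key].append((x, y, z))
    return dict(pillars)


if __name__ == "__main__":
    cloud = [(0.2, 0.3, 1.0), (0.9, 0.1, -0.5), (1.5, 0.2, 0.0)]
    for cell, members in sorted(group_into_pillars(cloud, 1.0).items()):
        print(cell, len(members))
```

Since each pillar maps to one cell of a 2D grid, the per-pillar features can be laid out as an image-like tensor, which is what allows a standard 2D convolutional detector to consume them.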

Proceedings Article

PointPillars: Fast Encoders for Object Detection From Point Clouds

Alex H. Lang, +5 more

TL;DR: Benchmarks suggest that PointPillars is an appropriate encoding for object detection in point clouds; the paper also proposes a lean downstream network.

Posted Content

PointRCNN: 3D Object Proposal Generation and Detection from Point Cloud

Shaoshuai Shi, +2 more

- 11 Dec 2018 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: Extensive experiments on the 3D detection benchmark of KITTI dataset show that the proposed architecture outperforms state-of-the-art methods with remarkable margins by using only point cloud as input.

Journal Article

Deep Learning for 3D Point Clouds: A Survey

Yulan Guo, +5 more

- 01 Dec 2021 -

IEEE Transactions on Pattern Analysis an...

TL;DR: This paper presents a comprehensive review of recent progress in deep learning methods for point clouds, covering three major tasks: 3D shape classification, 3D object detection and tracking, and 3D point cloud segmentation.

References

Proceedings Article

You Only Look Once: Unified, Real-Time Object Detection

Joseph Redmon, +3 more

TL;DR: Compared to state-of-the-art detection systems, YOLO makes more localization errors but is less likely to predict false positives on background, and outperforms other detection methods, including DPM and R-CNN, when generalizing from natural images to other domains like artwork.

Posted Content

Fast R-CNN

Ross Girshick

- 30 Apr 2015 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: This paper proposes a Fast Region-based Convolutional Network method (Fast R-CNN) for object detection that builds on previous work to efficiently classify object proposals using deep convolutional networks.

Posted Content

Rich feature hierarchies for accurate object detection and semantic segmentation

Ross Girshick, +3 more

- 11 Nov 2013 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: This paper proposes a simple and scalable detection algorithm that improves mean average precision (mAP) by more than 30% relative to the previous best result on VOC 2012 -- achieving a mAP of 53.3%.

Book Chapter

SSD: Single Shot MultiBox Detector

Wei Liu, +6 more

- 08 Dec 2015 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: SSD discretizes the output space of bounding boxes into a set of default boxes over different aspect ratios and scales per feature map location, and combines predictions from multiple feature maps with different resolutions to naturally handle objects of various sizes.
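
The idea of default boxes per feature map location can be sketched as follows: every cell of a square feature map receives one box per aspect ratio, all sharing a single scale. This assumes normalized image coordinates and the common convention of width `scale * sqrt(ratio)` and height `scale / sqrt(ratio)`. The function name and arguments are hypothetical, and a full detector would repeat this for several feature maps with different scales.

```python
import math


def default_boxes(fmap_size, scale, aspect_ratios):
    """Generate (cx, cy, w, h) default boxes for a square feature map.

    Coordinates are normalized to [0, 1]. Each cell is centered at
    ((col + 0.5) / fmap_size, (row + 0.5) / fmap_size) and receives one box
    per aspect ratio; every box at a given scale has area scale ** 2.
    """
    boxes = []
    for row in range(fmap_size):
        for col in range(fmap_size):
            cx = (col + 0.5) / fmap_size
            cy = (row + 0.5) / fmap_size
            for ratio in aspect_ratios:
                w = scale * math.sqrt(ratio)
                h = scale / math.sqrt(ratio)
                boxes.append((cx, cy, w, h))
    return boxes


if __name__ == "__main__":
    for box in default_boxes(2, 0.5, [1.0, 2.0])[:2]:
        print(box)
```

Calling the function once per feature map, with a larger scale on coarser maps, gives the multi-resolution set of boxes the summary refers to.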

Proceedings Article

Mask R-CNN

Kaiming He, +3 more

TL;DR: This work presents a conceptually simple, flexible, and general framework for object instance segmentation that outperforms all existing, single-model entries on every task, including the COCO 2016 challenge winners.

Related Papers (5)

VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection

Multi-view 3D Object Detection Network for Autonomous Driving

Are we ready for autonomous driving? The KITTI vision benchmark suite

PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation

PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space