Object Recognition in Aerial Images Using Convolutional Neural Networks

doi:10.3390/JIMAGING3020021

Open AccessJournal ArticleDOI

Object Recognition in Aerial Images Using Convolutional Neural Networks

Matija Radovic, +2 more

- 14 Jun 2017 -

Journal of Imaging

- Vol. 3, Iss: 2, pp 21

TLDR

Using a convolutional neural network implemented in the “YOLO” (“You Only Look Once”) platform, objects can be tracked, detected, and classified from video feeds supplied by UAVs in real-time.

Abstract:

There are numerous applications of unmanned aerial vehicles (UAVs) in the management of civil infrastructure assets. A few examples include routine bridge inspections, disaster management, power line surveillance and traffic surveying. As UAV applications become widespread, increased levels of autonomy and independent decision-making are necessary to improve the safety, efficiency, and accuracy of the devices. This paper details the procedure and parameters used for the training of convolutional neural networks (CNNs) on a set of aerial images for efficient and automated object recognition. Potential application areas in the transportation field are also highlighted. The accuracy and reliability of CNNs depend on the network’s training and the selection of operational parameters. This paper details the CNN training procedure and parameter selection. The object recognition results show that by selecting a proper set of parameters, a CNN can detect and classify objects with a high level of accuracy (97.5%) and computational efficiency. Furthermore, using a convolutional neural network implemented in the “YOLO” (“You Only Look Once”) platform, objects can be tracked, detected (“seen”), and classified (“comprehended”) from video feeds supplied by UAVs in real-time.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Evaluation of Different Machine Learning Methods and Deep-Learning Convolutional Neural Networks for Landslide Detection

Omid Ghorbanzadeh, +5 more

- 20 Jan 2019 -

Remote Sensing

TL;DR: The CNN method is still in its infancy as most researchers will either use predefined parameters in solutions like Google TensorFlow or will apply different settings in a trial-and-error manner, Nevertheless, deep-learning can improve landslide mapping in the future if the effects of the different designs are better understood, enough training samples exist, and the results of augmentation strategies to artificially increase the number of existing samples are better understanding.

...read moreread less

Journal ArticleDOI

Drone-surveillance for search and rescue in natural disaster

Balmukund Mishra, +3 more

- 15 Apr 2020 -

Computer Communications

TL;DR: An image dataset for human action detection for SAR inspired by the pyramidal feature extraction of SSD for human detection and action recognition is proposed and proposed model achieves 7% higher mAP value when applied to standard Okutama dataset in comparison with the state-of-the-art detection models in literature.

...read moreread less

Journal ArticleDOI

Spatial prediction of groundwater potential mapping based on convolutional neural network (CNN) and support vector regression (SVR)

Mahdi Panahi, +4 more

- 01 Sep 2020 -

Journal of Hydrology

TL;DR: In this article, a machine learning algorithm (MLA) and a deep learning algorithm(DLA) were used to develop groundwater potential maps using support vector regression (SVR) and convolution neural network (CNN) functions, respectively.

...read moreread less

Journal ArticleDOI

Unsupervised Human Detection with an Embedded Vision System on a Fully Autonomous UAV for Search and Rescue Operations

Eleftherios Lygouras, +5 more

- 14 Aug 2019 -

Sensors

TL;DR: Using deep learning techniques, the implemented embedded system was capable of detecting open water swimmers and allowed the UAV to provide assistance accurately in a fully unsupervised manner, thus enhancing first responder operational capabilities.

...read moreread less

Journal ArticleDOI

Convolutional neural networks for object detection in aerial imagery for disaster response and recovery

Yalong Pi, +2 more

- 01 Jan 2020 -

Advanced Engineering Informatics

TL;DR: This research introduces and evaluates a series of convolutional neural network (CNN) models for ground object detection from aerial views of disaster’s aftermath that are capable of recognizing critical ground assets including building roofs (both damaged and undamaged), vehicles, vegetation, debris, and flooded areas.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

...read moreread less

Proceedings ArticleDOI

You Only Look Once: Unified, Real-Time Object Detection

Joseph Redmon, +3 more

TL;DR: Compared to state-of-the-art detection systems, YOLO makes more localization errors but is less likely to predict false positives on background, and outperforms other detection methods, including DPM and R-CNN, when generalizing from natural images to other domains like artwork.

...read moreread less

Journal ArticleDOI

Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition

Kaiming He, +3 more

- 01 Sep 2015 -

IEEE Transactions on Pattern Analysis an...

TL;DR: This work equips the networks with another pooling strategy, "spatial pyramid pooling", to eliminate the above requirement, and develops a new network structure, called SPP-net, which can generate a fixed-length representation regardless of image size/scale.

...read moreread less

Book ChapterDOI

Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition

Kaiming He, +3 more

TL;DR: This work equips the networks with another pooling strategy, “spatial pyramid pooling”, to eliminate the above requirement, and develops a new network structure, called SPP-net, which can generate a fixed-length representation regardless of image size/scale.

...read moreread less

Posted Content

DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition

Jeff Donahue, +6 more

- 06 Oct 2013 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: DeCAF, an open-source implementation of deep convolutional activation features, along with all associated network parameters, are released to enable vision researchers to be able to conduct experimentation with deep representations across a range of visual concept learning paradigms.

...read moreread less

IEEE Transactions on Pattern Analysis an...

Object Recognition in Aerial Images Using Convolutional Neural Networks

Citations

Evaluation of Different Machine Learning Methods and Deep-Learning Convolutional Neural Networks for Landslide Detection

Drone-surveillance for search and rescue in natural disaster

Spatial prediction of groundwater potential mapping based on convolutional neural network (CNN) and support vector regression (SVR)

Unsupervised Human Detection with an Embedded Vision System on a Fully Autonomous UAV for Search and Rescue Operations

Convolutional neural networks for object detection in aerial imagery for disaster response and recovery

References

ImageNet Classification with Deep Convolutional Neural Networks

You Only Look Once: Unified, Real-Time Object Detection

Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition

Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition

DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition

Related Papers (5)

You Only Look Once: Unified, Real-Time Object Detection

Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation

SSD: Single Shot MultiBox Detector

Deep Residual Learning for Image Recognition

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks