Arbitrary-Oriented Scene Text Detection via Rotation Proposals

doi:10.1109/TMM.2018.2818020

Open AccessJournal ArticleDOI

Arbitrary-Oriented Scene Text Detection via Rotation Proposals

Jianqi Ma, +6 more

- 23 Mar 2018 -

IEEE Transactions on Multimedia

- Vol. 20, Iss: 11, pp 3111-3122

TLDR

The Rotation Region Proposal Networks are designed to generate inclined proposals with text orientation angle information that are adapted for bounding box regression to make the proposals more accurately fit into the text region in terms of the orientation.

Abstract:

This paper introduces a novel rotation-based framework for arbitrary-oriented text detection in natural scene images. We present the Rotation Region Proposal Networks , which are designed to generate inclined proposals with text orientation angle information. The angle information is then adapted for bounding box regression to make the proposals more accurately fit into the text region in terms of the orientation. The Rotation Region-of-Interest pooling layer is proposed to project arbitrary-oriented proposals to a feature map for a text region classifier. The whole framework is built upon a region-proposal-based architecture, which ensures the computational efficiency of the arbitrary-oriented text detection compared with previous text detection systems. We conduct experiments using the rotation-based framework on three real-world scene text detection datasets and demonstrate its superiority in terms of effectiveness and efficiency over previous approaches.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Deep Learning for Generic Object Detection: A Survey

Li Liu, +7 more

- 01 Feb 2020 -

International Journal of Computer Vision

TL;DR: A comprehensive survey of the recent achievements in this field brought about by deep learning techniques, covering many aspects of generic object detection: detection frameworks, object feature representation, object proposal generation, context modeling, training strategies, and evaluation metrics.

...read moreread less

Posted Content

Object Detection in 20 Years: A Survey

Zhengxia Zou, +3 more

- 13 May 2019 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: This paper extensively reviews 400+ papers of object detection in the light of its technical evolution, spanning over a quarter-century's time (from the 1990s to 2019), and makes an in-deep analysis of their challenges as well as technical improvements in recent years.

...read moreread less

Journal ArticleDOI

A Survey of Deep Learning-Based Object Detection

Licheng Jiao, +6 more

- 05 Sep 2019 -

IEEE Access

TL;DR: This survey provides a comprehensive overview of a variety of object detection methods in a systematic manner, covering the one-stage and two-stage detectors, and lists the traditional and new applications.

...read moreread less

Proceedings ArticleDOI

Learning RoI Transformer for Oriented Object Detection in Aerial Images

Jian Ding, +4 more

TL;DR: The core idea of RoI Transformer is to apply spatial transformations on RoIs and learn the transformation parameters under the supervision of oriented bounding box (OBB) annotations.

...read moreread less

Proceedings ArticleDOI

SCRDet: Towards More Robust Detection for Small, Cluttered and Rotated Objects

Xue Yang, +7 more

TL;DR: A sampling fusion network is devised which fuses multi-layer feature with effective anchor sampling, to improve the sensitivity to small objects, and the IoU constant factor is added to the smooth L1 loss to address the boundary problem for the rotating bounding box.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.

...read moreread less

Proceedings ArticleDOI

You Only Look Once: Unified, Real-Time Object Detection

Joseph Redmon, +3 more

TL;DR: Compared to state-of-the-art detection systems, YOLO makes more localization errors but is less likely to predict false positives on background, and outperforms other detection methods, including DPM and R-CNN, when generalizing from natural images to other domains like artwork.

...read moreread less

Proceedings ArticleDOI

Fast R-CNN

Ross Girshick

TL;DR: Fast R-CNN as discussed by the authors proposes a Fast Region-based Convolutional Network method for object detection, which employs several innovations to improve training and testing speed while also increasing detection accuracy and achieves a higher mAP on PASCAL VOC 2012.

...read moreread less

Proceedings Article

Faster R-CNN: towards real-time object detection with region proposal networks

Shaoqing Ren, +3 more

TL;DR: Ren et al. as discussed by the authors proposed a region proposal network (RPN) that shares full-image convolutional features with the detection network, thus enabling nearly cost-free region proposals.

...read moreread less

Proceedings Article

Spatial transformer networks

Max Jaderberg, +3 more

TL;DR: This work introduces a new learnable module, the Spatial Transformer, which explicitly allows the spatial manipulation of data within the network, and can be inserted into existing convolutional architectures, giving neural networks the ability to actively spatially transform feature maps.

...read moreread less

Collapse

Arbitrary-Oriented Scene Text Detection via Rotation Proposals

Citations

Deep Learning for Generic Object Detection: A Survey

Object Detection in 20 Years: A Survey

A Survey of Deep Learning-Based Object Detection

Learning RoI Transformer for Oriented Object Detection in Aerial Images

SCRDet: Towards More Robust Detection for Small, Cluttered and Rotated Objects

References

Very Deep Convolutional Networks for Large-Scale Image Recognition

You Only Look Once: Unified, Real-Time Object Detection

Fast R-CNN

Faster R-CNN: towards real-time object detection with region proposal networks

Spatial transformer networks

Related Papers (5)

SSD: Single Shot MultiBox Detector

Deep Residual Learning for Image Recognition

Feature Pyramid Networks for Object Detection

Faster R-CNN: towards real-time object detection with region proposal networks

You Only Look Once: Unified, Real-Time Object Detection