作者:Jianqi Ma, Weiyuan Shao, Hao Ye, Li Wang, Hong Wang, Yingbin Zheng, Xiangyang Xue

发布时间:2018

发布期刊:Arxiv

**论文全称:**Arbitrary-Oriented Scene Text Detection via Rotation Proposals

论文地址:https://arxiv.org/pdf/1703.01086.pdf

代码:

地位:

一、 摘要

Abstract—This paper introduces a novel rotation-based framework for arbitrary-oriented text detection in natural scene images. We present the Rotation Region Proposal Networks (RRPN), which are designed to generate inclined proposals with text orientation angle information. The angle information is then adapted for bounding box regression to make the proposals more accurately fifit into the text region in terms of the orientation. The Rotation Region-of-Interest (RRoI) pooling layer is proposed to project arbitrary-oriented proposals to a feature map for a text region classififier. The whole framework is built upon a region proposal-based architecture, which ensures the computational effificiency of the arbitrary-oriented text detection compared with previous text detection systems. We conduct experiments using the rotation-based framework on three real-world scene text detection datasets and demonstrate its superiority in terms of effectiveness and effificiency over previous approaches.

二、研究背景

2.1 文本识别的难点

**自然场景图像中的文本进行检测的难点:**光线不均匀、模糊、透视失真、方向多变等复杂情况

2.2 已有的研究方法

三、网络架构

Untitled

  1. 在框架的最前面采用VGG-16卷积层来提取特征,它们由两个兄弟分支共享,即RRPN和最后一个卷积层的特征图的克隆