作者：Jingru Yi, Pengxiang Wu, Bo Liu, Qiaoying Huang, Hui Qu, Dimitris Metaxas

发布时间：2020

发布期刊：Arixv

论文全称：Oriented Object Detection in Aerial Images with Box Boundary-Aware Vectors

论文地址：https://paperswithcode.com/paper/oriented-object-detection-in-aerial-images

代码：https://github.com/yijingru/BBAVectors-Oriented-Object-Detection

一、摘要

Oriented object detection in aerial images is a challenging task as the objects in aerial images are displayed in arbitrary directions and are usually densely packed. Current oriented object detection methods mainly rely on two stage anchor-based detectors. However, the anchor-based detectors typically suffer from a severe imbalance issue between the positive and negative anchor boxes. To address this issue, in this work we extend the horizontal keypoint based object detector to the oriented object detection task. In particular, we fifirst detect the center keypoints of the objects, based on which we then regress the box boundary aware vectors (BBAVectors) to capture the oriented bounding boxes. The box boundary-aware vectors are distributed in the four quadrants of a Cartesian coordinate system for all arbitrarily oriented objects. To relieve the diffificulty of learning the vectors in the corner cases, we further classify the oriented bounding boxes into horizontal and rotational bounding boxes. In the experiment, we show that learning the box boundary-aware vectors is superior to directly predicting the width, height, and angle of an oriented bounding box, as adopted in the baseline method. Besides, the proposed method competes favorably with state-of-the-art methods. Code is available at https:// github.com/yijingru/BBAVectors-Oriented-Object-Detection.

航空图像中的定向目标检测是一项具有挑战性的任务，因为航空图像中的目标是以任意的方向显示的，并且通常是密集排列的
将基于水平关键点的目标检测器扩展到定向目标检测任务
BBAVectors-Oriented-Object-Detection（本文算法）
- 首先检测对象的中心关键点，然后在此基础上回归box boundary-aware vectors(BBAVectors)来捕获定向边界框(OBB)
- 为了缓解在拐角情况下对向量的学习困难，作者进一步将定向边界框分为水平边界框和旋转边界框。

二、Introduction

目前已有的用于遥感图像的方法主要是基于anchor的两阶段检测器
- 在第一阶段，这些检测器在特征图上密集地分布anchor，然后对目标盒和anchor参数之间offset进行回归，以提供候选区域。
- 在第二阶段，将感兴趣区域(ROI)特征集中起来，以细化方框参数并对对象类别进行分类。
这些方法（例如R²CNN，ROI Transformer，R²PN，R-DFPN，ICN）通常使用中心点、宽度、高度和角度来定义定向边界框（OBB）
- 上述的方法都存在着明显的缺点：
  1. anchor的设计比较复杂
  2. anchor 的横纵比和大小的选择需要仔细的调整
  3. 正样本anchor和负样本anchor之间的极度不平衡会降低训练速度和导致次优
  4. 第二阶段的裁剪和回归策略会造成很大的计算成本
目前，为了克服上述anchor的缺点，基于关键点检测的算法被开发出来，这些方法检测bbox（bounding box）的角点，然后通过比较这些点的嵌入距离或中心距离来对这些点进行分组，这种方法可以很好的提高检测性能，但是有一个缺点：分组步骤非常的耗时。
- CenterNet直接检测目标中心，并回归边界盒的宽度(w)和高度(h)，在相当的精度下实现更快的速度，CenterNet可以通过学习一个额外的角度θ来扩展到任意方向检测任务（遥感图像），如下图(a)所示，回归中心点，宽w，高h和角度θ
- 本文算法（由CenterNet扩展），虽然本文算法由CenterNet扩展，但是不需要回归中心点，宽w，高h和角度θ，而是学习盒子边界感知向量（box bounday-aware vectors, BBAVectors）来捕获对象的旋转边界框，如下图(b)所示，t，r，l，b分别表示上、右、下和左的方框边界感知向量（上右下左向量分别位于笛卡尔积的第二、一、四、三象限，即对于所有任意方向的对象，在笛卡尔坐标系的四个象限中定义了其盒子边界感知向量），图(c)说明了向量非常接近xy轴的情况**，这种情况模型很难区分向量类型，即向量是属于上右下左的那种类型，为了解决这个问题，本文将边界框分为水平边界框（HBB）和旋转边界框（RBB），并对他们分别处理**

Untitled

本文贡献主要有：
- 提出了盒子边界感知向量(BBAVectors)来描述OBB。这个策略既简单又有效。对于所有任意定向的物体，在同一笛卡尔坐标系中测量BBAVectors。与中心点，宽度、高度和角度的基线方法（baseline methodd）相比，BBAVector获得了更好的性能
- 将基于中心关键点的对象检测器（CenterNet）扩展到面向目标检测任务。该模型是单阶段、anchor-free，快速、准确。它在DOTA和HRSC2016数据集上实现了最先进的性能。
如上图所示，该网络的是建立在U型架构上的，使用ResNet101 Conv1-5作为骨干网络，在主干网络的顶部，对特征图进行上采样，并输出一个比输入图像小4倍的特征图，在上采样过程中，通过跳层连接将深层与浅层结合起来，以共享高级语义信息和低级更精细的细节，即首先通过双线性插值将深层变到与浅层相同大小，上采样的特征图通过3×3卷积层进行细化，然后将细化后的特征图与浅层连接起来，然后是一个1×1的卷积层来细化通道级的特征。在潜在层中使用了批处理归一化（BN）和ReLU激活

三、Box Boundary-Aware Vectors（BBAVectors）

Untitled