论文全称:Going deeper with convolutions

作者:Christian Szegedy,Wei Liu,Yangqing Jia,Pierre Sermane,Scott Reed,Dragomir Anguelov,Dumitru Erhan,Vincent Vanhoucke,Andrew Rabinovich

论文地址:https://arxiv.org/abs/1409.4842

单位:

发表机构:Arxiv

**发表年份:**2014

一、摘要

We propose a deep convolutional neural network architecture codenamed Inception, which was responsible for setting the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14). The main hallmark of this architecture is the improved utilization of the computing resources inside the network. This was achieved by a carefully crafted design that allows for increasing the depth and width of the network while keeping the computational budget constant. To optimize quality, the architectural decisions were based on the Hebbian principle and the intuition of multi-scale processing. One particular incarnation used in our submission for ILSVRC14 is called GoogLeNet, a 22 layers deep network, the quality of which is assessed in the context of classification and detection.

  1. 设计出Inception v1,并使用该网络参加 ImageNet Large-Scale Visual Recognition Challenge 2014(ILSVRC2014)的分类和检测比赛,并取得了冠军
  2. Inception模块在增加网络深度和宽度的同时减少参数量和计算量
  3. 本文用Inception模块构建的网络称为GoogLeNet,共22层

二、Introduction