Factors of Transferability for a Generic ConvNet Representation

Azizpour, Hossein; Razavian, Ali Sharif; Sullivan, Josephine; Maki, Atsuto; Carlsson, Stefan

Computer Science > Computer Vision and Pattern Recognition

arXiv:1406.5774 (cs)

[Submitted on 22 Jun 2014 (v1), last revised 15 Jul 2015 (this version, v3)]

Title:Factors of Transferability for a Generic ConvNet Representation

Authors:Hossein Azizpour, Ali Sharif Razavian, Josephine Sullivan, Atsuto Maki, Stefan Carlsson

View PDF

Abstract:Evidence is mounting that Convolutional Networks (ConvNets) are the most effective representation learning method for visual recognition tasks. In the common scenario, a ConvNet is trained on a large labeled dataset (source) and the feed-forward units activation of the trained network, at a certain layer of the network, is used as a generic representation of an input image for a task with relatively smaller training set (target). Recent studies have shown this form of representation transfer to be suitable for a wide range of target visual recognition tasks. This paper introduces and investigates several factors affecting the transferability of such representations. It includes parameters for training of the source ConvNet such as its architecture, distribution of the training data, etc. and also the parameters of feature extraction such as layer of the trained ConvNet, dimensionality reduction, etc. Then, by optimizing these factors, we show that significant improvements can be achieved on various (17) visual recognition tasks. We further show that these visual recognition tasks can be categorically ordered based on their distance from the source task such that a correlation between the performance of tasks and their distance from the source task w.r.t. the proposed factors is observed.

Comments:	Extended version of the workshop paper with more experiments and updated text and title. Original CVPR15 DeepVision workshop paper title: "From Generic to Specific Deep Representations for Visual Recognition"
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1406.5774 [cs.CV]
	(or arXiv:1406.5774v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1406.5774

Submission history

From: Hossein Azizpour [view email]
[v1] Sun, 22 Jun 2014 21:57:46 UTC (522 KB)
[v2] Wed, 17 Dec 2014 15:37:50 UTC (1,192 KB)
[v3] Wed, 15 Jul 2015 10:02:19 UTC (1,636 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Factors of Transferability for a Generic ConvNet Representation

Submission history

Access Paper:

Current browse context:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Factors of Transferability for a Generic ConvNet Representation

Submission history

Access Paper:

Current browse context:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators