@gpleiss (Contributor) commented Mar 21, 2017

See issue #97.

It's not super memory-optimized (i.e. there's a concatenation at every layer). This is consistent with the original Torch implementation, and it avoids some gross autograd hacks.
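
For readers skimming later, here is a minimal sketch of the pattern being described (module name and simplifications are mine, not the PR's): each layer concatenates its output onto its input, so a naive implementation performs one torch.cat per layer.

```python
import torch
import torch.nn as nn

class DenseLayer(nn.Module):
    """Sketch of a dense layer: BN -> ReLU -> 3x3 conv, with the input
    concatenated onto the output. A hypothetical simplification, not
    the exact module in this PR."""
    def __init__(self, in_channels, growth_rate):
        super(DenseLayer, self).__init__()
        self.norm = nn.BatchNorm2d(in_channels)
        self.relu = nn.ReLU(inplace=True)
        self.conv = nn.Conv2d(in_channels, growth_rate,
                              kernel_size=3, padding=1, bias=False)

    def forward(self, x):
        new_features = self.conv(self.relu(self.norm(x)))
        # One torch.cat per layer: simple and autograd-friendly, but
        # every concatenated intermediate tensor stays alive in memory.
        return torch.cat([x, new_features], 1)
```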

Pretrained models for the model zoo are available here. They were ported from the original LuaTorch implementation.

@soumith (Contributor) commented Mar 21, 2017

Awesome. Do the DenseNet pretrained models expect the same normalization as the rest of the models?

I'm going to upload your models to the pytorch s3 bucket so that you can make them available via pretrained=True.

@soumith (Contributor) commented Mar 21, 2017

You will have to rename the pretrained model files as described here:
http://pytorch.org/docs/model_zoo.html, i.e. filename-<sha256>.ext
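
A hedged sketch of that renaming step (assuming the convention of suffixing the first 8 hex digits of the file's SHA-256, as in densenet121-1e136e00.pth below; the helper name is mine):

```python
import hashlib
import shutil

def rename_for_model_zoo(path):
    """Rename e.g. 'densenet121.pth' to 'densenet121-<hash>.pth', where
    <hash> is the first 8 hex digits of the file's SHA-256 (the suffix
    the model zoo checks the download against)."""
    with open(path, 'rb') as f:
        digest = hashlib.sha256(f.read()).hexdigest()
    base, ext = path.rsplit('.', 1)
    new_path = '{}-{}.{}'.format(base, digest[:8], ext)
    shutil.move(path, new_path)
    return new_path
```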



@gpleiss (Contributor, Author) commented Mar 21, 2017

@soumith - yeah, same normalization as the other ImageNet models. I'll rename the files.
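
(For the record, that is the standard torchvision ImageNet preprocessing; note that transforms.Resize was still called transforms.Scale at the time.)

```python
from torchvision import transforms

# Standard ImageNet normalization shared by the torchvision pretrained models.
preprocess = transforms.Compose([
    transforms.Resize(256),
    transforms.CenterCrop(224),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406],
                         std=[0.229, 0.224, 0.225]),
])
```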

@gpleiss (Contributor, Author) commented Mar 21, 2017

@fmasa no lint errors anymore. Sorry for not reading the contributing guidelines earlier!
@soumith just renamed the files: https://drive.google.com/drive/folders/0B0Y2k_mEJpY9R3dSSGQ0YXhfa2c?usp=sharing

@soumith (Contributor) commented Mar 21, 2017

They are now uploaded to the bucket and available via URLs:

https://download.pytorch.org/models/densenet121-1e136e00.pth
https://download.pytorch.org/models/densenet*
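
(These can be fetched and verified with the model zoo helper, e.g.:)

```python
from torch.utils import model_zoo

# Downloads to the local model cache and checks the sha256 suffix.
state_dict = model_zoo.load_url(
    'https://download.pytorch.org/models/densenet121-1e136e00.pth')
```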

@fmasa commented Mar 22, 2017

@gpleiss I think you meant @fmassa 😁

@gpleiss (Contributor, Author) commented Mar 22, 2017

@soumith Sorry, the files I linked to earlier were serialized versions of the full models, not the model state dicts.

Here's a link to the correct files: https://drive.google.com/drive/folders/0B0Y2k_mEJpY9NXFBa1ktRUo3YlU?usp=sharing
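
(The distinction, sketched with hypothetical filenames:)

```python
import torch
from torchvision.models import densenet121

model = densenet121()

# What was linked at first: the whole pickled module object.
torch.save(model, 'densenet121_full.pth')

# What the model zoo expects: just the parameters, i.e. the state dict.
torch.save(model.state_dict(), 'densenet121.pth')

# Loading a state dict means constructing the architecture first.
model = densenet121()
model.load_state_dict(torch.load('densenet121.pth'))
```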

@soumith (Contributor) commented Mar 23, 2017

@gpleiss the new files have been uploaded to the same place i.e. https://download.pytorch.org/models/

@gpleiss (Contributor, Author) commented Mar 23, 2017

pretrained=True is now ready.
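
(Usage, for anyone arriving from search:)

```python
from torchvision import models

# Downloads the pretrained weights on first use, then loads them.
model = models.densenet121(pretrained=True)
model.eval()
```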

soumith merged commit 831ba8c into pytorch:master on Mar 23, 2017
@soumith (Contributor) commented Mar 23, 2017

Thank you! This is good stuff.

@trypag commented Mar 26, 2017

It's weird, I have never seen a DenseNet using this configuration for the first convolution. Checking other implementations from the authors, they used a 3x3 conv with stride=1, padding=1.
Is this design change your choice, or am I missing something?
To be clear, I am referring to this line: https://github.com/pytorch/vision/blob/master/torchvision/models/densenet.py#L128

@gpleiss (Contributor, Author) commented Mar 26, 2017

@trypag the CIFAR models use a 3x3 convolution for the first layer, but the ImageNet models use a 7x7 convolution. The authors' implementation covers only the CIFAR models; however, if you download their pretrained ImageNet models, the first layer is a 7x7 convolution.
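
(The two stems side by side, sketched; num_init_features follows the constructor default of 64:)

```python
import torch.nn as nn

num_init_features = 64

# ImageNet stem, as in this PR: 7x7 conv, stride 2 (followed by a max pool).
imagenet_stem = nn.Conv2d(3, num_init_features, kernel_size=7,
                          stride=2, padding=3, bias=False)

# CIFAR stem, as in the authors' CIFAR code: 3x3 conv, stride 1.
cifar_stem = nn.Conv2d(3, num_init_features, kernel_size=3,
                       stride=1, padding=1, bias=False)
```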

@trypag commented Mar 27, 2017

Alright thanks @gpleiss !

@farleylai commented Dec 6, 2017

The authors address the memory efficiency in a follow-up paper and an updated repo, using shared memory buffers and re-computation on the backward pass. The current PyTorch code just calls torch.cat() directly. Any plan to formalize the technique? DenseNet-based CNNs are likely to prosper in other applications.
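
(A hedged sketch of that idea in PyTorch terms, using torch.utils.checkpoint rather than the authors' shared-memory buffers; the helper name and attribute names are mine:)

```python
import torch
from torch.utils.checkpoint import checkpoint

def bn_function(norm, relu, conv, *inputs):
    """Concat + BN + ReLU + 1x1 conv; cheap to recompute on backward."""
    return conv(relu(norm(torch.cat(inputs, 1))))

# Inside a dense layer's forward: free the concatenated intermediates
# after the forward pass and recompute them during the backward pass.
# out = checkpoint(bn_function, self.norm1, self.relu1, self.conv1,
#                  *prev_features)
```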
