Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
metafile.yml		metafile.yml
seresnet101_8xb32_in1k.py		seresnet101_8xb32_in1k.py
seresnet50_8xb32_in1k.py		seresnet50_8xb32_in1k.py
seresnext101-32x4d_8xb32_in1k.py		seresnext101-32x4d_8xb32_in1k.py
seresnext50-32x4d_8xb32_in1k.py		seresnext50-32x4d_8xb32_in1k.py

README.md

SEResNet

Squeeze-and-Excitation Networks

Abstract

The central building block of convolutional neural networks (CNNs) is the convolution operator, which enables networks to construct informative features by fusing both spatial and channel-wise information within local receptive fields at each layer. A broad range of prior research has investigated the spatial component of this relationship, seeking to strengthen the representational power of a CNN by enhancing the quality of spatial encodings throughout its feature hierarchy. In this work, we focus instead on the channel relationship and propose a novel architectural unit, which we term the "Squeeze-and-Excitation" (SE) block, that adaptively recalibrates channel-wise feature responses by explicitly modelling interdependencies between channels. We show that these blocks can be stacked together to form SENet architectures that generalise extremely effectively across different datasets. We further demonstrate that SE blocks bring significant improvements in performance for existing state-of-the-art CNNs at slight additional computational cost. Squeeze-and-Excitation Networks formed the foundation of our ILSVRC 2017 classification submission which won first place and reduced the top-5 error to 2.251%, surpassing the winning entry of 2016 by a relative improvement of ~25%.

How to use it?

Predict image

from mmpretrain import inference_model

predict = inference_model('seresnet50_8xb32_in1k', 'demo/bird.JPEG')
print(predict['pred_class'])
print(predict['pred_score'])

Use the model

import torch
from mmpretrain import get_model

model = get_model('seresnet50_8xb32_in1k', pretrained=True)
inputs = torch.rand(1, 3, 224, 224)
out = model(inputs)
print(type(out))
# To extract features.
feats = model.extract_feat(inputs)
print(type(feats))

Train/Test Command

Prepare your dataset according to the docs.

Train:

python tools/train.py configs/seresnet/seresnet50_8xb32_in1k.py

Test:

python tools/test.py configs/seresnet/seresnet50_8xb32_in1k.py https://download.openmmlab.com/mmclassification/v0/se-resnet/se-resnet50_batch256_imagenet_20200804-ae206104.pth

Models and results

Image Classification on ImageNet-1k

Model	Pretrain	Params (M)	Flops (G)	Top-1 (%)	Top-5 (%)	Config	Download
`seresnet50_8xb32_in1k`	From scratch	28.09	4.13	77.74	93.84	config	model \| log
`seresnet101_8xb32_in1k`	From scratch	49.33	7.86	78.26	94.07	config	model \| log

Citation

@inproceedings{hu2018squeeze,
  title={Squeeze-and-excitation networks},
  author={Hu, Jie and Shen, Li and Sun, Gang},
  booktitle={Proceedings of the IEEE conference on computer vision and pattern recognition},
  pages={7132--7141},
  year={2018}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

seresnet

seresnet

README.md

SEResNet

Abstract

How to use it?

Models and results

Image Classification on ImageNet-1k

Citation

Files

seresnet

Directory actions

More options

Directory actions

More options

Latest commit

History

seresnet

Folders and files

parent directory

README.md

SEResNet

Abstract

How to use it?

Models and results

Image Classification on ImageNet-1k

Citation