FGVCLib is an open-source and well documented library for Fine-grained Visual Classification. It is based on Pytorch with performance and friendly API. Our code is pythonic, and the design is consistent with torchvision. You can easily develop new algorithms, or readily apply existing algorithms. The branch works with torch 1.12.1, torchvision 0.13.1.
Major features
-
Modular Design
We decompose the detection framework into different components and one can easily construct a customized object detection framework by combining different modules.
-
State of the art We implement state-of-the-art methods by the FGVCLib, PMG, MCL, API-Net, CAL, TransFG, PIM.
Please refer to Installation for installation instructions.
Please see get_started.md for the basic usage of FGVCLib. We provide the tutorials for:
Fine-grained Visual Classification | Other |
|
Backbones | Encoders | Heads | Necks | Sotas |
|
|
|
|
|
We used fgvclib to replicate the state-of-the-art model, and the following table shows the results of our experiment.
SOTA | Result of the paper | Result of the official code | Result of the FGVCLib |
---|---|---|---|
API-Net | 88.1 | 87.2 | 86.8 |
CAL | 90.6 | 89.6 | 89.5 |
TransFG | 91.7 | 91.1 | 89.3 |
PIM | 92.8 | 91.9 | 91.4 |
Thanks for your attention! If you have any suggestion or question, you can leave a message here or contact us directly:
In addition to fgvclib, we have developed a practical mini program on fine-grained visual classification, which can be accessed by scanning the two-dimensional code below.