Shuyang Sun*, Runjia Li*, Philip Torr, Xiuye Gu, Siyang Li
[arXiv] [Project] [Code] [Demo]
The code has been fully released by Google Research.
- Anaconda 3
- PyTorch ≥ 1.7 and a torchvision version that matches the PyTorch installation. Install them together at pytorch.org to ensure compatibility.
conda env create --name ENV_NAME --file=env.yml
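After creating and activating the environment, a quick sanity check (a minimal sketch, not part of the repo) is to confirm the PyTorch and torchvision versions and CUDA visibility from Python:

```python
import torch
import torchvision

# Verify the PyTorch >= 1.7 requirement and that torchvision matches it.
print("torch:", torch.__version__)
print("torchvision:", torchvision.__version__)
print("CUDA available:", torch.cuda.is_available())
```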
We have set up an online demo. Currently, the web demo does not support SAM, since it runs on a CPU-only server. You can check it out here.
If you want to test an image locally, you can simply run:
python3 demo.py --cfg-path=YOUR_CFG_PATH --output_path=SAVE_PATH
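If you prefer to drive the demo from Python, a minimal sketch is to invoke the same CLI via `subprocess`; the config and output paths below are placeholders, not files shipped with the repo:

```python
import subprocess

# Placeholder paths -- substitute your own config and output locations.
cfg_path = "path/to/your_config.yaml"   # hypothetical config path
save_path = "outputs/demo_result.png"   # hypothetical output path

# Mirrors: python3 demo.py --cfg-path=YOUR_CFG_PATH --output_path=SAVE_PATH
subprocess.run(
    ["python3", "demo.py", f"--cfg-path={cfg_path}", f"--output_path={save_path}"],
    check=True,
)
```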
- Data preparation: See Preparing Datasets for CaR
- Evaluate:
python3 evaluate.py --cfg-path=CFG_PATH
You can find configs for each dataset under the `configs/` directory.
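To sweep the evaluation over every dataset config, one option (a sketch assuming the configs are YAML files; adjust the glob if the repo uses a different extension) is:

```python
import pathlib
import subprocess

# Assumed: dataset configs live under configs/ as .yaml files.
for cfg in sorted(pathlib.Path("configs").rglob("*.yaml")):
    print(f"Evaluating with {cfg} ...")
    subprocess.run(["python3", "evaluate.py", f"--cfg-path={cfg}"], check=True)
```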
@inproceedings{clip_as_rnn,
title = {CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor},
author = {Sun, Shuyang and Li, Runjia and Torr, Philip and Gu, Xiuye and Li, Siyang},
year = {2024},
booktitle = {CVPR},
}