Installation

This codebase is tested on Ubuntu 20.04.2 LTS with python 3.8. Follow the below steps to create environment and install dependencies.

Setup conda environment (recommended).

# Create a conda environment
conda create -y -n pomp python=3.8

# Activate the environment
conda activate pomp

# Install torch and torchvision
# Please refer to https://pytorch.org/ if you need a different cuda version
pip install torch==1.9.0+cu111 torchvision==0.10.0+cu111 torchaudio==0.9.0 -f https://download.pytorch.org/whl/torch_stable.html

Clone POMP code repository and install requirements

# Clone clip-openness code base
git clone https://github.com/amazon-science/prompt-pretraining.git

cd prompt-pretraining/

# Install requirements
pip install -r requirements.txt

# Update setuptools package 
pip install setuptools==59.5.0

Install dassl library.

cd third_party/Dassl.pytorch/

# Install dependencies
pip install -r requirements.txt

# Install this library (no need to re-build if the source code is modified)
python setup.py develop
cd ../..

To apply the pre-trained POMP prompt to semantic segmentation and object detection, please refer to ZSSeg and Detic codebases, respectively.