Run h2oGPT using Docker

Make sure Docker and NVIDIA Containers are set up correctly by following the instructions here.

Specify the required model with the HF_MODEL environment variable (passed to the container with -e). All open-source models are posted on 🤗 H2O.ai's Hugging Face page.

docker run \
  --runtime=nvidia --shm-size=64g \
  -e HF_MODEL=h2oai/h2ogpt-gm-oasst1-en-2048-open-llama-7b \
  -p 8888:8888 -p 7860:7860 \
  --rm --init \
  -v "$(pwd)/h2ogpt_env:/h2ogpt_env" \
  gcr.io/vorvan/h2oai/h2ogpt-runtime:0.1.0
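
To try a different model or a different host port without retyping the whole command, the flags above can be wrapped in a small shell function. This is only a sketch: h2ogpt_cmd and H2OGPT_RUN are hypothetical names, not part of h2oGPT, and remapping the host side of the 7860 port is an assumption based on standard Docker port publishing. By default the function only prints the command so it can be inspected first.

```shell
#!/usr/bin/env bash
# h2ogpt_cmd is a hypothetical helper, not part of h2oGPT.
# Usage: h2ogpt_cmd [model] [host_ui_port]
h2ogpt_cmd() {
  # Defaults mirror the docker run command shown above.
  local model="${1:-h2oai/h2ogpt-gm-oasst1-en-2048-open-llama-7b}"
  local ui_port="${2:-7860}"
  local cmd=(
    docker run
    --runtime=nvidia --shm-size=64g
    -e "HF_MODEL=${model}"
    -p 8888:8888 -p "${ui_port}:7860"
    --rm --init
    -v "$(pwd)/h2ogpt_env:/h2ogpt_env"
    gcr.io/vorvan/h2oai/h2ogpt-runtime:0.1.0
  )
  # Print the command for inspection; set H2OGPT_RUN=1 to execute it instead.
  if [ "${H2OGPT_RUN:-0}" = "1" ]; then
    "${cmd[@]}"
  else
    echo "${cmd[@]}"
  fi
}

h2ogpt_cmd
```

Pass a model name from the Hugging Face page as the first argument to swap models; the second argument changes only the host port, so the UI would then be at that port on localhost.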

Once the container is running, open http://localhost:7860/ in a browser and start using h2oGPT.
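
The page may not respond right away, since the container may need time to fetch and load the model on first start. A minimal sketch for polling the address until it answers is shown here; wait_for_ui is a hypothetical helper, not part of h2oGPT, and it assumes curl is installed on the host.

```shell
#!/usr/bin/env bash
# wait_for_ui is a hypothetical helper, not part of h2oGPT.
# Usage: wait_for_ui [url] [attempts] [delay_seconds]
wait_for_ui() {
  local url="${1:-http://localhost:7860/}"
  local attempts="${2:-60}"
  local delay="${3:-5}"
  local i
  for ((i = 1; i <= attempts; i++)); do
    # --fail makes curl exit non-zero on HTTP error responses (4xx/5xx).
    if curl --silent --fail --output /dev/null --max-time 5 "$url"; then
      echo "UI is reachable at $url"
      return 0
    fi
    sleep "$delay"
  done
  echo "UI did not respond at $url after $attempts attempts" >&2
  return 1
}
```

After starting the container in another terminal, run wait_for_ui with no arguments to poll http://localhost:7860/ every 5 seconds for up to 60 attempts.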

To run h2oGPT with a custom entrypoint, refer here.