validate.py
gets perplexity. You can use the scriptvalidate.sh
which contains some arguments I used.train.py
trains a grover model from scratch. The scripttrain_tpu_adafactor.sh
could also help. You probably don't want to do this unless you have a lot of money for TPUs. However, it might be a good idea to finetune Grover to a different domain.
During training, you can use tensorboard by running the following commands:
ssh -L 6006:localhost:6006 myservername
tensorboard --logdir="grover":"gs://MYOUTPUTPATH" --port=6006