This repository is dedicated to the NLP part of the DEEP project. The code is tightly coupled with AWS Sagemaker.
You can access the full documentation here (if you are already browsing the documentation on the web, it links to the same page you are at now)
Contact Stefano to get the AWS credentials, install the AWS CLI
Clone the repo and pull the data
git clone <deep_experiments_repo>
cd deep-experiments
Create a new conda environment:
conda create -n deepl python=3.9.1
Install necessary libraries:
make dev-install
Pull the data:
dvc pull
Ask Stefano for a AWS user account and a new Notebook instance on AWS.
The notebook instance will be created in the region us-east-1
, switch to this
region if you don't find your instance.
The notebook instance comes with the repo already cloned.
Once it is ready, start the instance and click on Open Jupyter.
Open the Jupyter terminal and cd
to the deep-experiments
repo. It should be:
cd SageMaker/deep-experiments
Run:
make cloud-install
(This must be run everytime the instance is activated)
Pull the data:
dvc pull
data
contains the datadeep
contains the codedocker
contains the Dockerfile used to build the test environment.notebooks
contains all the Jupyter Notebook, divided by category and person working on themscripts
contains the training scripts necessary for Sagemakerrequirements
contains all the Python requirements for the different configurations