Skip to content

Latest commit

 

History

History
53 lines (36 loc) · 1.7 KB

00_getting_started.md

File metadata and controls

53 lines (36 loc) · 1.7 KB

Getting Started

dSPRINT can be run on a any x64 Linux machine (either a local desktop/laptop or on a cluster). The only requirements are a 64 bit Anaconda installation, and plenty of disk space (around 400G in a typical case).

Install Anaconda

If you don't already have Anaconda installed, use the instructions at https://www.anaconda.com/products/individual to download and install Anaconda for your platform. (Note: The lite version of Anaconda, miniconda will work just fine).

On a cluster environment, you may already have Anaconda available by way of Environment Modules. If you run:

module avail anaconda

or

module avail conda

and you see one or more entries returned, you can execute module load anaconda or module load conda to bring the conda command in your PATH. If this is the first time you're using conda, you will also want to execute conda init, which is a required step for recent versions of conda.

Set up environment

After cloning the code in this repository:

git clone https://github.com/vineetbansal/dsprint-pipeline.git

cd to the folder where you clone the repository (this folder will have the file environment.yml and config.json, among others), and create a new conda environment where you can run the dSPRINT pipeline:

cd dsprint-pipeline
conda env create -f environment.yml

Activate the newly created environment, which is called dsprint3:

conda activate dsprint3

Proceed to the Downloading Data tutorial.