-
Notifications
You must be signed in to change notification settings - Fork 1
4. Advanced usage
This page provides an example of a common advanced usage of sgRNAble:
- Genome-wide sgRNA library design for genome-wide screening in a user-defined host.
This advanced usage requires the identification of all open reading frames (ORFs) in the host genome. This will be accomplished here using the Prodigal protein-coding gene prediction tool1 (see GitHub repo here). Installation instruction can be found through the GitHub repo, or Prodigal can be installed in an Anaconda enviroment using the commands below:
conda create --name Prodigal
conda activate Prodigal
conda install -c bioconda prodigal
conda deactivate
The following example describes the usage of sgRNAble to design a library of sgRNAs targeting all chromosomal genes in an E. coli MG1655 host.
Navigate to a working directory in which sgRNAble can be run. Place input files in sub-directory named /data
(this directory structure is not necessary and relative or absolute paths can be used to select input or direct output anywhere on machine).
This example requires the installation of Progidal, as described above, and the following file that be downloaded from the sgRNAble GitHub repository in the sgRNAble/tests/data/
directory:
-
ecoli_genome.fasta
- host genome
If Prodigal was installed following the instructions above, activate the Prodigal
Anaconda environment using:
conda activate Prodigal
The following command will identify protein coding ORFs in the E. coli MG1655 host genome (ecoli_genome.fasta
) and output a .fasta
file (ecoli_target_ORFs.fasta
) containing the nucleotide sequences of all the open-reading frames predicted by Prodigal. This will serve as the TARGET_SEQUENCE
input file for sgRNAble.
prodigal -i ./data/ecoli_genome.fasta -d ./data/ecoli_target_ORFs.fasta
Check you /data
directory for the ecoli_target_ORFs.fasta
output file.
The following command will identify and rank 10 sgRNAs targeting each predicted ORF in the E. coli MG1655 host and provide output in a directory named ecoli_library_output
. sgRNAble will use 4 threads and a miximum of 2GB of RAM.
sgrnable -t ./data/ecoli_target_ORFs.fasta \
-g ./data/ecoli_genome.fasta \
-o ./ecoli_library_output \
-th 4 \
-m 2
Optional arguments can be used to specify additional sgRNA design criteria.
- Hyatt, D., Chen, GL., LoCascio, P.F. et al. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics 11, 119 (2010). https://doi.org/10.1186/1471-2105-11-119