LAST UPDATED: 2017-03-25
scripts
:.pbs
scripts for submitting jobs on QUESTlib
: library functionspipeline
: main code for simulation, regression, and analysis
PBS scripts for running grouping exhaustive regression of receptors and single amino acid variants. Make sure Quest is setup as follows (starred files for shuffling only):
Pipeline was written specifically for running on Northwestern's high performance computing core QUEST.
The working directory on QUEST contains the following:
`~/Matlab/`
logs/ -- standard out and error logs
results/ -- results stored here
Vectorizations.mat
Data_R_*.mat
Data_P_*.mat
Data_Shuffle.mat
*.m (all Matlab files)
*.pbs (all submission scripts)
run_regress_P.pbs
By Property regression. There are 255 vectorizations, completed in sets of 25, for 9 different responses. Run using:
msub -t runP[1-11] run_regress_P.pbs
run_regress_R.pbs
By Residue regression. There are 8191 vectorizations, completed in sets of 25, for 9 different responses. Run using:
msub -t runR[1-328] run_regress_R.pbs
run_merge_R.pbs
/ run_merge_P.pbs
Merges results into a single Matlab file. Run only after previous jobs are completed.
msub -t mergeP[1-7] run_merge_P.pbs
msub -t mergeR[1-7] run_merge_R.pbs
run_shuffle_P.pbs
By Property shuffled regression. There are 255 vectorizations, completed in sets of 25, for 51 reps. Run using:
msub -t shuffleP[1-11] run_shuffle_P.pbs
run_shuffle_R.pbs
By Residue shuffled regression. There are 8191 vectorizations, completed in sets of 25, for 51 reps. Run using:
msub -t shuffleR[1-328] run_shuffle_R.pbs