Skip to content

DAWells/cdhit_clstr

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 

Repository files navigation

CDhit .clstr

After clustering CDhit produces a .clstr file detailing which sequences went into which clusters. However, it is not in a machine friendly format. This script converts it to csv while preserving sequence length, and clustering identity.

Can be called as a command line tool. Takes 2 arguments, the cd-hit .clstr file and the output path. code/analysis/clstr_to_csv.py data/raw/eg.clstr data/processed/eg.csv

About

Reformat CDhit's `.clstr` files for easy use

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages