Indexing a 2.6GB Plant Genome Using STAR (genomeGenerate Mode) Terminated Due to RAM Limit — Can the Genome Be Split for Indexing, or Are There Other Solutions? #2235

Vijithkumar2020 · 2024-11-05T20:20:15Z

I am trying to index my plant genome that was de novo assembled, using the STAR aligner tool. The assembly file contains 2,976,459 contigs with N50 being 1,293kb.

The following command was used:

STAR --runThreadN 8 \
--runMode genomeGenerate \
--genomeDir /path/
--genomeFastaFiles /path/*.fa \
--genomeSAindexNbases 14 \
--genomeSAsparseD 2

And the error that was encountered was

EXITING because of FATAL PARAMETER ERROR: limitGenomeGenerateRAM=31000000000is too small for your genome

SOLUTION: please specify --limitGenomeGenerateRAM not less than 2080695648522 and make that much RAM available

System capabilities: CPU with 8 cores and 244GB RAM.

One of the suggestions I was given was that the contig counts be the potential culprit. I haven't checked for any duplicates in my assembly file. Since I don't have assembly data of any related species under the same genus, I doubt if scaffolding using tools like RagTag would be helpful. Therefore I am looking forward to suggestions as to how to perform indexing within my system capacity.

Note: my end-goal is to perform BRAKER (with RNA seq and protein data), and for the Stringtie2 to work, the aligned reads need to have XS tags. With STAR aligner, this is possible.

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Indexing a 2.6GB Plant Genome Using STAR (genomeGenerate Mode) Terminated Due to RAM Limit — Can the Genome Be Split for Indexing, or Are There Other Solutions? #2235

Indexing a 2.6GB Plant Genome Using STAR (genomeGenerate Mode) Terminated Due to RAM Limit — Can the Genome Be Split for Indexing, or Are There Other Solutions? #2235

Vijithkumar2020 commented Nov 5, 2024

Indexing a 2.6GB Plant Genome Using STAR (genomeGenerate Mode) Terminated Due to RAM Limit — Can the Genome Be Split for Indexing, or Are There Other Solutions? #2235

Indexing a 2.6GB Plant Genome Using STAR (genomeGenerate Mode) Terminated Due to RAM Limit — Can the Genome Be Split for Indexing, or Are There Other Solutions? #2235

Comments

Vijithkumar2020 commented Nov 5, 2024