protr 1.7-1
New features
-
crossSetSim()
now gains two new argumentsbatches
andverbose
.The
batches
argument allows users to split the similarity computations into multiple batches, which is useful when dealing with a large number of sequences and limited RAM. Theverbose
argument enables progress updates during the computation. This bringscrossSetSim()
to feature parity withparSeqSim()
(thanks, @ofleitas, #41). -
A new function
crossSetSimDisk()
has been implemented as a disk-based version ofcrossSetSim()
.This function follows a similar approach to
parSeqSimDisk()
, where partial results from each batch are cached on the hard drive and merged at the end. This allows for processing larger protein sequence sets that may not fit into RAM (#41).