When BLAST was first published in 1990, there were fewer than 50 million bases of nucleotide sequence in the public archives; now a single sequencing instrument can produce over 1 trillion bases per run. New methods are needed that can manage and help organize this scale of data. To address this, we consider the general problem of computing an approximate distance between two sequences and describe Mash, a general-purpose toolkit that utilizes the MinHash technique to reduce large sequences (or sequence sets) to compressed sketch representations.
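To make the idea of a compressed sketch concrete, here is a minimal illustration of bottom-s MinHash over k-mers. It is a sketch under stated assumptions, not Mash's own code: the function names (`kmers`, `hash64`, `sketch`, `jaccard_estimate`), the tiny defaults `k=5` and `s=50`, and the use of SHA-1 as the hash are all chosen here for illustration only.

```python
import hashlib


def kmers(seq, k):
    """Return the set of all length-k substrings of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def hash64(kmer):
    """Deterministic 64-bit hash of a k-mer (SHA-1 is an illustrative choice)."""
    return int.from_bytes(hashlib.sha1(kmer.encode()).digest()[:8], "big")


def sketch(seq, k=5, s=50):
    """Bottom-s sketch: the s smallest distinct k-mer hash values, sorted."""
    return sorted({hash64(km) for km in kmers(seq, k)})[:s]


def jaccard_estimate(sk_a, sk_b, s=50):
    """Estimate the Jaccard index of two k-mer sets from their sketches.

    Take the s smallest hashes of the merged sketches and count how many
    of them occur in both inputs.
    """
    a, b = set(sk_a), set(sk_b)
    merged = sorted(a | b)[:s]
    if not merged:
        return 0.0
    shared = sum(1 for h in merged if h in a and h in b)
    return shared / len(merged)
```

Calling `jaccard_estimate(sketch(x), sketch(y))` compares two sequences using only their sketches, so the cost depends on the sketch size `s` rather than on the sequence lengths.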