Master of Science in Computer Science
First Committee Member
Second Committee Member
Third Committee Member
Fourth Committee Member
Number of Pages
In this thesis, we report on the use of minhash techniques to improve the draft assembly of a genome mapping. More specifically, we use minhash to compare the scaffolds of sea urchin and sea cucumber genomes.
One of the main contributions of this thesis is the implementation of minhash with the Message Passing Interface (MPI) utilizing Intel Phi co-processors. It is shown that our implementation significantly reduces the processing time for identification of k-mer similarities.
data mining; data science; genome mapping; minhash; parallel programming; pattern recognition
Varghese, Saju, "Enhancing the Draft Assembly with Minhash" (2016). UNLV Theses, Dissertations, Professional Papers, and Capstones. 2910.