
Assistant Professor of CS Timothy Becker Publishes a Journal Article on a New Algorithmic Method

August 22, 2021
Submitted By: Timothy Becker

Abstract: Structural Variation (SV) calling and genotyping remain an ongoing challenge using next generation sequencing technologies. The gold standard approach for genome consortia has been to utilise multiple SV calling algorithms and then merge the results based on SV type and coordinates and more recently to make use of multiple sequencing technologies for each sample cell line. This ensemble strategy provides more comprehensive SV calling but comes at the cost of high-compute run time. We make use of popular open-source machine learning libraries to formulate a new data representation suitable for mining whole genome sequences in a fraction of the ensemble time. We then compare the results to several well-established methods and ensembles. Our pure machine learning method demonstrates a new direction in technique, where feature selection and region filtering are no longer required to achieve desirable false positive rates.

Taken from the International Journal of Data Mining and Bioinformatics (IJDMB), 2021, volume 25, issue 1/2, pp. 37-52.

http://dx.doi.org/10.1504/IJDMB.2021.116880