Mbmc: An Effective Markov Chain Approach For Binning Metagenomic Reads From Environmental Shotgun Sequencing Projects
Abstract
Metagenomics is a next-generation omics field currently impacting postgenomic life sciences and medicine. Binning metagenomic reads is essential for the understanding of microbial function, compositions, and interactions in given environments. Despite the existence of dozens of computational methods for metagenomic read binning, it is still very challenging to bin reads. This is especially true for reads from unknown species, from species with similar abundance, and/or from low-abundance species in environmental samples. In this study, we developed a novel taxonomy-dependent and alignment-free approach called MBMC (Metagenomic Binning by Markov Chains). Different from all existing methods, MBMC bins reads by measuring the similarity of reads to the trained Markov chains for different taxa instead of directly comparing reads with known genomic sequences. By testing on more than 24 simulated and experimental datasets with species of similar abundance, species of low abundance, and/or unknown species, we report here that MBMC reliably grouped reads from different species into separate bins. Compared with four existing approaches, we demonstrated that the performance of MBMC was comparable with existing approaches when binning reads from sequenced species, and superior to existing approaches when binning reads from unknown species. MBMC is a pivotal tool for binning metagenomic reads in the current era of Big Data and postgenomic integrative biology. The MBMC software can be freely downloaded at http://hulab.ucf.edu/research/projects/metagenomics/MBMC.html.
Publication Date
8-1-2016
Publication Title
OMICS A Journal of Integrative Biology
Volume
20
Issue
8
Number of Pages
470-479
Document Type
Article
Personal Identifier
scopus
DOI Link
https://doi.org/10.1089/omi.2016.0081
Copyright Status
Unknown
Socpus ID
84981537800 (Scopus)
Source API URL
https://api.elsevier.com/content/abstract/scopus_id/84981537800
STARS Citation
Wang, Ying; Hu, Haiyan; and Li, Xiaoman, "Mbmc: An Effective Markov Chain Approach For Binning Metagenomic Reads From Environmental Shotgun Sequencing Projects" (2016). Scopus Export 2015-2019. 3288.
https://stars.library.ucf.edu/scopus2015/3288