Mbmc: An Effective Markov Chain Approach For Binning Metagenomic Reads From Environmental Shotgun Sequencing Projects

Abstract

Metagenomics is a next-generation omics field currently impacting postgenomic life sciences and medicine. Binning metagenomic reads is essential for the understanding of microbial function, compositions, and interactions in given environments. Despite the existence of dozens of computational methods for metagenomic read binning, it is still very challenging to bin reads. This is especially true for reads from unknown species, from species with similar abundance, and/or from low-abundance species in environmental samples. In this study, we developed a novel taxonomy-dependent and alignment-free approach called MBMC (Metagenomic Binning by Markov Chains). Different from all existing methods, MBMC bins reads by measuring the similarity of reads to the trained Markov chains for different taxa instead of directly comparing reads with known genomic sequences. By testing on more than 24 simulated and experimental datasets with species of similar abundance, species of low abundance, and/or unknown species, we report here that MBMC reliably grouped reads from different species into separate bins. Compared with four existing approaches, we demonstrated that the performance of MBMC was comparable with existing approaches when binning reads from sequenced species, and superior to existing approaches when binning reads from unknown species. MBMC is a pivotal tool for binning metagenomic reads in the current era of Big Data and postgenomic integrative biology. The MBMC software can be freely downloaded at http://hulab.ucf.edu/research/projects/metagenomics/MBMC.html.

Publication Date

8-1-2016

Publication Title

OMICS A Journal of Integrative Biology

Volume

20

Issue

8

Number of Pages

470-479

Document Type

Article

Personal Identifier

scopus

DOI Link

https://doi.org/10.1089/omi.2016.0081

Socpus ID

84981537800 (Scopus)

Source API URL

https://api.elsevier.com/content/abstract/scopus_id/84981537800

This document is currently not available here.

Share

COinS