Title
Dna Sequence Compression Using The Burrows-Wheeler Transform
Keywords
Burrows-Wheeler Transform; BWT; DNA sequence compression; repetition structures
Abstract
We investigate off-line dictionary oriented approaches to DNA sequence compression, based on the Burrows-Wheeler Transform (BWT). The preponderance of short repeating patterns is an important phenomenon in biological sequences. Here, we propose off-line methods to compress DNA sequences that exploit the different repetition structures inherent in such sequences. Repetition analysis is performed based on the relationship between the BWT and important pattern matching data structures, such as the suffix tree and suffix array. We discuss how the proposed approach can be incorporated in the BWT compression pipeline.
Publication Date
1-1-2002
Publication Title
Proceedings - IEEE Computer Society Bioinformatics Conference, CSB 2002
Number of Pages
303-313
Document Type
Article; Proceedings Paper
Personal Identifier
scopus
DOI Link
https://doi.org/10.1109/CSB.2002.1039352
Copyright Status
Unknown
Socpus ID
13844291872 (Scopus)
Source API URL
https://api.elsevier.com/content/abstract/scopus_id/13844291872
STARS Citation
Adjeroh, D.; Zhang, Y.; and Mukherjee, A., "Dna Sequence Compression Using The Burrows-Wheeler Transform" (2002). Scopus Export 2000s. 2746.
https://stars.library.ucf.edu/scopus2000/2746