Title

DNA Sequence Compression Using the Burrows-Wheeler Transform

Keywords

Burrows-Wheeler Transform; BWT; DNA sequence compression; repetition structures

Abstract

We investigate off-line, dictionary-oriented approaches to DNA sequence compression based on the Burrows-Wheeler Transform (BWT). The preponderance of short repeating patterns is an important phenomenon in biological sequences. Here, we propose off-line methods to compress DNA sequences that exploit the different repetition structures inherent in such sequences. Repetition analysis is performed based on the relationship between the BWT and important pattern-matching data structures, such as the suffix tree and suffix array. We discuss how the proposed approach can be incorporated in the BWT compression pipeline.

Publication Date

1-1-2002

Publication Title

Proceedings - IEEE Computer Society Bioinformatics Conference, CSB 2002

Page Range

303-313

Document Type

Article; Proceedings Paper

Personal Identifier

scopus

DOI Link

https://doi.org/10.1109/CSB.2002.1039352

Scopus ID

13844291872 (Scopus)

Source API URL

https://api.elsevier.com/content/abstract/scopus_id/13844291872

