Binary Interval Search: a scalable algorithm for counting interval intersections Journal Article uri icon

Overview

abstract

  • Abstract; Motivation: The comparison of diverse genomic datasets is fundamental to understand genome biology. Researchers must explore many large datasets of genome intervals (e.g. genes, sequence alignments) to place their experimental results in a broader context and to make new discoveries. Relationships between genomic datasets are typically measured by identifying intervals that intersect, that is, they overlap and thus share a common genome interval. Given the continued advances in DNA sequencing technologies, efficient methods for measuring statistically significant relationships between many sets of genomic features are crucial for future discovery.; Results: We introduce the Binary Interval Search (BITS) algorithm, a novel and scalable approach to interval set intersection. We demonstrate that BITS outperforms existing methods at counting interval intersections. Moreover, we show that BITS is intrinsically suited to parallel computing architectures, such as graphics processing units by illustrating its utility for efficient Monte Carlo simulations measuring the significance of relationships between sets of genomic intervals.; Availability:  https://github.com/arq5x/bits.; Contact:  arq5x@virginia.edu; Supplementary information: Supplementary data are available at Bioinformatics online.

publication date

  • January 1, 2013

has restriction

  • green

Date in CU Experts

  • June 21, 2018 11:46 AM

Full Author List

  • Layer RM; Skadron K; Robins G; Hall IM; Quinlan AR

author count

  • 5

Other Profiles

International Standard Serial Number (ISSN)

  • 1367-4803

Electronic International Standard Serial Number (EISSN)

  • 1367-4811

Additional Document Info

start page

  • 1

end page

  • 7

volume

  • 29

issue

  • 1