BM-BC: A Bayesian Method of Base Calling for Solexa Sequence Data

Access full-text files

Date

2012-08

Authors

Ji, Yuan
Mitra, Riten
Quintana, Fernando
Jara, Alejandro
Mueller, Peter
Liu, Ping
Lu, Yue
Liang, Shoudan

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Base calling is a critical step in the Solexa next-generation sequencing procedure. It compares the position-specific intensity measurements that reflect the signal strength of four possible bases (A, C, G, T) at each genomic position, and outputs estimates of the true sequences for short reads of DNA or RNA. We present a Bayesian method of base calling, BM-BC, for Solexa-GA sequencing data. The Bayesian method builds on a hierarchical model that accounts for three sources of noise in the data, which are known to affect the accuracy of the base calls: fading, phasing, and cross-talk between channels. We show that the new method improves the precision of base calling compared with currently leading methods. Furthermore, the proposed method provides a probability score that measures the confidence of each base call. This probability score can be used to estimate the false discovery rate of the base calling or to rank the precision of the estimated DNA sequences, which in turn can be useful for downstream analysis such as sequence alignment.

Description

LCSH Subject Headings

Citation

Ji, Yuan, Riten Mitra, Fernando Quintana, Alejandro Jara, Peter Mueller, Ping Liu, Yue Lu, and Shoudan Liang. "BM-BC: a Bayesian method of base calling for Solexa sequence data." BMC bioinformatics, Vol. 13, No. 13 (Aug., 2012): 1.