Recent advances in the development of technology based on Raman scattering as a chemical analytical technique have made it possible to quantitatively detect spectral mixtures of multiple DNA sequences. However, to fully exploit these techniques inferential methodologies are required which can deconvolute the observed mixture and infer the compositionof distinct DNA sequences present in the overall composite. Inferring the spectral decomposition is posed as a model selection problem for a bilinear statistical model, and the required Markov chain Monte Carlo inferentialmethodology is developed. In particular a Gibbs sampler and reversible Jump Markov chain Monte Carlo methods are presented along with techniques based on the Bayesian Information Criterion, Thermodynamic Integration and the method proposed by Chib. The results reported in this paper are particularly encouraging highlighting that for multiplexed Raman spectra, inference of the composition of original sequences present in the mixture is possible to acceptable levels of accuracy. This statistical methodology makes the exploitation of multiplexed surface enhanced resonance Raman scattering spectra in disease identification a reality. A website containing the spectral data used in the paper as well as Matlab scripts implementing the proposed statistical methods is available at http://www.dcs.gla.ac.uk/inference/RJMCMC.
- bayesian methods
- raman spectra
- thermodynamic integration
- reversible jump
Zhong, M., Girolami, M., Faulds, K., & Graham, D. (2011). Bayesian methods to detect dye labelled DNA oligonucleotides in multiplexed raman spectra. Journal of Applied Statistics, 60(2), 187-206. https://doi.org/10.1111/j.1467-9876.2010.00744.x