Bayesian variable selection using spike-and-slab priors with application to high dimensional electroencephalography data by local modelling

Journal of the Royal Statistical Society: Series C (Applied Statistics), 68(5)

Journal of the Royal Statistical Society: Series C (Applied Statistics), 68(5), (2019).


Because of the immense technological advances, very often we encounter data in high dimensions. Any set of measurements taken at multiple time points for multiple subjects leads to data of more than two dimensions (a matrix of covariates for each subject). We present a Bayesian variable‐selection method to identify the active regions in the brain as a response to a certain stimulus. We build binary classification models of subject level responses by using binary regression with Gaussian models on the latent variables. We also study the scaled normal priors on the latent variables, as they cover a large family of distributions. We use continuous spike‐and‐slab priors to incorporate variable selection within the modelling. Because of the computational complexity, we build many local (at different time points) models and make predictions by utilizing the temporal structure between the local models. We perform two‐stage variable selection for each of these local models. We demonstrate the effectiveness of such modelling through the results of a simulation study. We then present the performance of these models on multisubject neuroimaging (electroencephalography) data to study the effects on the functional states of the frontal cortex and parietal lobe for chronic exposure of alcohol.

Shariq Mohammed

My research interests include Bayesian modeling, variable selection, data integration - cancer imaging genomics, statistical methods for medical imaging, precision health.