Jump to ContentJump to Main Navigation
Bayesian Statistics 9$
Users without a subscription are not able to see the full content.

José M. Bernardo, M. J. Bayarri, James O. Berger, A. P. Dawid, David Heckerman, Adrian F. M. Smith, and Mike West

Print publication date: 2011

Print ISBN-13: 9780199694587

Published to Oxford Scholarship Online: January 2012

DOI: 10.1093/acprof:oso/9780199694587.001.0001

Show Summary Details
Page of

PRINTED FROM OXFORD SCHOLARSHIP ONLINE (www.oxfordscholarship.com). (c) Copyright Oxford University Press, 2019. All Rights Reserved. Under the terms of the licence agreement, an individual user may print out a PDF of a single chapter of a monograph in OSO for personal use (for details see www.oxfordscholarship.com/page/privacy-policy).date: 19 April 2019

Bayesian Models for Sparse Regression Analysis of High Dimensional Data *

Bayesian Models for Sparse Regression Analysis of High Dimensional Data *

Chapter:
(p.539) Bayesian Models for Sparse Regression Analysis of High Dimensional Data*
Source:
Bayesian Statistics 9
Author(s):

Sylvia Richardson

Leonardo Bottolo

Jeffrey S. Rosenthal

Publisher:
Oxford University Press
DOI:10.1093/acprof:oso/9780199694587.003.0018

This paper considers the task of building efficient regression models for sparse multivariate analysis of high dimensional data sets, in particular it focuses on cases where the numbers q of responses Y = ( y k ,1 ≤ kq) and p of predictors X = ( x j , 1 ≤ jp) to analyse jointly are both large with respect to the sample size n, a challenging bi‐directional task. The analysis of such data sets arise commonly in genetical genomics, with X linked to the DNA characteristics and Y corresponding to measurements of fundamental biological processes such as transcription, protein or metabolite production. Building on the Bayesian variable selection set‐up for the linear model and associated efficient MCMC algorithms developed for single responses, we discuss the generic framework of hierarchical related sparse regressions, where parallel regressions of y k on the set of covariates X are linked in a hierarchical fashion, in particular through the prior model of the variable selection indicators γ kj , which indicate among the covariates x j those which are associated to the response y k in each multivariate regression. Structures for the joint model of the γ kj , which correspond to different compromises between the aims of controlling sparsity and that of enhancing the detection of predictors that are associated with many responses (“hot spots”), will be discussed and a new multiplicative model for the probability structure of the γ kj will be presented. To perform inference for these models in high dimensional set‐ups, novel adaptive MCMC algorithms are needed. As sparsity is paramount and most of the associations expected to be zero, new algorithms that progressively focus on part of the space where the most interesting associations occur are of great interest. We shall discuss their formulation and theoretical properties, and demonstrate their use on simulated and real data from genomics.

Keywords:   Adaptive MCMC scanning, eQTL, Genomics, Hierarchically related regressions, variable selection

Oxford Scholarship Online requires a subscription or purchase to access the full text of books within the service. Public users can however freely search the site and view the abstracts and keywords for each book and chapter.

Please, subscribe or login to access full text content.

If you think you should have access to this title, please contact your librarian.

To troubleshoot, please check our FAQs , and if you can't find the answer there, please contact us .