Cahiers du CEREMADE

Unité Mixte de Recherche du C.N.R.S. N°7534
 
Abstract : We consider the choice of an optimal sample size for multiple comparison problems. The motivating application is the choice of the number of microarray experiments to be carried out when learning about differential gene expression. However, the approach is valid in any application that involves multiple comparison in a large number of hypothesis tests. We discuss two decision problems in the context of this setup, the sample size selection and the decision about the multiple comparisons. The focus of the discussion is on the sample size selection. For the]multiple comparison we assume an approach as in \Genovese and Wasserman (2002), based on controlling posterior expected false discovery rate (FDR). For the sample size selection we adopt a decision theoretic solution, using expected false negative rate (FNR) as decision criterion, combined with a power analysis as sensitivity diagnostic. Posterior expected FDR and marginal FNR are computed with respect to an assumed parametric probability model. In our implementation we use a version of the model proposed in Nweton et al. (2001). But the discussion is independent of the chosen probability model. The approach is valid for any model that includes positive prior probabilities for the null hypotheses in the multiple comparisons, and that allows efficient marginal and posterior simulation. Posterior and marginal simulation can be by dependent Markov chain Monte Carlo simulation.
 
 
OPTIMAL SAMPLE SIZE FOR MULTIPLE TESTING: THE CASE OF GENE EXPRESSION MICROARRAYS
MULLER P., PARMIGIANI G., ROBERT Christian P.
ROUSSEAU Judith
2002-46
19-12-2002
 
Université de PARIS - DAUPHINE
Place du Maréchal de Lattre De Tassigny - 75775 PARIS CEDEX 16 - FRANCE
Téléphone : +33 (0)1 44-05-49-23 - fax : +33 (0)1 44-05-45-99