: Application

Application 27837

Title:	Statistical Methods for Large Scale Genetic Studies
Lead Institution:	Stanford University
Principal investigator:	Professor Chiara Sabatti

WARNING: the interactive features of this website use CSS3, which your browser does not support. To use the full features of this website, please update your browser.

About

Our goal is to develop new data analysis methods that are well suited to discover the many genetic signals that influence traits of medical relevance. We aim to increase the sensitivity of current tools, by accounting for the known complexity: it is likely that many different genetic variants contribute to the traits, possibly interacting with each other, and our models capitalize on this. At the same time, we want to minimize the number of false positives results, which are unfortunately quite likely when one searches for possible associations among as many possibilities as those in genomewide studies of multiple traits. The UK Biobank data has one of the largest sample sizes in genetics data and to take fully advantage of this new data analysis methods are needed. Approaches with increased sensitivity and specificity in genetic association studies will facilitate the identification of the biological pathways perturbed in diseases. They will allow us to zoom in more precisely on the important biology?identifying relevant genes even when their effects are small, while avoiding false leads. This knowledge is important for risk assessment, therapy choices, and drug development. We will use the UK Biobank data to identify the concrete challenges presented by the analysis of large datasets and to test the performance of the methods that we will develop, relying both on simulations and on comparative data analysis. We will use the genotype data to generate artificial traits with known genetic architecture and evaluate the performance of different methods in recovering it. We will also use measured traits to understand what type of genetic architecture is likely to be important for medical relevant phenotypes. Because our focus is on the development of methods applicable to large samples, taking advantage of the more detail information they contain, we are interested in working with the full cohort.

1 Return

Return ID	App ID	Description	Archive Date
2580	27837	Multi-resolution localization of causal variants across the genome	27 Oct 2020

12 Publications

Pub ID	Title	Author(s)	Year	Journal
11855	Catch me if you can: signal localization with knockoff e-values	Paula Gablenz (+1)	2024	Journal of the Royal Statistical Society Series B Statistical Methodology
11017	Conformalized survival analysis	Emmanuel Candès (+2)	2023	Journal of the Royal Statistical Society Series B Statistical Methodology
10268	Derandomizing Knockoffs	Zhimei Ren (+2)	2021	Journal of the American Statistical Association
4848	False discovery rate control in genome-wide association studies with population structure	Matteo Sesia (+4)	2021	Proceedings of the National Academy of Sciences of the United States of America
2581	Multi-resolution localization of causal variants across the genome	Matteo Sesia (+4)	2020	Nature Communications
18115	Searching for Local Associations while Controlling the False Discovery Rate	Paula Gablenz (+3)	2026	Journal of the American Statistical Association
10494	Searching for robust associations with a multi-environment knockoff filter	S Li (+4)	2021	Biometrika
13346	Second-order group knockoffs with applications to genome-wide association studies	Benjamin B Chu (+6)	2024	Bioinformatics
13428	Second-order group knockoffs with applications to genome-wide association studies	Benjamin B Chu (+6)	2024	Bioinformatics
13578	Second-order group knockoffs with applications to genome-wide association studies	Benjamin B Chu (+6)	2024	Bioinformatics
8196	Simultaneous high-probability bounds on the false discovery proportion in structured, regression and online settings	Eugene Katsevich (+1)	2020	The Annals of Statistics
8068	Transfer Learning in Genome-Wide Association Studies with Knockoffs	Shuangning Li (+3)	2022	Sankhya B

1 Category

Category ID	Description	Items
1017	Genomics	30

Enabling scientific discoveries that improve human health