: Application

Application 17984

Title:	Genotype-environment interactions and sub-classification of disease
Lead Institution:	Georgia Institute of Technology
Principal investigator:	Professor Greg Gibson

WARNING: the interactive features of this website use CSS3, which your browser does not support. To use the full features of this website, please update your browser.

About

Our research utilizing the UK Biobank survey, health record, and genomic data will focus on two broad interest areas: analysis of genotype-environment interactions, and statistical approaches to sub-classification of disease. The Biobank will allow us to evaluate how genetic factors have different effects in conjunction with diverse lifestyles. For example, consuming large amounts of caffeine may interact with genetic risk of obesity to elevate the likelihood of heart attack. Conversely, thousands of people with coronary disease may be subdivided into smaller sets who have features in common that would not be detected without the scale of the Biobank. We will also test whether subsets of samples that are clustered from clinical information are enriched for different genetic ancestries. Since people vary in their risk of developing a wide range of diseases because of the joint influences of genetic variation and differences in the environment, including lifestyle choices, our research has implications for discovery of genetic factors, prediction of the course of disease in individuals, and advocacy for public policy decisions. The sub-classification of groups of patients who share etiological factors has the potential to define what treatments are most effective for patients, or to identify high risk groups of healthy adults for whom simple interventions can prevent illness. Several advanced statistical approaches will be used to detect interactions between genetic and environmental factors as sources of disease risk. The idea is that combinations of genes and behaviors reinforce or cancel one another, and it takes very large datasets to evaluate the repeatability of the effects. In addition, we will use something called tensor factorization to combine health record and genotype data to discover novel combinations of variables shared by small sets of patients. The full cohort will be included in our studies.

9 Publications

Pub ID	Title	Author(s)	Year	Journal
19147	Assessment of genetic and metabolite associations of branched chain amino acids with metabolic disease in the UK Biobank using Mendelian randomization	Jedrzej Konarkowski (+2)	2025	BMC Medical Genomics
12479	Canalization of the Polygenic Risk for Common Diseases and Traits in the UK Biobank Cohort	Sini Nagpal (+2)	2022	Molecular Biology and Evolution
19065	Greater value add from electronic health records than polygenic risk scores for predicting myocardial infarction in machine learning	Monica Isgut (+8)	2025	Communications Medicine
9547	Highly elevated polygenic risk scores are better predictors of myocardial infarction risk early in life than later	Monica Isgut (+3)	2021	Genome Medicine
13432	Identifying and characterizing disease subpopulations that most benefit from polygenic risk scores	Monica Isgut (+7)	2024	Scientific Reports
10705	Mendelian Randomization Indicates a Causal Role for Omega-3 Fatty Acids in Inflammatory Bowel Disease	Courtney Astore (+2)	2022	International Journal of Molecular Sciences
12067	Pervasive Modulation of Obesity Risk by the Environment and Genomic Background	Sini Nagpal (+2)	2018	Genes
7298	Stratification of risk of progression to colectomy in ulcerative colitis via measured and predicted gene expression	Angela Mo (+40)	2021	American Journal of Human Genetics
5875	The Medical Genome Reference Bank contains whole genome and phenotype data of 2570 healthy elderly	Mark Pinese (+37)	2020	Nature Communications

Enabling scientific discoveries that improve human health