
Application 26664

Title:	Novel machine-learning framework for improved inference in GWAS
Lead Institution:	Hebrew University of Jerusalem
Principal investigator:	Professor Michal Linial


About

Finding associations between genotypes and phenotypes through genome-wide association studies (GWAS) is crucial for the future of precise, personalized medicine. Current approaches are severely limited because the number of available samples is small relative to the huge number of genetic variants studied (e.g. SNPs). We want to develop a new GWAS framework that uses machine-learning approaches to allow much stronger statistical inference. An improved statistical framework would allow many new associations to be found and would help bring these discoveries to the clinic. The success of this method depends strongly on an abundance of genetic data coupled with rich phenotypic information.
The low discovery rate of GWAS undermines the research community's efforts to deliver personalized medicine. An improved GWAS methodology is thus a pressing need and, as health-related research, a matter of public interest. If successful, our project will not only find new genetic associations directly but will also allow future studies to make better use of genotype-phenotype databases such as the UK Biobank. Our project has the potential to improve the understanding of complex diseases that affect a substantial fraction of the population (such as type 2 diabetes, asthma, and cardiovascular, autoimmune and neurodegenerative diseases).
We will process tens of thousands of the UK Biobank's high-quality genetic samples, each comprising close to 800,000 informative SNPs, in order to predict the damage to each of the ~20,000 human genes for each of the studied individuals. Combining these assessments with the detailed phenotypic data available in the UK Biobank, we will use statistical methods to uncover the associations between these phenotypes and specific genes. We will use machine-learning algorithms to assign the proper effect size to each gene.
Any individual with genetic information (DNA sequencing or SNP-array data) will improve the quality of our model and the overall effectiveness of our framework. The key to finding significant associations despite the unavoidable variation in the population is a very large dataset of individuals. According to our estimates, we will need at least tens of thousands of samples. A rich repertoire of phenotypes is also crucial, since phenotypes serve as both predicted and predicting variables. Therefore, we will need the entire set of individuals with genomic data, coupled with all of their phenotypic attributes.
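
The gene-level analysis described above (aggregating each individual's SNPs into a per-gene damage score, then testing each gene against a phenotype) can be illustrated with a minimal sketch. This is not the project's actual method. It assumes a simple weighted sum of allele dosages as the gene score and a plain linear regression in place of the machine-learning step, and it runs on simulated data only. All function names, weights, and parameters are hypothetical.

```python
import numpy as np


def gene_scores(genotypes, variant_gene, damage_weights, n_genes):
    """Aggregate per-variant dosages (0/1/2) into one score per gene per individual.

    Assumption: the gene score is a weighted sum of dosages, with hypothetical
    per-variant damage weights. Real damage predictors are more elaborate.
    """
    n_variants = genotypes.shape[1]
    # One-hot membership matrix: membership[v, g] = 1 if variant v lies in gene g.
    membership = np.zeros((n_variants, n_genes))
    membership[np.arange(n_variants), variant_gene] = 1.0
    return (genotypes * damage_weights) @ membership


def gene_association_t(scores, phenotype):
    """Per-gene t-statistic from a simple linear regression of phenotype on score.

    Assumes every gene score has non-zero variance across individuals.
    """
    n = scores.shape[0]
    x = scores - scores.mean(axis=0)
    y = phenotype - phenotype.mean()
    r = (x * y[:, None]).sum(axis=0) / np.sqrt((x**2).sum(axis=0) * (y**2).sum())
    return r * np.sqrt((n - 2) / (1.0 - r**2))


# Simulated data (no real genotypes or phenotypes are used).
rng = np.random.default_rng(0)
n_individuals, n_genes, variants_per_gene = 2000, 50, 10
n_variants = n_genes * variants_per_gene

genotypes = rng.binomial(2, 0.2, size=(n_individuals, n_variants))
variant_gene = np.repeat(np.arange(n_genes), variants_per_gene)
damage_weights = rng.uniform(0.0, 1.0, size=n_variants)

scores = gene_scores(genotypes, variant_gene, damage_weights, n_genes)

# Plant an effect in one gene so the association test has something to find.
causal_gene = 7
phenotype = 0.5 * scores[:, causal_gene] + rng.normal(size=n_individuals)

t_stats = gene_association_t(scores, phenotype)
top_gene = int(np.argmax(np.abs(t_stats)))
print("gene with the strongest association:", top_gene)
```

Testing ~20,000 genes rather than ~800,000 SNPs reduces the number of hypotheses, which is one reason a gene-level design can give stronger inference from the same number of samples.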

8 Publications

Pub ID	Title	Author(s)	Year	Journal
12697	Body Mass Index and Birth Weight Improve Polygenic Risk Score for Type 2 Diabetes	Avigail Moldovan (+3)	2021	Journal of Personalized Medicine
4825	Expanding cancer predisposition genes with ultra-rare cancer-exclusive human variations	Roni Rasnic (+2)	2020	Scientific Reports
7846	Gene-based association study reveals a distinct female genetic signal in primary hypertension	Roei Zucker (+2)	2023	Human Genetics
7458	Genetic association studies of alterations in protein function expose recessive effects on cancer predisposition	Nadav Brandes (+2)	2021	Scientific Reports
13222	PWAS Hub for exploring gene-based associations of common complex diseases	Guy Kelman (+3)	2024	Genome Research
14612	PWAS Hub: exploring gene-based associations of complex diseases with sex dependency	Roei Zucker (+2)	2024	Nucleic Acids Research
5628	PWAS: proteome-wide association study - linking genes and phenotypes by functional variation in proteins	Nadav Brandes (+2)	2020	Genome Biology
11877	Revealing the genetic complexity of hypothyroidism: integrating complementary association methods	Roei Zucker (+4)	2024	Frontiers in Genetics
