: Publication 5438

Publication 5438

Title:	Multi-scale inference of genetic trait architecture using biologically annotated neural networks
Journal:	PLOS Genetics
Published:	19 Aug 2021
Pubmed:	https://pubmed.ncbi.nlm.nih.gov/34411094/
DOI:	https://doi.org/10.1371/journal.pgen.1009754
URL:	https://journals.plos.org/plosgenetics/article/file?id=10.1371/journal.pgen.1009754&type=printable
Citations:	19 (11 in last 2 years) as of 8 Aug 2024

WARNING: the interactive features of this website use CSS3, which your browser does not support. To use the full features of this website, please update your browser.

Abstract

In this article, we present Biologically Annotated Neural Networks (BANNs), a nonlinear probabilistic framework for association mapping in genome-wide association (GWA) studies. BANNs are feedforward models with partially connected architectures that are based on biological annotations. This setup yields a fully interpretable neural network where the input layer encodes SNP-level effects, and the hidden layer models the aggregated effects among SNP-sets. We treat the weights and connections of the network as random variables with prior distributions that reflect how genetic effects manifest at different genomic scales. The BANNs software uses variational inference to provide posterior summaries which allow researchers to simultaneously perform (i) mapping with SNPs and (ii) enrichment analyses with SNP-sets on complex traits. Through simulations, we show that our method improves upon state-of-the-art association mapping and enrichment approaches across a wide range of genetic architectures. We then further illustrate the benefits of BANNs by analyzing real GWA data assayed in approximately 2,000 heterogenous stock of mice from the Wellcome Trust Centre for Human Genetics and approximately 7,000 individuals from the Framingham Heart Study. Lastly, using a random subset of individuals of European ancestry from the UK Biobank, we show that BANNs is able to replicate known associations in high and low-density lipoprotein cholesterol content.

13 Keywords

Animals
Genome
Genome-Wide Association Study
Genomics
Genotype
Humans
Models, Genetic
Molecular Sequence Annotation
Multifactorial Inheritance
Neural Networks, Computer
Phenotype
Polymorphism, Single Nucleotide
Software

6 Authors

Pinar Demetci
Wei Cheng
Gregory Darnell
Xiang Zhou
Sohini Ramachandran
Lorin Crawford

1 Application

Application ID	Title
22419	Identifying gene networks underlying complex traits in the UK Biobank

Enabling scientific discoveries that improve human health