|Exome sequencing and characterization of coding variation
This paper describes the first tranche of large-scale exome sequence data for 49,960 study participants, revealing approximately 4 million coding variants (of which ~98.4% have frequency <1%). The data includes 231,631 predicted loss of function variants, a >10-fold increase compared to imputed sequence for the same participants. Nearly all genes (>97%) had ≥1 predicted loss of function carrier, and most genes (>69%) had ≥0 loss of function carriers. We illustrate the power of characterizing loss of function variation in this large population through association analyses across 1,741 phenotypes. In addition to replicating a range of established associations, it describes novel loss of function variants with large effects on disease traits, including PIEZO1 on varicose veins, COL6A1 on corneal resistance, MEPE on bone density, and IQGAP2 and GMPR on blood cell traits. It further demonstrates the value of exome sequencing by surveying the prevalence of pathogenic variants of clinical significance in this population, finding that 2% of the population has a medically actionable variant. Additionally, it uses phenotypic data to characterize the relationship between rare BRCA1 and BRCA2 pathogenic variants and cancer risk.
Click URL to view: http://www.biorxiv.org/content/10.1101/572347v1
Note: this resource material is outside the Showcase system and cannot take responsibility for its content or accuracy.