Abstract
Even modest improvements in the probability of success of selecting drug targets which are ultimately approved can substantially reduce the costs of research and development. Drug targets with human genetic evidence of disease association are twice as likely to lead to approved drugs. A key enabler of identifying and validating these genetically validated targets is access to association results from genome-wide genotyping, whole-exome sequencing, and whole-genome sequencing studies with observable traits (often diseases) across large numbers of individuals. Today, linkage between genotype and real-world data (RWD) provides significant opportunities to not only increase the statistical power of genome-wide association studies by ascertaining additional cases for diseases of interest, but also to improve diversity and coverage of association studies across the disease phenome. As RWD-genetics linked resources continue to grow in diversity of participants, breadth of data captured, length of observation, and number of participants, there is a greater need to leverage the experience of RWD experts, clinicians, and highly experienced geneticists together to understand which lessons and frameworks from general research using RWD sources are relevant to improve genetics-driven drug discovery and development. This paper describes new challenges and opportunities for phenotypes enabled by diverse RWD sources, considerations in the use of RWD phenotypes for disease gene identification across the disease phenome, and challenges and opportunities in leveraging RWD phenotypes in target validation. The paper concludes with views on the future directions for phenotype development using RWD, and key questions requiring further research and development to advance this nascent field.</p>