: Publication 3976

Publication 3976

Title:	Accurate, scalable and integrative haplotype estimation
Journal:	Nature Communications
Published:	28 Nov 2019
Pubmed:	https://pubmed.ncbi.nlm.nih.gov/31780650/
DOI:	https://doi.org/10.1038/s41467-019-13225-y
URL:	https://www.nature.com/articles/s41467-019-13225-y.pdf
Citations:	431 (218 in last 2 years) as of 8 Aug 2024

WARNING: the interactive features of this website use CSS3, which your browser does not support. To use the full features of this website, please update your browser.

Abstract

The number of human genomes being genotyped or sequenced increases exponentially and efficient haplotype estimation methods able to handle this amount of data are now required. Here we present a method, SHAPEIT4, which substantially improves upon other methods to process large genotype and high coverage sequencing datasets. It notably exhibits sub-linear running times with sample size, provides highly accurate haplotypes and allows integrating external phasing information such as large reference panels of haplotypes, collections of pre-phased variants and long sequencing reads. We provide SHAPEIT4 in an open source format and demonstrate its performance in terms of accuracy and running times on two gold standard datasets: the UK Biobank data and the Genome In A Bottle.</p>

11 Keywords

Biological Specimen Banks
Data Interpretation, Statistical
Datasets as Topic
Genotype
Haplotypes
High-Throughput Nucleotide Sequencing
Humans
Polymorphism, Single Nucleotide
Sample Size
Sequence Analysis, DNA
Software

5 Authors

Olivier Delaneau
Jean-François Zagury
Matthew R. Robinson
Jonathan L. Marchini
Emmanouil T. Dermitzakis

1 Application

Application ID	Title
35520	Improving estimation and prediction of common complex disease risk

Enabling scientific discoveries that improve human health