Assembly and diploid architecture of an individual human genome via single-molecule technologies. Academic Article uri icon

Overview

abstract

  • We present the first comprehensive analysis of a diploid human genome that combines single-molecule sequencing with single-molecule genome maps. Our hybrid assembly markedly improves upon the contiguity observed from traditional shotgun sequencing approaches, with scaffold N50 values approaching 30 Mb, and we identified complex structural variants (SVs) missed by other high-throughput approaches. Furthermore, by combining Illumina short-read data with long reads, we phased both single-nucleotide variants and SVs, generating haplotypes with over 99% consistency with previous trio-based studies. Our work shows that it is now possible to integrate single-molecule and high-throughput sequence data to generate de novo assembled genomes that approach reference quality.

publication date

  • June 29, 2015

Research

keywords

  • Computational Biology
  • Genome, Human
  • High-Throughput Nucleotide Sequencing
  • Polymorphism, Single Nucleotide

Identity

PubMed Central ID

  • PMC4646949

Scopus Document Identifier

  • 84938423867

Digital Object Identifier (DOI)

  • 10.1101/008003

PubMed ID

  • 26121404

Additional Document Info

volume

  • 12

issue

  • 8