Cue: a deep-learning framework for structural variant discovery and genotyping. Academic Article uri icon

Overview

abstract

  • Structural variants (SVs) are a major driver of genetic diversity and disease in the human genome and their discovery is imperative to advances in precision medicine. Existing SV callers rely on hand-engineered features and heuristics to model SVs, which cannot scale to the vast diversity of SVs nor fully harness the information available in sequencing datasets. Here we propose an extensible deep-learning framework, Cue, to call and genotype SVs that can learn complex SV abstractions directly from the data. At a high level, Cue converts alignments to images that encode SV-informative signals and uses a stacked hourglass convolutional neural network to predict the type, genotype and genomic locus of the SVs captured in each image. We show that Cue outperforms the state of the art in the detection of several classes of SVs on synthetic and real short-read data and that it can be easily extended to other sequencing platforms, while achieving competitive performance.

publication date

  • March 23, 2023

Research

keywords

  • Deep Learning
  • Software

Identity

PubMed Central ID

  • PMC10152467

Scopus Document Identifier

  • 85150653969

Digital Object Identifier (DOI)

  • 10.24433/CO.8949236.v2

PubMed ID

  • 36959322

Additional Document Info

volume

  • 20

issue

  • 4