Deep integrative models for large-scale human genomics

Arnór I Sigurdsson, Ioannis Louloudis, Karina Banasik, David Westergaard, Ole Winther, Ole Lund, Sisse Rye Ostrowski, Christian Erikstrup, Ole Birger Vesterager Pedersen, Mette Nyegaard, Søren Brunak, Bjarni J Vilhjálmsson, Simon Rasmussen*, DBDS Genomic Consortium, Mona Ameri Chalmer (Member of study group), Maria Didriksen (Member of study group), Joseph Dowsett (Member of study group), Thomas Folkmann Hansen (Member of study group), Lisette Kogelman (Member of study group)

*Corresponding author for this work

Abstract

Polygenic risk scores (PRSs) are expected to play a critical role in precision medicine. Currently, PRS predictors are generally based on linear models using summary statistics and, more recently, individual-level data. However, these predictors mainly capture additive relationships and are limited in the data modalities they can use. We developed a deep learning framework (EIR) for PRS prediction which includes a model, genome-local-net (GLN), specifically designed for large-scale genomics data. The framework supports multi-task learning, automatic integration of other clinical and biochemical data, and model explainability. When applied to individual-level data from the UK Biobank, the GLN model demonstrated competitive performance compared to established neural network architectures, particularly for certain traits, showcasing its potential in modeling complex genetic relationships. Furthermore, the GLN model outperformed linear PRS methods for type 1 diabetes (T1D), likely due to modeling non-additive genetic effects and epistasis. This was supported by our identification of widespread non-additive genetic effects and epistasis in the context of T1D. Finally, we constructed PRS models that integrated genotype, blood, urine, and anthropometric data and found that this improved performance for 93% of the 290 diseases and disorders considered. EIR is available at https://github.com/arnor-sigurdsson/EIR.

Original language: English
Journal: Nucleic Acids Research
Volume: 51
Issue number: 12
Pages (from-to): e67
ISSN: 0305-1048
Publication status: Published - 7 Jul 2023

Keywords

  • Genetic Predisposition to Disease
  • Genome, Human
  • Genome-Wide Association Study
  • Genomics/methods
  • Genotype
  • Humans
  • Models, Genetic
  • Multifactorial Inheritance
  • Polymorphism, Single Nucleotide
  • Risk Factors
