TY - JOUR
T1 - A knowledge graph to interpret clinical proteomics data
AU - Santos, Alberto
AU - Colaço, Ana R
AU - Nielsen, Annelaura B
AU - Niu, Lili
AU - Strauss, Maximilian
AU - Geyer, Philipp E
AU - Coscia, Fabian
AU - Albrechtsen, Nicolai J Wewer
AU - Mundt, Filip
AU - Jensen, Lars Juhl
AU - Mann, Matthias
N1 - © 2022. The Author(s).
PY - 2022/5
Y1 - 2022/5
N2 - Implementing precision medicine hinges on the integration of omics data, such as proteomics, into the clinical decision-making process, but the quantity and diversity of biomedical data, and the spread of clinically relevant knowledge across multiple biomedical databases and publications, pose a challenge to data integration. Here we present the Clinical Knowledge Graph (CKG), an open-source platform currently comprising close to 20 million nodes and 220 million relationships that represent relevant experimental data, public databases and literature. The graph structure provides a flexible data model that is easily extendable to new nodes and relationships as new databases become available. The CKG incorporates statistical and machine learning algorithms that accelerate the analysis and interpretation of typical proteomics workflows. Using a set of proof-of-concept biomarker studies, we show how the CKG might augment and enrich proteomics data and help inform clinical decision-making.
AB - Implementing precision medicine hinges on the integration of omics data, such as proteomics, into the clinical decision-making process, but the quantity and diversity of biomedical data, and the spread of clinically relevant knowledge across multiple biomedical databases and publications, pose a challenge to data integration. Here we present the Clinical Knowledge Graph (CKG), an open-source platform currently comprising close to 20 million nodes and 220 million relationships that represent relevant experimental data, public databases and literature. The graph structure provides a flexible data model that is easily extendable to new nodes and relationships as new databases become available. The CKG incorporates statistical and machine learning algorithms that accelerate the analysis and interpretation of typical proteomics workflows. Using a set of proof-of-concept biomarker studies, we show how the CKG might augment and enrich proteomics data and help inform clinical decision-making.
KW - Algorithms
KW - Decision Making, Computer-Assisted
KW - Knowledge Bases
KW - Machine Learning
KW - Pattern Recognition, Automated
KW - Precision Medicine/methods
KW - Proteomics/standards
UR - http://www.scopus.com/inward/record.url?scp=85123916062&partnerID=8YFLogxK
U2 - 10.1038/s41587-021-01145-6
DO - 10.1038/s41587-021-01145-6
M3 - Journal article
C2 - 35102292
VL - 40
SP - 692
EP - 702
JO - Nature Biotechnology
JF - Nature Biotechnology
SN - 1087-0156
IS - 5
ER -