Skip to main navigation Skip to search Skip to main content

Reliability of [18F]FDG PET/CT for post-treatment surveillance of non-small cell lung cancer: agreement among multiple centers

Kasper Foged Guldbrandsen*, Markus Nowak Lonsdale, Hanne Marie Nellemann, Catharina Mølgaard Bylov, Joan Fledelius, Karin Hjorthaug, Barbara Jolanta Jørgensen, Martin Krakauer, Mette Schødt, Peter Michael Gørtz, Mie Kiszka Nielsen, Anne Lerberg Nielsen, Annemarie Gjelstrup Amtoft, Elisabeth Albrecht-Beste, Danijela Dejanovic, Marie Josée Zareh Lausten-Thomsen, Paw Christian Holdgaard, Magdalene Kubik, Søren Steen Nielsen, Oke GerkeTorben Riis Rasmussen, Barbara Malene Fischer

*Corresponding author for this work

Abstract

PURPOSE: Fluorine-18 fluorodeoxyglucose positron emission tomography/computed tomography ([18F]FDG PET/CT) has shown promise for post-treatment surveillance in patients with non-small cell lung cancer (NSCLC). This study evaluated interobserver agreement of PET/CT interpretation for NSCLC surveillance in a multicenter setting.

METHODS: Nine teams from seven centers, each team consisting of a nuclear medicine specialist and a radiologist, participated in the study. A total of 150 PET/CT scans were selected, and each was independently reviewed by two randomly assigned teams. Scans were performed six months post-treatment for scheduled recurrence assessment in stage Ia-IIIc NSCLC patients. Each scan was evaluated for suspicion of recurrence using two methods; without any pre-specified criteria (conventional assessment) and using pre-specified, qualitative criteria (Hopkins criteria). Both scoring methods were compared to a reference standard to assess accuracy.

RESULTS: Conventional assessment showed moderate interobserver agreement (κ = 0.55, 95% CI 0.41-0.69; 79% overall agreement) for the diagnosis of recurrence. Hopkins criteria demonstrated substantial agreement (κ = 0.61, 95% CI 0.45-0.77; 87% overall agreement). There was no difference in the area under the curve (AUC) between conventional assessment (0.80, 95% CI 0.72-0.88) and Hopkins criteria (0.82, 95% CI 0.74-0.90) compared to the reference standard (p = 0.21).

CONCLUSIONS: Interobserver agreement for [18F]FDG PET/CT interpretation in NSCLC surveillance was moderate to substantial. While applying pre-specified reporting criteria did not significantly improve the agreement, it did not hinder the diagnostic accuracy. Efforts to reduce the variability of reporting, including continuous training and structured reporting, could improve the clinical impact of this technology.

Original languageEnglish
JournalEuropean Journal of Nuclear Medicine and Molecular Imaging
Volume53
Issue number1
Pages (from-to)275-282
Number of pages8
ISSN1619-7070
DOIs
Publication statusPublished - Dec 2025

Keywords

  • Interobserver agreement
  • Lung cancer
  • Recurrence
  • Surveillance
  • [ F]FDG PET/CT
  • Reproducibility of Results
  • Carcinoma, Non-Small-Cell Lung/diagnostic imaging
  • Humans
  • Middle Aged
  • Male
  • Positron Emission Tomography Computed Tomography/standards
  • Lung Neoplasms/diagnostic imaging
  • Aged, 80 and over
  • Fluorodeoxyglucose F18
  • Female
  • Aged
  • Radiopharmaceuticals
  • Observer Variation

Fingerprint

Dive into the research topics of 'Reliability of [18F]FDG PET/CT for post-treatment surveillance of non-small cell lung cancer: agreement among multiple centers'. Together they form a unique fingerprint.

Cite this