Advancing total tumour volume estimation in colorectal liver metastases: development and evaluation of a self-learning auto-segmentation model
Inez Margaretha Verpalen, Amsterdam / Netherlands
Author Block: J. I. Bereska1, M. Zeeuw1, L. Wagenaar1, M. G. Besselink1, H. Marquering1, J. Stoker1, Å. Fretland2, G. Kazemier1, I. M. Verpalen1; 1Amsterdam/NL, 2Oslo/NOPurpose: Total tumour volume (TTV) assessments have been shown to be prognostic of overall and recurrence-free survival in patients with colorectal cancer liver metastases (CRLM). However, the labor-intensive nature of these assessments has hampered their clinical adoption. This study aimed to develop an auto-segmentation model for CRLM on contrast-enhanced portal venous phase CT scans to facilitate the clinical adoption of TTV assessments.Methods or Background: We developed a self-learning-based segmentation model to segment CRLM using 760 portal venous phase CTs (CT-PVP) of 363 patients with 13,739 CRLM from the Amsterdam University Medical Centre. We used a self-learning setup in which we first trained a teacher model on 99 manually segmented CT-PVPs segmented by three radiologists and combined using the Simultaneous Truth and Performance Level Estimation (STAPLE) algorithm. The teacher model was then used to segment the remaining 661 CT-PVPs for training the student model. We used Intraclass Correlation Coefficient (ICC) to compare the TTV obtained from the student model's segmentations against that obtained from the STAPLE-combined radiologist's segmentations.Results or Findings: We evaluated the student model in an external test set of 50 CT-PVPs from 35 patients with 72 CRLM from the Oslo University hospital. The student model reached a DICE similarity score of - 83 for segmenting CRLM. There was no significant difference between the student model's DICE scores and interrater DICE scores. The ICC between the student model's and the STAPLE-combined TTV was 0.97, signifying near perfect agreement.
Conclusion: Segmentation models can provide accurate and efficient assessments of TTV in CRLM patients.Limitations: Our study's limitations include its retrospective design, lack of global data, and an external test cohort that differs from the training set, underlining the need for prospective, internationally diverse studies for more robust validation.Funding for this study: This study was funded by the KWF (project number 14002).Has your study been approved by an ethics committee? YesEthics committee - additional information: The Medical Ethics Review Committee of the Amsterdam UMC, the Regional Ethical Committee of South Eastern Norway, and the Data Protection Officer of Oslo University Hospital approved this study protocol.