Segmentation variability impacts the robustness of Magnetic Resonance Imaging (MRI) and Positron Emission Tomography (PET) radiomic features.
Only a small number (19%) of radiomic features were classified as robust.
Grey-level texture features are more robust than first-order or shape features.
Prostate-Specific Membrane Antigen (PSMA)-PET has more radiomic features with ‘excellent’ robustness than T2 or Apparent Diffusion Coefficient (ADC) MRI.
A random forest model using manual prostate segmentation-based radiomic features best predicted biochemical recurrence (BCR).
Radiomic features from MRI and PET are an emerging tool with potential to improve prostate cancer outcomes. However, feature robustness due to image segmentation variations is currently unknown. Therefore, this study aimed to evaluate the robustness of radiomic features with segmentation variations and their impact on predicting biochemical recurrence (BCR).
Multi-scanner, pre-radiation therapy imaging from 142 patients with localised prostate cancer was used. Imaging included T2-weighted (T2), apparent diffusion coefficient (ADC) MRI, and prostate-specific membrane antigen (PSMA)-PET. The prostate gland and intraprostatic tumours were manually and automatically segmented, and differences were quantified using Dice Coefficient (DC). Radiomic features including shape, first-order, and texture features were extracted for each segmentation from original and filtered images. Intraclass Correlation Coefficient (ICC) and Mean Absolute Percentage Difference (MAPD) were used to assess feature robustness. Random forest (RF) models were developed for each segmentation using robust features to predict BCR.
Prostate gland segmentations were more consistent (mean DC = 0.78) than tumour segmentations (mean DC = 0.46). 112 (3.6 %) radiomic features demonstrated ‘excellent’ robustness (ICC > 0.9 and MAPD < 1 %), and 480 features (15.4 %) demonstrated ‘good’ robustness (ICC > 0.75 and MAPD < 5 %). PET imaging provided more features with excellent robustness than T2 and ADC. RF models showed strong predictive power for BCR with a mean area under the receiver-operator-characteristics curve (AUC) of 0.89 (range 0.85–0.93).