Author Block: C. JEON, J. PARK; Seoul/KR
Purpose: Automated fracture detection via deep learning-based object detection can enhance diagnostic efficiency from radiographs. While one-stage models offer real-time processing, two-stage models provide higher precision. This study compares YOLOv8, RetinaNet, and Faster R-CNN on the FracAtlas X-ray dataset to determine the model offering the optimal trade-off between accuracy and practical applicability for clinical use.
Methods or Background: Object detection models were trained on the FracAtlas dataset (4,083 radiographs, 922 annotated fractures) using Python 3.9.0 with Detectron2 and Ultralytics frameworks. Model performance was assessed via confusion matrix-based indicators (Precision, Recall, F1-score), mean Average Precision at IoU 0.50 (
[email protected]), and inference speed (frames per second, FPS).
Results or Findings: Faster R-CNN achieved the highest diagnostic accuracy with
[email protected] = 0.82, Precision = 0.92, Recall = 0.75, and F1-score = 0.82, but had a limited inference speed of 5 FPS. YOLOv8 demonstrated real-time performance with 45 FPS but lower accuracy (
[email protected] = 0.62, Recall = 0.57), particularly struggling with subtle fracture detection. RetinaNet produced intermediate results, yielding
[email protected] = 0.67 and 10 FPS. The superior feature extraction of Faster R-CNN underscores the clinical benefit of two-stage approaches when diagnostic accuracy is critical.
Conclusion: Faster R-CNN delivers the highest fracture detection accuracy, making it well-suited for clinical use, though its low FPS restricts real-time deployment. One-stage models excel in speed but fall short for complex diagnostic demands. Future research should explore lightweight optimisation of Faster R-CNN and larger datasets to enhance generalizability. Two-stage models remain valuable for high-precision medical image analysis.
Limitations: This study was limited to a single FracAtlas X-ray image dataset and an imbalanced fractured-to-non-fractured ratio (1:4.6), which may affect model performance.
Funding for this study: No funding was received for this study.
Has your study been approved by an ethics committee? Not applicable
Ethics committee - additional information: