Abstract
Machine learning in medical ultrasound faces a major challenge: the prohibitive costs of producing and annotating clinical data. Optimizing the data collection and annotation will improve model training efficiency, reducing project cost and times. This paper prescribes a 2-phase method for cost optimization based on iterative accuracy/sample size predictions, and active learning for annotation optimization. Methods: Using public breast, fetal, and lung ultrasound datasets we can: Optimize data collection by statistically predicting accuracy for a desired dataset size; and optimize labeling efficiency using Active Learning, where predictions with lowest certainty were labelled manually using feedback. A practical case study on BUSI data was used to demonstrate the method prescribed in this work. Results: With small data subsets, ~10%, dataset size vs. final accuracy relations can be predicted with diminishing results after 50% usage. Manual annotation was reduced by ~10% using active learning to focus the annotation. Conclusion: This led to cost reductions of 50%-66%, depending on requirements and initial cost model, on BUSI dataset with a negligible accuracy drop of 3.75% from theoretical maximums.
Clinical Relevance— This work provides methodology to optimize dataset size and manual data labelling, this allows generation of cost-effective datasets, of interest to all, but particularly for financially limited trials and feasibility studies, Reducing the time burden on annotating clinicians.
Clinical Relevance— This work provides methodology to optimize dataset size and manual data labelling, this allows generation of cost-effective datasets, of interest to all, but particularly for financially limited trials and feasibility studies, Reducing the time burden on annotating clinicians.
Original language | English |
---|---|
Title of host publication | 2023 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC) |
Place of Publication | Piscataway, N.J. |
Publisher | IEEE |
Number of pages | 4 |
ISBN (Print) | 9798350324471 |
DOIs | |
Publication status | Published - 11 Dec 2023 |
Event | 45th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC 2023) - International Convention Centre, Sydney, Australia Duration: 24 Jul 2023 → 27 Jul 2023 |
Conference
Conference | 45th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC 2023) |
---|---|
Country/Territory | Australia |
City | Sydney |
Period | 24/07/23 → 27/07/23 |
Keywords
- machine learning
- ultrasound
- deep learning
- neural networks