Prescriptive method for optimizing cost of data collection and annotation in machine learning of clinical ultrasound

Research output: Chapter in Book/Report/Conference proceedingConference contribution book

2 Citations (Scopus)
31 Downloads (Pure)

Abstract

Machine learning in medical ultrasound faces a major challenge: the prohibitive costs of producing and annotating clinical data. Optimizing the data collection and annotation will improve model training efficiency, reducing project cost and times. This paper prescribes a 2-phase method for cost optimization based on iterative accuracy/sample size predictions, and active learning for annotation optimization. Methods: Using public breast, fetal, and lung ultrasound datasets we can: Optimize data collection by statistically predicting accuracy for a desired dataset size; and optimize labeling efficiency using Active Learning, where predictions with lowest certainty were labelled manually using feedback. A practical case study on BUSI data was used to demonstrate the method prescribed in this work. Results: With small data subsets, ~10%, dataset size vs. final accuracy relations can be predicted with diminishing results after 50% usage. Manual annotation was reduced by ~10% using active learning to focus the annotation. Conclusion: This led to cost reductions of 50%-66%, depending on requirements and initial cost model, on BUSI dataset with a negligible accuracy drop of 3.75% from theoretical maximums.

Clinical Relevance— This work provides methodology to optimize dataset size and manual data labelling, this allows generation of cost-effective datasets, of interest to all, but particularly for financially limited trials and feasibility studies, Reducing the time burden on annotating clinicians.
Original languageEnglish
Title of host publication2023 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC)
Place of PublicationPiscataway, N.J.
PublisherIEEE
Number of pages4
ISBN (Print)9798350324471
DOIs
Publication statusPublished - 11 Dec 2023
Event45th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC 2023) - International Convention Centre, Sydney, Australia
Duration: 24 Jul 202327 Jul 2023

Conference

Conference45th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC 2023)
Country/TerritoryAustralia
CitySydney
Period24/07/2327/07/23

Keywords

  • machine learning
  • ultrasound
  • deep learning
  • neural networks

Fingerprint

Dive into the research topics of 'Prescriptive method for optimizing cost of data collection and annotation in machine learning of clinical ultrasound'. Together they form a unique fingerprint.

Cite this