Combining astrophysical datasets with CRUMB

Fiona A. M. Porter, Anna M. M. Scaife

Research output: Contribution to conferencePaperpeer-review

Abstract

At present, the field of astronomical machine learning lacks widely-used benchmarking datasets; most research employs custom-made datasets which are often not publicly released, making comparisons between models difficult. In this paper we present CRUMB, a publicly-available image dataset of Fanaroff-Riley galaxies constructed from four "parent" datasets extant in the literature. In addition to providing the largest image dataset of these galaxies, CRUMB uses a two-tier labelling system: a "basic" label for classification and a "complete" label which provides the original class labels used in the four parent datasets, allowing for disagreements in an image's class between different datasets to be preserved and selective access to sources from any desired combination of the parent datasets.
Original languageEnglish
Number of pages6
Publication statusPublished - 17 Nov 2023
Externally publishedYes
EventAdvances in Neural Information Processing Systems - New Orleans, United States
Duration: 10 Dec 202316 Dec 2023

Conference

ConferenceAdvances in Neural Information Processing Systems
Abbreviated titleNIPS
Country/TerritoryUnited States
CityNew Orleans
Period10/12/2316/12/23

Keywords

  • astro-ph.IM
  • astro-ph.GA

Fingerprint

Dive into the research topics of 'Combining astrophysical datasets with CRUMB'. Together they form a unique fingerprint.

Cite this