Machine learning for literature classification during systematic literature review – establishing the minimum threshold for labelling papers

Vivek Venugopal, Aylin Ates, Peter McKiernan

Research output: Contribution to conferencePaperpeer-review

121 Downloads (Pure)

Abstract

Taking inspiration from the use of machine learning in the field of medicine for
literature classification, this paper explores the use of machine learning to aid the
classification of documents during systematic literature reviews in the field of
business and management studies. The performances of two machine learning
models, SVM and Logistic regression, are compared. The dataset used is a labelled dataset on weak signal literature. The data is iteratively split into training and testing sets with the aim of minimising the training set. The models were evaluated on Sensitivity (Recall), Precision, Specificity, Accuracy, and f1_Score to find the optimal training split. The optimal value was found to be between 40% to 50%. Which meant only 40% to 50% of the dataset needed to be labelled for the machine learning model to predict the labels for the rest of the dataset. Even though machine learning will not eliminate the labour involved in systematic literature reviews, it will save the amount of labour involved and the amount of time required.
Original languageEnglish
Number of pages18
Publication statusPublished - 2 Sept 2022
Event36th Annual Conference of the British Academy of Management: The British Academy of Management Conference 2022 - Alliance Manchester Business School, Manchester, United Kingdom
Duration: 31 Aug 20222 Sept 2022
https://www.bam.ac.uk/events-landing/conference.html
https://www.bam.ac.uk/events-landing/past-conferences/2022-conference.html

Conference

Conference36th Annual Conference of the British Academy of Management
Abbreviated titleBAM 2022
Country/TerritoryUnited Kingdom
CityManchester
Period31/08/222/09/22
Internet address

Keywords

  • support vector machine (SVM)
  • machine learning (ML)
  • systematic literature review
  • literature classification

Fingerprint

Dive into the research topics of 'Machine learning for literature classification during systematic literature review – establishing the minimum threshold for labelling papers'. Together they form a unique fingerprint.

Cite this