Using Bottleneck Adapters to Identify Cancer in Clinical Notes under Low-Resource Constraints

ISARIC Clinical Characterisation Group, Omid Rohanian, Hannah Jauncey, Mohammadmahdi Nouriborji, Vinod Kumar Chauhan, Bronner P. Gonçalves, Christiana Kartsonaki, Laura Merson, David Clifton

Research output: Chapter in Book/Report/Conference proceedingConference contribution book

Abstract

Processing information locked within clinical health records is a challenging task that remains an active area of research in biomedical NLP. In this work, we evaluate a broad set of machine learning techniques ranging from simple RNNs to specialised transformers such as BioBERT on a dataset containing clinical notes along with a set of annotations indicating whether a sample is cancer-related or not. Furthermore, we specifically employ efficient fine-tuning methods from NLP, namely, bottleneck adapters and prompt tuning, to adapt the models to our specialised task. Our evaluations suggest that fine-tuning a frozen BERT model pre-trained on natural language and with bottleneck adapters outperforms all other strategies, including full fine-tuning of the specialised BioBERT model. Based on our findings, we suggest that using bottleneck adapters in low-resource situations with limited access to labelled data or processing capacity could be a viable strategy in biomedical text mining. The code used in the experiments are going to be made available at [LINK ANONYMIZED].

Original languageEnglish
Title of host publicationBioNLP 2023 - BioNLP and BioNLP-ST, Proceedings of the Workshop
EditorsDina Demner-fushman, Sophia Ananiadou, Kevin Cohen
Pages62-78
Number of pages17
ISBN (Electronic)9781959429852
DOIs
Publication statusPublished - 31 Jul 2023
Externally publishedYes
Event22nd Workshop on Biomedical Natural Language Processing and BioNLP Shared Tasks, BioNLP 2023 - Toronto, Canada
Duration: 13 Jul 2023 → …

Publication series

NameProceedings of the Annual Meeting of the Association for Computational Linguistics
ISSN (Print)0736-587X

Conference

Conference22nd Workshop on Biomedical Natural Language Processing and BioNLP Shared Tasks, BioNLP 2023
Country/TerritoryCanada
CityToronto
Period13/07/23 → …

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • Computational linguistics
  • Natural language processing systems
  • Clinical notes

Fingerprint

Dive into the research topics of 'Using Bottleneck Adapters to Identify Cancer in Clinical Notes under Low-Resource Constraints'. Together they form a unique fingerprint.

Cite this