Abstract
Processing information locked within clinical health records is a challenging task that remains an active area of research in biomedical NLP. In this work, we evaluate a broad set of machine learning techniques ranging from simple RNNs to specialised transformers such as BioBERT on a dataset containing clinical notes along with a set of annotations indicating whether a sample is cancer-related or not. Furthermore, we specifically employ efficient fine-tuning methods from NLP, namely, bottleneck adapters and prompt tuning, to adapt the models to our specialised task. Our evaluations suggest that fine-tuning a frozen BERT model pre-trained on natural language and with bottleneck adapters outperforms all other strategies, including full fine-tuning of the specialised BioBERT model. Based on our findings, we suggest that using bottleneck adapters in low-resource situations with limited access to labelled data or processing capacity could be a viable strategy in biomedical text mining. The code used in the experiments are going to be made available at [LINK ANONYMIZED].
| Original language | English |
|---|---|
| Title of host publication | BioNLP 2023 - BioNLP and BioNLP-ST, Proceedings of the Workshop |
| Editors | Dina Demner-fushman, Sophia Ananiadou, Kevin Cohen |
| Pages | 62-78 |
| Number of pages | 17 |
| ISBN (Electronic) | 9781959429852 |
| DOIs | |
| Publication status | Published - 31 Jul 2023 |
| Externally published | Yes |
| Event | 22nd Workshop on Biomedical Natural Language Processing and BioNLP Shared Tasks, BioNLP 2023 - Toronto, Canada Duration: 13 Jul 2023 → … |
Publication series
| Name | Proceedings of the Annual Meeting of the Association for Computational Linguistics |
|---|---|
| ISSN (Print) | 0736-587X |
Conference
| Conference | 22nd Workshop on Biomedical Natural Language Processing and BioNLP Shared Tasks, BioNLP 2023 |
|---|---|
| Country/Territory | Canada |
| City | Toronto |
| Period | 13/07/23 → … |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
Keywords
- Computational linguistics
- Natural language processing systems
- Clinical notes
Fingerprint
Dive into the research topics of 'Using Bottleneck Adapters to Identify Cancer in Clinical Notes under Low-Resource Constraints'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver