Abstract
Space agencies execute complex satellite operations that need to be supported by the technical knowledge contained in their extensive information systems. Knowledge bases (KB) are an effective way of storing and accessing such information at scale. In this work we present a system, developed for the Euro-pean Space Agency (ESA), that can answer complex natural language queries, to support engineers in accessing the information contained in a KB that models the orbital space debris environment. Our system is based on
a pipeline which first generates a sequence of basic database operations, called a sketch, from a natural language question, then specializes the sketch into a concrete query program with mentions of entities, attributes and relations, and finally executes the program against the database. This pipeline decomposition approach enables us to train the system by lever-aging out-of-domain data and semi-synthetic data generated by GPT-3, thus reducing over-fitting and shortcut learning even with limited amount of in-domain training data. Our code can be found at https://github.com/PaulDrm/DISCOSQA.
a pipeline which first generates a sequence of basic database operations, called a sketch, from a natural language question, then specializes the sketch into a concrete query program with mentions of entities, attributes and relations, and finally executes the program against the database. This pipeline decomposition approach enables us to train the system by lever-aging out-of-domain data and semi-synthetic data generated by GPT-3, thus reducing over-fitting and shortcut learning even with limited amount of in-domain training data. Our code can be found at https://github.com/PaulDrm/DISCOSQA.
Original language | English |
---|---|
Title of host publication | Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics |
Subtitle of host publication | Volume 5: Industry Track |
Place of Publication | Stroudsburg, PA |
Pages | 487-499 |
Number of pages | 13 |
ISBN (Electronic) | 9781959429685 |
Publication status | Published - 31 Jul 2023 |
Event | The 61st Annual Meeting of the Association for Computational Linguistics - Toronto Duration: 9 Jul 2023 → 15 Jul 2023 |
Conference
Conference | The 61st Annual Meeting of the Association for Computational Linguistics |
---|---|
City | Toronto |
Period | 9/07/23 → 15/07/23 |
Keywords
- space debris
- knowledge base
- complex natural language queries