Knowledge Base Question Answering for Space Debris Queries
- URL: http://arxiv.org/abs/2305.19734v1
- Date: Wed, 31 May 2023 10:55:41 GMT
- Title: Knowledge Base Question Answering for Space Debris Queries
- Authors: Paul Darm, Antonio Valerio Miceli-Barone, Shay B. Cohen, Annalisa Riccardi
- Abstract summary: We present a system, developed for the European Space Agency (ESA), that can answer complex natural language queries.
Our system is based on a pipeline which first generates a sequence of basic database operations, called a program sketch, from a natural language question.
This pipeline decomposition approach enables us to train the system by leveraging out-of-domain data and semi-synthetic data generated by GPT-3.
- Score: 24.37269129187282
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Space agencies execute complex satellite operations that need to be supported
by the technical knowledge contained in their extensive information systems.
Knowledge bases (KB) are an effective way of storing and accessing such
information at scale. In this work we present a system, developed for the
European Space Agency (ESA), that can answer complex natural language queries,
to support engineers in accessing the information contained in a KB that models
the orbital space debris environment. Our system is based on a pipeline which
first generates a sequence of basic database operations, called a program
sketch, from a natural language question, then specializes the sketch into a
concrete query program with mentions of entities, attributes and relations, and
finally executes the program against the database. This pipeline decomposition
approach enables us to train the system by leveraging out-of-domain data and
semi-synthetic data generated by GPT-3, thus reducing overfitting and shortcut
learning even with a limited amount of in-domain training data. Our code can be
found at https://github.com/PaulDrm/DISCOSQA.
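The three stages named in the abstract (sketch generation, specialization with arguments, execution) can be illustrated with a minimal toy sketch. This is not the authors' implementation: the operation names (`FindAll`, `FilterStr`, `Count`), the in-memory `KB`, and all function names are hypothetical, and the two stages that the abstract describes as trained components are replaced here by hard-coded stubs.

```python
from dataclasses import dataclass

# Toy stand-in for the knowledge base. Every entity, attribute, and value is invented.
KB = [
    {"name": "Object A", "type": "rocket body", "mass_kg": 1200.0},
    {"name": "Object B", "type": "payload", "mass_kg": 300.0},
    {"name": "Object C", "type": "rocket body", "mass_kg": 800.0},
]


@dataclass
class Step:
    """One operation of a concrete query program, with its arguments."""
    op: str
    args: tuple = ()


def generate_sketch(question: str) -> list[str]:
    """Stage 1 (stub): map a question to a sequence of argument-free operations.

    A real system would use a trained model; this stub returns a fixed sketch.
    """
    return ["FindAll", "FilterStr", "Count"]


def specialize(sketch: list[str], question: str) -> list[Step]:
    """Stage 2 (stub): attach entity/attribute/relation arguments to each operation."""
    hypothetical_args = {"FilterStr": ("type", "rocket body")}
    return [Step(op, hypothetical_args.get(op, ())) for op in sketch]


def execute(program: list[Step], kb: list[dict]):
    """Stage 3: run the concrete program against the (toy) database."""
    state = None
    for step in program:
        if step.op == "FindAll":
            state = list(kb)
        elif step.op == "FilterStr":
            key, value = step.args
            state = [entity for entity in state if entity[key] == value]
        elif step.op == "Count":
            state = len(state)
        else:
            raise ValueError(f"unknown operation: {step.op}")
    return state


question = "How many rocket bodies are in the database?"
sketch = generate_sketch(question)
program = specialize(sketch, question)
answer = execute(program, KB)
print(answer)
```

Because the sketch carries no domain-specific arguments, the first stage is the part that could plausibly be trained on out-of-domain data, which is the motivation the abstract gives for the decomposition.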
Related papers
- Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey [59.3507264893654]
Issue resolution is a complex Software Engineering task integral to real-world development. Benchmarks like SWE-bench revealed this task as profoundly difficult for large language models. This paper presents a systematic survey of this emerging domain.
arXiv Detail & Related papers (2026-01-15T18:55:03Z) - CEDAR: Context Engineering for Agentic Data Science [3.1662160826016756]
CEDAR is an application for automating data science tasks with an agentic setup. We show that these can be alleviated via effective context engineering. Fault tolerance and context management are introduced via iterative code generation and smart history rendering.
arXiv Detail & Related papers (2026-01-10T16:05:04Z) - Ground-Compose-Reinforce: Grounding Language in Agentic Behaviours using Limited Data [17.614395125762304]
Grounding language in perception and action is a key challenge when building situated agents that can interact with humans, or other agents, via language. We propose Ground-Compose-Reinforce, an end-to-end, neurosymbolic framework for training RL agents directly from high-level task specifications.
arXiv Detail & Related papers (2025-07-14T19:05:15Z) - Spatial Knowledge Graph-Guided Multimodal Synthesis [78.11669780958657]
We introduce a novel multimodal synthesis approach guided by spatial knowledge graphs, grounded in the concept of knowledge-to-data generation. In experiments, data synthesized from diverse types of spatial knowledge, including direction and distance, enhance the spatial perception and reasoning abilities of MLLMs markedly. We hope that the idea of knowledge-based data synthesis can advance the development of spatial intelligence.
arXiv Detail & Related papers (2025-05-28T17:50:21Z) - AI Agents for Ground-Based Gamma Astronomy [0.0]
We present two prototypes that integrate with the Cherenkov Telescope Array Observatory pipelines for operations and offline data analysis.
These AI agents offer a transformative approach to system management and data analysis by automating complex tasks and providing intelligent assistance.
arXiv Detail & Related papers (2025-03-02T09:55:54Z) - Boost, Disentangle, and Customize: A Robust System2-to-System1 Pipeline for Code Generation [58.799397354312596]
Large language models (LLMs) have demonstrated remarkable capabilities in various domains, particularly in system 1 tasks.
Recent research on System2-to-System1 methods has surged, exploring System 2 reasoning knowledge via inference-time computation.
In this paper, we focus on code generation, which is a representative System 2 task, and identify two primary challenges.
arXiv Detail & Related papers (2025-02-18T03:20:50Z) - The S2 Hierarchical Discrete Global Grid as a Nexus for Data Representation, Integration, and Querying Across Geospatial Knowledge Graphs [4.358099505067763]
This paper outlines the implementation of Google's S2 Geometry within KnowWhereGraph.
Ultimately, this work demonstrates the potential of DGGS frameworks, particularly S2, for building scalable GeoKGs.
arXiv Detail & Related papers (2024-10-18T18:30:05Z) - Developing PUGG for Polish: A Modern Approach to KBQA, MRC, and IR Dataset Construction [43.045596895389345]
We introduce a modern, semi-automated approach for creating datasets, encompassing tasks such as KBQA, Machine Reading Comprehension (MRC), and Information Retrieval (IR).
We provide a comprehensive implementation, insightful findings, detailed statistics, and evaluation of baseline models.
arXiv Detail & Related papers (2024-08-05T09:23:49Z) - Relational Database Augmented Large Language Model [59.38841050766026]
Large language models (LLMs) excel in many natural language processing (NLP) tasks.
They can only incorporate new knowledge through training or supervised fine-tuning processes.
This precise, up-to-date, and private information is typically stored in relational databases.
arXiv Detail & Related papers (2024-07-21T06:19:10Z) - How to Understand Whole Software Repository? [64.19431011897515]
An excellent understanding of the whole repository will be the critical path to Automatic Software Engineering (ASE).
We develop a novel method named RepoUnderstander by guiding agents to comprehensively understand the whole repositories.
To better utilize the repository-level knowledge, we guide the agents to summarize, analyze, and plan.
arXiv Detail & Related papers (2024-06-03T15:20:06Z) - DIVKNOWQA: Assessing the Reasoning Ability of LLMs via Open-Domain Question Answering over Knowledge Base and Text [73.68051228972024]
Large Language Models (LLMs) have exhibited impressive generation capabilities, but they suffer from hallucinations when relying on their internal knowledge.
Retrieval-augmented LLMs have emerged as a potential solution to ground LLMs in external knowledge.
arXiv Detail & Related papers (2023-10-31T04:37:57Z) - A Simple Baseline for Knowledge-Based Visual Question Answering [78.00758742784532]
This paper addresses the problem of Knowledge-Based Visual Question Answering (KB-VQA).
Our main contribution in this paper is to propose a much simpler and readily reproducible pipeline.
Contrary to recent approaches, our method is training-free, does not require access to external databases or APIs, and achieves state-of-the-art accuracy on the OK-VQA and A-OK-VQA datasets.
arXiv Detail & Related papers (2023-10-20T15:08:17Z) - Semantic Web Enabled Geographic Question Answering Framework: GeoTR [0.0]
In this study, a question answering framework that converts Turkish natural language input into SPARQL queries in the geographical domain is proposed.
A novel Turkish ontology, which covers a 10th grade geography lesson named Spatial Synthesis Turkey, has been developed to be used as a linked data provider.
A hybrid system architecture that combines natural language processing techniques with linked data technologies to generate answers is also proposed.
arXiv Detail & Related papers (2023-01-11T23:20:43Z) - SpaceQA: Answering Questions about the Design of Space Missions and Space Craft Concepts [57.012600276711005]
We present SpaceQA, to the best of our knowledge the first open-domain QA system in Space mission design.
SpaceQA is part of an initiative by the European Space Agency (ESA) to facilitate the access, sharing and reuse of information about Space mission design.
arXiv Detail & Related papers (2022-10-07T09:41:39Z) - Design and Development of Rule-based open-domain Question-Answering System on SQuAD v2.0 Dataset [0.0]
We have proposed a rule-based open-domain question-answering system which is capable of answering questions of any domain from a corresponding context passage.
We have used 1000 questions from the SQuAD 2.0 dataset for testing the developed system, and it gives satisfactory results.
arXiv Detail & Related papers (2022-03-27T07:51:18Z) - Data-to-Value: An Evaluation-First Methodology for Natural Language Projects [3.9378507882929554]
"Data to Value" (D2V) is a new methodology for big data text analytics projects.
It is guided by a detailed catalog of questions in order to avoid a disconnect between the big data text analytics project team and the topic.
arXiv Detail & Related papers (2022-01-19T17:04:52Z) - Differentiable Reasoning over a Virtual Knowledge Base [156.94984221342716]
We consider the task of answering complex multi-hop questions using a corpus as a virtual knowledge base (KB).
In particular, we describe a neural module, DrKIT, that traverses textual data like a KB, softly following paths of relations between mentions of entities in the corpus.
DrKIT is very efficient, processing 10-100x more queries per second than existing multi-hop systems.
arXiv Detail & Related papers (2020-02-25T03:13:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.