Open-Domain Question-Answering for COVID-19 and Other Emergent Domains
- URL: http://arxiv.org/abs/2110.06962v1
- Date: Wed, 13 Oct 2021 18:06:14 GMT
- Title: Open-Domain Question-Answering for COVID-19 and Other Emergent Domains
- Authors: Sharon Levy, Kevin Mo, Wenhan Xiong, William Yang Wang
- Abstract summary: We present an open-domain question-answering system for the emergent biomedical domain of COVID-19.
Despite the small data size, we are able to successfully train the system to retrieve answers from a large-scale corpus of published COVID-19 scientific papers.
- Score: 61.615197623034085
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Since late 2019, COVID-19 has quickly emerged as the newest biomedical
domain, resulting in a surge of new information. As with other emergent
domains, the discussion surrounding the topic has been rapidly changing,
leading to the spread of misinformation. This has created the need for a public
space for users to ask questions and receive credible, scientific answers. To
fulfill this need, we turn to the task of open-domain question-answering, which
we can use to efficiently find answers to free-text questions from a large set
of documents. In this work, we present such a system for the emergent domain of
COVID-19. Despite the small data size available, we are able to successfully
train the system to retrieve answers from a large-scale corpus of published
COVID-19 scientific papers. Furthermore, we incorporate effective re-ranking
and question-answering techniques, such as document diversity and multiple
answer spans. Our open-domain question-answering system can further act as a
model for the quick development of similar systems that can be adapted and
modified for other developing emergent domains.
Related papers
- Augmenting Scientific Creativity with Retrieval across Knowledge Domains [31.735564282883352]
We develop an exploratory search system in which end-users can select a portion of text core to their interest from a paper abstract.
Our case studies with scientists uncover opportunities and design implications for systems aimed at facilitating cross-domain exploration and inspiration.
arXiv Detail & Related papers (2022-06-02T22:55:51Z) - Addressing Issues of Cross-Linguality in Open-Retrieval Question
Answering Systems For Emergent Domains [67.99403521976058]
We demonstrate a cross-lingual open-retrieval question answering system for the emergent domain of COVID-19.
Our system adopts a corpus of scientific articles to ensure that retrieved documents are reliable.
We show that a deep semantic retriever greatly benefits from training on our English-to-all data and significantly outperforms a BM25 baseline in the cross-lingual setting.
arXiv Detail & Related papers (2022-01-26T19:27:32Z) - Recent Advances in Automated Question Answering In Biomedical Domain [0.06922389632860546]
In the past few decades there has been a proliferation of acquisition of knowledge and consequently there has been an exponential growth in new scientific articles in the field of biomedicine.
It has become difficult to keep track of all the information in the domain, even for domain experts.
With the improvements in commercial search engines, users can type in their queries and get a small set of documents most relevant for answering their query.
This has necessitated the development of efficient QA systems which aim to find exact and precise answers to user provided natural language questions.
arXiv Detail & Related papers (2021-11-10T20:51:29Z) - End-to-End QA on COVID-19: Domain Adaptation with Synthetic Training [13.731352294133211]
End-to-end question answering requires both information retrieval and machine reading comprehension.
Recent work has successfully trained neural IR systems using only supervised question answering (QA) examples from open-domain datasets.
We combine our neural IR and MRC systems and show significant improvements in end-to-end QA on the CORD-19 collection.
arXiv Detail & Related papers (2020-12-02T18:59:59Z) - CMT in TREC-COVID Round 2: Mitigating the Generalization Gaps from Web
to Special Domain Search [89.48123965553098]
This paper presents a search system to alleviate the special domain adaption problem.
The system utilizes the domain-adaptive pretraining and few-shot learning technologies to help neural rankers mitigate the domain discrepancy.
Our system performs the best among the non-manual runs in Round 2 of the TREC-COVID task.
arXiv Detail & Related papers (2020-11-03T09:10:48Z) - Knowledge-Aided Open-Domain Question Answering [58.712857964048446]
We propose a knowledge-aided open-domain QA (KAQA) method which targets at improving relevant document retrieval and answer reranking.
During document retrieval, a candidate document is scored by considering its relationship to the question and other documents.
During answer reranking, a candidate answer is reranked using not only its own context but also the clues from other documents.
arXiv Detail & Related papers (2020-06-09T13:28:57Z) - Knowledge Graph Simple Question Answering for Unseen Domains [9.263766921991452]
We propose a data-centric domain adaptation framework that is applicable to new domains.
We use distant supervision to extract a set of keywords that express each relation of the unseen domain.
Our framework significantly improves over zero-shot baselines and is robust across domains.
arXiv Detail & Related papers (2020-05-25T11:34:54Z) - Dense Passage Retrieval for Open-Domain Question Answering [49.028342823838486]
We show that retrieval can be practically implemented using dense representations alone.
Our dense retriever outperforms a strong Lucene-BM25 system largely by 9%-19% absolute in terms of top-20 passage retrieval accuracy.
arXiv Detail & Related papers (2020-04-10T04:53:17Z) - Mind the Gap: Enlarging the Domain Gap in Open Set Domain Adaptation [65.38975706997088]
Open set domain adaptation (OSDA) assumes the presence of unknown classes in the target domain.
We show that existing state-of-the-art methods suffer a considerable performance drop in the presence of larger domain gaps.
We propose a novel framework to specifically address the larger domain gaps.
arXiv Detail & Related papers (2020-03-08T14:20:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.