Denmark's Participation in the Search Engine TREC COVID-19 Challenge:
Lessons Learned about Searching for Precise Biomedical Scientific Information
on COVID-19
- URL: http://arxiv.org/abs/2011.12684v2
- Date: Thu, 26 Nov 2020 12:42:21 GMT
- Title: Denmark's Participation in the Search Engine TREC COVID-19 Challenge:
Lessons Learned about Searching for Precise Biomedical Scientific Information
on COVID-19
- Authors: Lucas Chaves Lima, Casper Hansen, Christian Hansen, Dongsheng Wang,
Maria Maistro, Birger Larsen, Jakob Grue Simonsen and Christina Lioma
- Abstract summary: University of Copenhagen and Aalborg University participated in the 2020 TREC-COVID Challenge.
The aim of the competition was to find the best search engine strategy for retrieving precise biomedical scientific information on COVID-19.
- Score: 22.96824848167245
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This report describes the participation of two Danish universities,
University of Copenhagen and Aalborg University, in the international search
engine competition on COVID-19 (the 2020 TREC-COVID Challenge) organised by the
U.S. National Institute of Standards and Technology (NIST) and its Text
Retrieval Conference (TREC) division. The aim of the competition was to find
the best search engine strategy for retrieving precise biomedical scientific
information on COVID-19 from the largest, at that point in time, dataset of
curated scientific literature on COVID-19 -- the COVID-19 Open Research Dataset
(CORD-19). CORD-19 was the result of a call to action to the tech community by
the U.S. White House in March 2020, and was shortly thereafter posted on Kaggle
as an AI competition by the Allen Institute for AI, the Chan Zuckerberg
Initiative, Georgetown University's Center for Security and Emerging
Technology, Microsoft, and the National Library of Medicine at the US National
Institutes of Health. CORD-19 contained over 200,000 scholarly articles (of
which more than 100,000 were with full text) about COVID-19, SARS-CoV-2, and
related coronaviruses, gathered from curated biomedical sources. The TREC-COVID
challenge asked for the best way to (a) retrieve accurate and precise
scientific information, in response to some queries formulated by biomedical
experts, and (b) rank this information decreasingly by its relevance to the
query.
In this document, we describe the TREC-COVID competition setup, our
participation to it, and our resulting reflections and lessons learned about
the state-of-art technology when faced with the acute task of retrieving
precise scientific information from a rapidly growing corpus of literature, in
response to highly specialised queries, in the middle of a pandemic.
Related papers
- Medical-VLBERT: Medical Visual Language BERT for COVID-19 CT Report
Generation With Alternate Learning [70.71564065885542]
We propose to use the medical visual language BERT (Medical-VLBERT) model to identify the abnormality on the COVID-19 scans.
This model adopts an alternate learning strategy with two procedures that are knowledge pretraining and transferring.
For automatic medical report generation on the COVID-19 cases, we constructed a dataset of 368 medical findings in Chinese and 1104 chest CT scans.
arXiv Detail & Related papers (2021-08-11T07:12:57Z) - Rapid COVID-19 Risk Screening by Eye-region Manifestations [64.6260390977642]
There are more and more ocular manifestations that have been reported in the COVID-19 patients as growing clinical evidence.
We propose a new fast screening method of analyzing the eye-region images, captured by common CCD and CMOS cameras.
Our model for COVID-19 rapid prescreening have the merits of the lower cost, fully self-performed, non-invasive, importantly real-time, and thus enables the continuous health surveillance.
arXiv Detail & Related papers (2021-06-12T01:56:10Z) - COVIDScholar: An automated COVID-19 research aggregation and analysis
platform [0.0]
As of October 2020, over 81,000 COVID-19 related scientific papers have been released, at a rate of over 250 per day.
This has created a challenge to traditional methods of engagement with the research literature.
We present an analysis of trends in COVID-19 research over the course of 2020.
arXiv Detail & Related papers (2020-12-07T18:17:11Z) - Repurposing TREC-COVID Annotations to Answer the Key Questions of
CORD-19 [4.847073702809032]
coronavirus disease 2019 (COVID-19) began in Wuhan, China in late 2019 and to date has infected over 14M people worldwide.
White House aggregated over 200,000 journal articles related to a variety of coronaviruses and tasked the community with answering key questions related to the corpus.
We set out to repurpose the relevancy annotations for TREC-COVID tasks to identify journal articles in CORD-19 which are relevant to the key questions posed by CORD-19.
arXiv Detail & Related papers (2020-08-27T19:51:07Z) - Understanding the temporal evolution of COVID-19 research through
machine learning and natural language processing [66.63200823918429]
The outbreak of the novel coronavirus disease 2019 (COVID-19), caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has been continuously affecting human lives and communities around the world.
We used multiple data sources, i.e., PubMed and ArXiv, and built several machine learning models to characterize the landscape of current COVID-19 research.
Our findings confirm the types of research available in PubMed and ArXiv differ significantly, with the former exhibiting greater diversity in terms of COVID-19 related issues.
arXiv Detail & Related papers (2020-07-22T18:02:39Z) - A Survey on Applications of Artificial Intelligence in Fighting Against
COVID-19 [75.84689958489724]
The COVID-19 pandemic caused by the SARS-CoV-2 virus has spread rapidly worldwide, leading to a global outbreak.
As a powerful tool against COVID-19, artificial intelligence (AI) technologies are widely used in combating this pandemic.
This survey presents medical and AI researchers with a comprehensive view of the existing and potential applications of AI technology in combating COVID-19.
arXiv Detail & Related papers (2020-07-04T22:48:15Z) - CO-Search: COVID-19 Information Retrieval with Semantic Search, Question
Answering, and Abstractive Summarization [53.67205506042232]
CO-Search is a retriever-ranker semantic search engine designed to handle complex queries over the COVID-19 literature.
To account for the domain-specific and relatively limited dataset, we generate a bipartite graph of document paragraphs and citations.
We evaluate our system on the data of the TREC-COVID information retrieval challenge.
arXiv Detail & Related papers (2020-06-17T01:32:48Z) - What country, university or research institute, performed the best on
COVID-19? Bibliometric analysis of scientific literature [12.168506735496715]
We conduct data mining to discover the countries, universities and companies, that produced the most research on Covid-19 since the pandemic started.
We present some interesting findings, but despite analysing all available records on COVID-19 from the Web of Science Core Collection, we failed to reach any significant conclusions.
arXiv Detail & Related papers (2020-05-19T14:36:12Z) - CORD-19: The COVID-19 Open Research Dataset [28.556291682259477]
CORD-19 is a growing resource of scientific papers on COVID-19 and related historical coronavirus research.
Since its release, CORD-19 has been downloaded over 200K times and has served as the basis of many COVID-19 text mining and discovery systems.
arXiv Detail & Related papers (2020-04-22T17:10:18Z) - Mapping the Landscape of Artificial Intelligence Applications against
COVID-19 [59.30734371401316]
COVID-19, the disease caused by the SARS-CoV-2 virus, has been declared a pandemic by the World Health Organization.
We present an overview of recent studies using Machine Learning and, more broadly, Artificial Intelligence to tackle many aspects of the COVID-19 crisis.
arXiv Detail & Related papers (2020-03-25T12:30:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.