CMT in TREC-COVID Round 2: Mitigating the Generalization Gaps from Web
to Special Domain Search
- URL: http://arxiv.org/abs/2011.01580v1
- Date: Tue, 3 Nov 2020 09:10:48 GMT
- Title: CMT in TREC-COVID Round 2: Mitigating the Generalization Gaps from Web
to Special Domain Search
- Authors: Chenyan Xiong, Zhenghao Liu, Si Sun, Zhuyun Dai, Kaitao Zhang, Shi Yu,
Zhiyuan Liu, Hoifung Poon, Jianfeng Gao and Paul Bennett
- Abstract summary: This paper presents a search system to alleviate the special domain adaption problem.
The system utilizes the domain-adaptive pretraining and few-shot learning technologies to help neural rankers mitigate the domain discrepancy.
Our system performs the best among the non-manual runs in Round 2 of the TREC-COVID task.
- Score: 89.48123965553098
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Neural rankers based on deep pretrained language models (LMs) have been shown
to improve many information retrieval benchmarks. However, these methods are
affected by their the correlation between pretraining domain and target domain
and rely on massive fine-tuning relevance labels. Directly applying pretraining
methods to specific domains may result in suboptimal search quality because
specific domains may have domain adaption problems, such as the COVID domain.
This paper presents a search system to alleviate the special domain adaption
problem. The system utilizes the domain-adaptive pretraining and few-shot
learning technologies to help neural rankers mitigate the domain discrepancy
and label scarcity problems. Besides, we also integrate dense retrieval to
alleviate traditional sparse retrieval's vocabulary mismatch obstacle. Our
system performs the best among the non-manual runs in Round 2 of the TREC-COVID
task, which aims to retrieve useful information from scientific literature
related to COVID-19. Our code is publicly available at
https://github.com/thunlp/OpenMatch.
Related papers
- Unsupervised Domain Adaptation for Anatomical Landmark Detection [5.070344284426738]
We propose a novel framework for anatomical landmark detection under the setting of unsupervised domain adaptation (UDA)
The framework leverages self-training and domain adversarial learning to address the domain gap during adaptation.
Our experiments on cephalometric and lung landmark detection show the effectiveness of the method, which reduces the domain gap by a large margin and outperforms other UDA methods consistently.
arXiv Detail & Related papers (2023-08-25T10:22:13Z) - Multi-scale Feature Alignment for Continual Learning of Unlabeled
Domains [3.9498537297431167]
generative feature-driven image replay in conjunction with a dual-purpose discriminator enables the generation of images with realistic features for replay.
We present detailed ablation experiments studying our proposed method components and demonstrate a possible use-case of our continual UDA method for an unsupervised patch-based segmentation task.
arXiv Detail & Related papers (2023-02-02T18:19:01Z) - Domain Adaptation from Scratch [24.612696638386623]
We present a new learning setup, domain adaptation from scratch'', which we believe to be crucial for extending the reach of NLP to sensitive domains.
In this setup, we aim to efficiently annotate data from a set of source domains such that the trained model performs well on a sensitive target domain.
Our study compares several approaches for this challenging setup, ranging from data selection and domain adaptation algorithms to active learning paradigms.
arXiv Detail & Related papers (2022-09-02T05:55:09Z) - Target and Task specific Source-Free Domain Adaptive Image Segmentation [73.78898054277538]
We propose a two-stage approach for source-free domain adaptive image segmentation.
We focus on generating target-specific pseudo labels while suppressing high entropy regions.
In the second stage, we focus on adapting the network for task-specific representation.
arXiv Detail & Related papers (2022-03-29T17:50:22Z) - Stagewise Unsupervised Domain Adaptation with Adversarial Self-Training
for Road Segmentation of Remote Sensing Images [93.50240389540252]
Road segmentation from remote sensing images is a challenging task with wide ranges of application potentials.
We propose a novel stagewise domain adaptation model called RoadDA to address the domain shift (DS) issue in this field.
Experiment results on two benchmarks demonstrate that RoadDA can efficiently reduce the domain gap and outperforms state-of-the-art methods.
arXiv Detail & Related papers (2021-08-28T09:29:14Z) - Universal Cross-Domain Retrieval: Generalizing Across Classes and
Domains [27.920212868483702]
We propose SnMpNet, which incorporates two novel losses to account for the unseen classes and domains encountered during testing.
Specifically, we introduce a novel Semantic Neighborhood loss to bridge the knowledge gap between seen and unseen classes.
We also introduce a mix-up based supervision at image-level as well as semantic-level of the data for training with the Mixture Prediction loss.
arXiv Detail & Related papers (2021-08-18T19:21:04Z) - Cross-domain Contrastive Learning for Unsupervised Domain Adaptation [108.63914324182984]
Unsupervised domain adaptation (UDA) aims to transfer knowledge learned from a fully-labeled source domain to a different unlabeled target domain.
We build upon contrastive self-supervised learning to align features so as to reduce the domain discrepancy between training and testing sets.
arXiv Detail & Related papers (2021-06-10T06:32:30Z) - Contrastive Learning and Self-Training for Unsupervised Domain
Adaptation in Semantic Segmentation [71.77083272602525]
UDA attempts to provide efficient knowledge transfer from a labeled source domain to an unlabeled target domain.
We propose a contrastive learning approach that adapts category-wise centroids across domains.
We extend our method with self-training, where we use a memory-efficient temporal ensemble to generate consistent and reliable pseudo-labels.
arXiv Detail & Related papers (2021-05-05T11:55:53Z) - Mind the Gap: Enlarging the Domain Gap in Open Set Domain Adaptation [65.38975706997088]
Open set domain adaptation (OSDA) assumes the presence of unknown classes in the target domain.
We show that existing state-of-the-art methods suffer a considerable performance drop in the presence of larger domain gaps.
We propose a novel framework to specifically address the larger domain gaps.
arXiv Detail & Related papers (2020-03-08T14:20:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.