The Second Place Solution for ICCV2021 VIPriors Instance Segmentation
Challenge
- URL: http://arxiv.org/abs/2112.01072v1
- Date: Thu, 2 Dec 2021 09:23:02 GMT
- Title: The Second Place Solution for ICCV2021 VIPriors Instance Segmentation
Challenge
- Authors: Bo Yan, Fengliang Qi, Leilei Cao and Hongbin Wang
- Abstract summary: The Visual Inductive Priors(VIPriors) for Data-Efficient Computer Vision challenges ask competitors to train models from scratch in a data-deficient setting.
We introduce the technical details of our submission to the ICCV 2021 VIPriors instance segmentation challenge.
Our approach can achieve 40.2%AP@0.50:0.95 on the test set of ICCV 2021 VIPriors instance segmentation challenge.
- Score: 6.087398773657721
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The Visual Inductive Priors(VIPriors) for Data-Efficient Computer Vision
challenges ask competitors to train models from scratch in a data-deficient
setting. In this paper, we introduce the technical details of our submission to
the ICCV2021 VIPriors instance segmentation challenge. Firstly, we designed an
effective data augmentation method to improve the problem of data-deficient.
Secondly, we conducted some experiments to select a proper model and made some
improvements for this task. Thirdly, we proposed an effective training strategy
which can improve the performance. Experimental results demonstrate that our
approach can achieve a competitive result on the test set. According to the
competition rules, we do not use any external image or video data and
pre-trained weights. The implementation details above are described in section
2 and section 3. Finally, our approach can achieve 40.2\%AP@0.50:0.95 on the
test set of ICCV2021 VIPriors instance segmentation challenge.
Related papers
- Leveraging the Power of Data Augmentation for Transformer-based Tracking [64.46371987827312]
We propose two data augmentation methods customized for tracking.
First, we optimize existing random cropping via a dynamic search radius mechanism and simulation for boundary samples.
Second, we propose a token-level feature mixing augmentation strategy, which enables the model against challenges like background interference.
arXiv Detail & Related papers (2023-09-15T09:18:54Z) - Boosting Visual-Language Models by Exploiting Hard Samples [126.35125029639168]
HELIP is a cost-effective strategy tailored to enhance the performance of existing CLIP models.
Our method allows for effortless integration with existing models' training pipelines.
On comprehensive benchmarks, HELIP consistently boosts existing models to achieve leading performance.
arXiv Detail & Related papers (2023-05-09T07:00:17Z) - Task-Specific Data Augmentation and Inference Processing for VIPriors
Instance Segmentation Challenge [9.43662534739698]
We develop task-specific data augmentation strategy and task-specific inference processing strategy.
We demonstrate the applicability of proposed method on VIPriors Instance Challenge.
Experimental results demonstrate that proposed method can achieve a competitive result on the test set of 2022 VIPriors Instance Challenge.
arXiv Detail & Related papers (2022-11-21T09:15:30Z) - The Third Place Solution for CVPR2022 AVA Accessibility Vision and
Autonomy Challenge [12.37168905253371]
This paper introduces the technical details of our submission to the CVPR2022 AVA Challenge.
Firstly, we conducted some experiments to help employ proper model and data augmentation strategy for this task.
Secondly, an effective training strategy was applied to improve the performance.
arXiv Detail & Related papers (2022-06-28T03:05:37Z) - Self-Supervised Pre-Training for Transformer-Based Person
Re-Identification [54.55281692768765]
Transformer-based supervised pre-training achieves great performance in person re-identification (ReID)
Due to the domain gap between ImageNet and ReID datasets, it usually needs a larger pre-training dataset to boost the performance.
This work aims to mitigate the gap between the pre-training and ReID datasets from the perspective of data and model structure.
arXiv Detail & Related papers (2021-11-23T18:59:08Z) - "Knights": First Place Submission for VIPriors21 Action Recognition
Challenge at ICCV 2021 [39.990872080183884]
This report presents "Knights" to solve the action recognition task on a small subset of Kinetics400ViPriors.
Our approach has 3 main components: state-of-the-art Temporal Contrastive self-supervised pretraining, video transformer models, and optical flow modality.
arXiv Detail & Related papers (2021-10-14T22:47:31Z) - An Empirical Study of Vehicle Re-Identification on the AI City Challenge [19.13038665501964]
The Track2 is a vehicle re-identification (ReID) task with both the real-world data and synthetic data.
We mainly focus on four points, i.e. training data, unsupervised domain-adaptive (UDA) training, post-processing, model ensembling in this challenge.
With aforementioned techniques, our method finally achieves 0.7445 mAP score, yielding the first place in the competition.
arXiv Detail & Related papers (2021-05-20T12:20:52Z) - VIPriors 1: Visual Inductive Priors for Data-Efficient Deep Learning
Challenges [8.50468505606714]
We offer four data-impaired challenges, where models are trained from scratch, and we reduce the number of training samples to a fraction of the full set.
To encourage data efficient solutions, we prohibited the use of pre-trained models and other transfer learning techniques.
arXiv Detail & Related papers (2021-03-05T15:58:17Z) - Efficient Conditional Pre-training for Transfer Learning [71.01129334495553]
We propose efficient filtering methods to select relevant subsets from the pre-training dataset.
We validate our techniques by pre-training on ImageNet in both the unsupervised and supervised settings.
We improve standard ImageNet pre-training by 1-3% by tuning available models on our subsets and pre-training on a dataset filtered from a larger scale dataset.
arXiv Detail & Related papers (2020-11-20T06:16:15Z) - LID 2020: The Learning from Imperfect Data Challenge Results [242.86700551532272]
Learning from Imperfect Data workshop aims to inspire and facilitate the research in developing novel approaches.
We organize three challenges to find the state-of-the-art approaches in weakly supervised learning setting.
This technical report summarizes the highlights from the challenge.
arXiv Detail & Related papers (2020-10-17T13:06:12Z) - 2nd Place Scheme on Action Recognition Track of ECCV 2020 VIPriors
Challenges: An Efficient Optical Flow Stream Guided Framework [57.847010327319964]
We propose a data-efficient framework that can train the model from scratch on small datasets.
Specifically, by introducing a 3D central difference convolution operation, we proposed a novel C3D neural network-based two-stream framework.
It is proved that our method can achieve a promising result even without a pre-trained model on large scale datasets.
arXiv Detail & Related papers (2020-08-10T09:50:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.