LID 2020: The Learning from Imperfect Data Challenge Results
- URL: http://arxiv.org/abs/2010.11724v1
- Date: Sat, 17 Oct 2020 13:06:12 GMT
- Title: LID 2020: The Learning from Imperfect Data Challenge Results
- Authors: Yunchao Wei, Shuai Zheng, Ming-Ming Cheng, Hang Zhao, Liwei Wang,
Errui Ding, Yi Yang, Antonio Torralba, Ting Liu, Guolei Sun, Wenguan Wang,
Luc Van Gool, Wonho Bae, Junhyug Noh, Jinhwan Seo, Gunhee Kim, Hao Zhao, Ming
Lu, Anbang Yao, Yiwen Guo, Yurong Chen, Li Zhang, Chuangchuang Tan, Tao Ruan,
Guanghua Gu, Shikui Wei, Yao Zhao, Mariia Dobko, Ostap Viniavskyi, Oles
Dobosevych, Zhendong Wang, Zhenyuan Chen, Chen Gong, Huanqing Yan, Jun He
- Abstract summary: Learning from Imperfect Data workshop aims to inspire and facilitate the research in developing novel approaches.
We organize three challenges to find the state-of-the-art approaches in weakly supervised learning setting.
This technical report summarizes the highlights from the challenge.
- Score: 242.86700551532272
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Learning from imperfect data becomes an issue in many industrial applications
after the research community has made profound progress in supervised learning
from perfectly annotated datasets. The purpose of the Learning from Imperfect
Data (LID) workshop is to inspire and facilitate the research in developing
novel approaches that would harness the imperfect data and improve the
data-efficiency during training. A massive amount of user-generated data
nowadays available on multiple internet services. How to leverage those and
improve the machine learning models is a high impact problem. We organize the
challenges in conjunction with the workshop. The goal of these challenges is to
find the state-of-the-art approaches in the weakly supervised learning setting
for object detection, semantic segmentation, and scene parsing. There are three
tracks in the challenge, i.e., weakly supervised semantic segmentation (Track
1), weakly supervised scene parsing (Track 2), and weakly supervised object
localization (Track 3). In Track 1, based on ILSVRC DET, we provide pixel-level
annotations of 15K images from 200 categories for evaluation. In Track 2, we
provide point-based annotations for the training set of ADE20K. In Track 3,
based on ILSVRC CLS-LOC, we provide pixel-level annotations of 44,271 images
for evaluation. Besides, we further introduce a new evaluation metric proposed
by \cite{zhang2020rethinking}, i.e., IoU curve, to measure the quality of the
generated object localization maps. This technical report summarizes the
highlights from the challenge. The challenge submission server and the
leaderboard will continue to open for the researchers who are interested in it.
More details regarding the challenge and the benchmarks are available at
https://lidchallenge.github.io
Related papers
- Self-Supervised Pre-Training Boosts Semantic Scene Segmentation on LiDAR
Data [0.0]
We propose to train a self-supervised encoder with Barlow Twins and use it as a pre-trained network in the task of semantic scene segmentation.
The experimental results demonstrate that our unsupervised pre-training boosts performance once fine-tuned on the supervised task.
arXiv Detail & Related papers (2023-09-05T11:29:30Z) - Hierarchical Supervision and Shuffle Data Augmentation for 3D
Semi-Supervised Object Detection [90.32180043449263]
State-of-the-art 3D object detectors are usually trained on large-scale datasets with high-quality 3D annotations.
A natural remedy is to adopt semi-supervised learning (SSL) by leveraging a limited amount of labeled samples and abundant unlabeled samples.
This paper introduces a novel approach of Hierarchical Supervision and Shuffle Data Augmentation (HSSDA), which is a simple yet effective teacher-student framework.
arXiv Detail & Related papers (2023-04-04T02:09:32Z) - Recovering 3D Human Mesh from Monocular Images: A Survey [49.00136388529404]
Estimating human pose and shape from monocular images is a long-standing problem in computer vision.
This survey focuses on the task of monocular 3D human mesh recovery.
arXiv Detail & Related papers (2022-03-03T18:56:08Z) - Semi-supervised 3D Object Detection via Adaptive Pseudo-Labeling [18.209409027211404]
3D object detection is an important task in computer vision.
Most existing methods require a large number of high-quality 3D annotations, which are expensive to collect.
We propose a novel semi-supervised framework based on pseudo-labeling for outdoor 3D object detection tasks.
arXiv Detail & Related papers (2021-08-15T02:58:43Z) - Large-scale Unsupervised Semantic Segmentation [163.3568726730319]
We propose a new problem of large-scale unsupervised semantic segmentation (LUSS) with a newly created benchmark dataset to track the research progress.
Based on the ImageNet dataset, we propose the ImageNet-S dataset with 1.2 million training images and 40k high-quality semantic segmentation annotations for evaluation.
arXiv Detail & Related papers (2021-06-06T15:02:11Z) - An Empirical Study of Vehicle Re-Identification on the AI City Challenge [19.13038665501964]
The Track2 is a vehicle re-identification (ReID) task with both the real-world data and synthetic data.
We mainly focus on four points, i.e. training data, unsupervised domain-adaptive (UDA) training, post-processing, model ensembling in this challenge.
With aforementioned techniques, our method finally achieves 0.7445 mAP score, yielding the first place in the competition.
arXiv Detail & Related papers (2021-05-20T12:20:52Z) - REGRAD: A Large-Scale Relational Grasp Dataset for Safe and
Object-Specific Robotic Grasping in Clutter [52.117388513480435]
We present a new dataset named regrad to sustain the modeling of relationships among objects and grasps.
Our dataset is collected in both forms of 2D images and 3D point clouds.
Users are free to import their own object models for the generation of as many data as they want.
arXiv Detail & Related papers (2021-04-29T05:31:21Z) - 2nd Place Scheme on Action Recognition Track of ECCV 2020 VIPriors
Challenges: An Efficient Optical Flow Stream Guided Framework [57.847010327319964]
We propose a data-efficient framework that can train the model from scratch on small datasets.
Specifically, by introducing a 3D central difference convolution operation, we proposed a novel C3D neural network-based two-stream framework.
It is proved that our method can achieve a promising result even without a pre-trained model on large scale datasets.
arXiv Detail & Related papers (2020-08-10T09:50:28Z) - PointContrast: Unsupervised Pre-training for 3D Point Cloud
Understanding [107.02479689909164]
In this work, we aim at facilitating research on 3D representation learning.
We measure the effect of unsupervised pre-training on a large source set of 3D scenes.
arXiv Detail & Related papers (2020-07-21T17:59:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.