A High-Resolution Dataset for Instance Detection with Multi-View
Instance Capture
- URL: http://arxiv.org/abs/2310.19257v1
- Date: Mon, 30 Oct 2023 03:58:41 GMT
- Title: A High-Resolution Dataset for Instance Detection with Multi-View
Instance Capture
- Authors: Qianqian Shen, Yunhan Zhao, Nahyun Kwon, Jeeeun Kim, Yanan Li, Shu
Kong
- Abstract summary: Instance detection (InsDet) is a long-lasting problem in robotics and computer vision.
Current InsDet are too small in scale by today's standards.
We introduce a new InsDet dataset and protocol.
- Score: 15.298790238028356
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Instance detection (InsDet) is a long-lasting problem in robotics and
computer vision, aiming to detect object instances (predefined by some visual
examples) in a cluttered scene. Despite its practical significance, its
advancement is overshadowed by Object Detection, which aims to detect objects
belonging to some predefined classes. One major reason is that current InsDet
datasets are too small in scale by today's standards. For example, the popular
InsDet dataset GMU (published in 2016) has only 23 instances, far less than
COCO (80 classes), a well-known object detection dataset published in 2014. We
are motivated to introduce a new InsDet dataset and protocol. First, we define
a realistic setup for InsDet: training data consists of multi-view instance
captures, along with diverse scene images allowing synthesizing training images
by pasting instance images on them with free box annotations. Second, we
release a real-world database, which contains multi-view capture of 100 object
instances, and high-resolution (6k x 8k) testing images. Third, we extensively
study baseline methods for InsDet on our dataset, analyze their performance and
suggest future work. Somewhat surprisingly, using the off-the-shelf
class-agnostic segmentation model (Segment Anything Model, SAM) and the
self-supervised feature representation DINOv2 performs the best, achieving >10
AP better than end-to-end trained InsDet models that repurpose object detectors
(e.g., FasterRCNN and RetinaNet).
Related papers
- Matching Anything by Segmenting Anything [109.2507425045143]
We propose MASA, a novel method for robust instance association learning.
MASA learns instance-level correspondence through exhaustive data transformations.
We show that MASA achieves even better performance than state-of-the-art methods trained with fully annotated in-domain video sequences.
arXiv Detail & Related papers (2024-06-06T16:20:07Z) - Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic Grasping [14.958823096408175]
Foundation models are a strong trend in deep learning and computer vision.
Here, we focus on training such an object identification model.
Key solution to train such a model is the centroid triplet loss (CTL), which aggregates image features to their centroids.
arXiv Detail & Related papers (2024-04-09T13:01:26Z) - Exploring Robust Features for Few-Shot Object Detection in Satellite
Imagery [17.156864650143678]
We develop a few-shot object detector based on a traditional two-stage architecture.
A large-scale pre-trained model is used to build class-reference embeddings or prototypes.
We perform evaluations on two remote sensing datasets containing challenging and rare objects.
arXiv Detail & Related papers (2024-03-08T15:20:27Z) - Scaling Novel Object Detection with Weakly Supervised Detection
Transformers [21.219817483091166]
We propose the Weakly Supervised Detection Transformer, which enables efficient knowledge transfer from a large-scale pretraining dataset to WSOD finetuning.
Our experiments show that our approach outperforms previous state-of-the-art models on large-scale novel object detection datasets.
arXiv Detail & Related papers (2022-07-11T21:45:54Z) - A Survey of Self-Supervised and Few-Shot Object Detection [19.647681501581225]
Self-supervised methods aim at learning representations from unlabeled data which transfer well to downstream tasks such as object detection.
Few-shot object detection is about training a model on novel (unseen) object classes with little data.
In this survey, we review and characterize the most recent approaches on few-shot and self-supervised object detection.
arXiv Detail & Related papers (2021-10-27T18:55:47Z) - Discovery-and-Selection: Towards Optimal Multiple Instance Learning for
Weakly Supervised Object Detection [86.86602297364826]
We propose a discoveryand-selection approach fused with multiple instance learning (DS-MIL)
Our proposed DS-MIL approach can consistently improve the baselines, reporting state-of-the-art performance.
arXiv Detail & Related papers (2021-10-18T07:06:57Z) - Salient Objects in Clutter [130.63976772770368]
This paper identifies and addresses a serious design bias of existing salient object detection (SOD) datasets.
This design bias has led to a saturation in performance for state-of-the-art SOD models when evaluated on existing datasets.
We propose a new high-quality dataset and update the previous saliency benchmark.
arXiv Detail & Related papers (2021-05-07T03:49:26Z) - Few-shot Weakly-Supervised Object Detection via Directional Statistics [55.97230224399744]
We propose a probabilistic multiple instance learning approach for few-shot Common Object Localization (COL) and few-shot Weakly Supervised Object Detection (WSOD)
Our model simultaneously learns the distribution of the novel objects and localizes them via expectation-maximization steps.
Our experiments show that the proposed method, despite being simple, outperforms strong baselines in few-shot COL and WSOD, as well as large-scale WSOD tasks.
arXiv Detail & Related papers (2021-03-25T22:34:16Z) - UniT: Unified Knowledge Transfer for Any-shot Object Detection and
Segmentation [52.487469544343305]
Methods for object detection and segmentation rely on large scale instance-level annotations for training.
We propose an intuitive and unified semi-supervised model that is applicable to a range of supervision.
arXiv Detail & Related papers (2020-06-12T22:45:47Z) - Object Instance Mining for Weakly Supervised Object Detection [24.021995037282394]
This paper introduces an end-to-end object instance mining (OIM) framework for weakly supervised object detection.
OIM attempts to detect all possible object instances existing in each image by introducing information propagation on the spatial and appearance graphs.
During the iterative learning process, the less discriminative object instances from the same class can be gradually detected and utilized for training.
arXiv Detail & Related papers (2020-02-04T02:11:39Z) - Stance Detection Benchmark: How Robust Is Your Stance Detection? [65.91772010586605]
Stance Detection (StD) aims to detect an author's stance towards a certain topic or claim.
We introduce a StD benchmark that learns from ten StD datasets of various domains in a multi-dataset learning setting.
Within this benchmark setup, we are able to present new state-of-the-art results on five of the datasets.
arXiv Detail & Related papers (2020-01-06T13:37:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.