BBBD: Bounding Box Based Detector for Occlusion Detection and Order
Recovery
- URL: http://arxiv.org/abs/2204.12841v1
- Date: Wed, 27 Apr 2022 10:56:18 GMT
- Title: BBBD: Bounding Box Based Detector for Occlusion Detection and Order
Recovery
- Authors: Kaziwa Saleh, Zoltan Vamossy
- Abstract summary: Occlusion handling is one of the challenges of object detection and segmentation, and scene understanding.
We propose a simpler and faster method that can perform both operations without any training and only requires the modal segmentation masks.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Occlusion handling is one of the challenges of object detection and
segmentation, and scene understanding. Because objects appear differently when
they are occluded in varying degree, angle, and locations. Therefore,
determining the existence of occlusion between objects and their order in a
scene is a fundamental requirement for semantic understanding. Existing works
mostly use deep learning based models to retrieve the order of the instances in
an image or for occlusion detection. This requires labelled occluded data and
it is time consuming. In this paper, we propose a simpler and faster method
that can perform both operations without any training and only requires the
modal segmentation masks. For occlusion detection, instead of scanning the two
objects entirely, we only focus on the intersected area between their bounding
boxes. Similarly, we use the segmentation mask inside the same area to recover
the depth-ordering. When tested on COCOA dataset, our method achieves +8% and
+5% more accuracy than the baselines in order recovery and occlusion detection
respectively.
Related papers
- Occlusion-Ordered Semantic Instance Segmentation [14.767310282538688]
Occlusion-Ordered Instance and segmentation of Instance (OOSIS)
We develop an approach to OOSIS that extracts instances and their order simultaneously from oriented occlusion boundaries and semantic segmentation.
We achieve better performance than strong baselines on KINS and COCOA datasets.
arXiv Detail & Related papers (2025-04-18T19:52:37Z) - Toward Realistic Camouflaged Object Detection: Benchmarks and Method [11.279532701331647]
Camouflaged object detection (COD) primarily relies on semantic or instance segmentation methods.
We propose a camouflage-aware feature refinement (CAFR) strategy to detect camouflaged objects.
CAFR fully utilizes a clear perception of the current object within the prior knowledge of large models to assist detectors in deeply understanding the distinctions between background and foreground.
arXiv Detail & Related papers (2025-01-13T13:04:00Z) - LESS: Label-Efficient and Single-Stage Referring 3D Segmentation [55.06002976797879]
Referring 3D is a visual-language task that segments all points of the specified object from a 3D point cloud described by a sentence of query.
We propose a novel Referring 3D pipeline, Label-Efficient and Single-Stage, dubbed LESS, which is only under the supervision of efficient binary mask.
We achieve state-of-the-art performance on ScanRefer dataset by surpassing the previous methods about 3.7% mIoU using only binary labels.
arXiv Detail & Related papers (2024-10-17T07:47:41Z) - MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked Autoencoders [93.87585467898252]
We design MonoMAE, a monocular 3D detector inspired by Masked Autoencoders.
MonoMAE consists of two novel designs. The first is depth-aware masking that selectively masks certain parts of non-occluded object queries.
The second is lightweight query completion that works with the depth-aware masking to learn to reconstruct and complete the masked object queries.
arXiv Detail & Related papers (2024-05-13T12:32:45Z) - Object-Centric Multiple Object Tracking [124.30650395969126]
This paper proposes a video object-centric model for multiple-object tracking pipelines.
It consists of an index-merge module that adapts the object-centric slots into detection outputs and an object memory module.
Benefited from object-centric learning, we only require sparse detection labels for object localization and feature binding.
arXiv Detail & Related papers (2023-09-01T03:34:12Z) - A Simple Framework for Open-Vocabulary Segmentation and Detection [85.21641508535679]
We present OpenSeeD, a simple Open-vocabulary and Detection framework that jointly learns from different segmentation and detection datasets.
We first introduce a pre-trained text encoder to encode all the visual concepts in two tasks and learn a common semantic space for them.
After pre-training, our model exhibits competitive or stronger zero-shot transferability for both segmentation and detection.
arXiv Detail & Related papers (2023-03-14T17:58:34Z) - Robust Visual Tracking by Segmentation [103.87369380021441]
Estimating the target extent poses a fundamental challenge in visual object tracking.
We propose a segmentation-centric tracking pipeline that produces a highly accurate segmentation mask.
Our tracker is able to better learn a target representation that clearly differentiates the target in the scene from background content.
arXiv Detail & Related papers (2022-03-21T17:59:19Z) - Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers [72.38919601150175]
We propose Bilayer Convolutional Network (BCNet) to segment highly-overlapping objects.
BCNet detects the occluding objects (occluder) and the bottom GCN layer infers partially occluded instance (occludee)
arXiv Detail & Related papers (2021-03-23T06:25:42Z) - IterDet: Iterative Scheme for Object Detection in Crowded Environments [6.2997667081978825]
Deep learning-based detectors usually produce a redundant set of object bounding boxes.
These boxes are filtered using non-maximum suppression (NMS) in order to select exactly one bounding box per object of interest.
This greedy scheme is simple and provides sufficient accuracy for isolated objects but often fails in crowded environments.
In this work we develop an alternative iterative scheme, where a new subset of objects is detected at each iteration.
arXiv Detail & Related papers (2020-05-12T12:04:27Z) - Instance Segmentation of Biomedical Images with an Object-aware
Embedding Learned with Local Constraints [7.151685185368064]
State-of-the-art approaches either perform semantic segmentation or refine object bounding boxes.
Both suffer from crowded objects to varying degrees, merging adjacent objects or suppressing a valid object.
In this work, we assign an embedding vector to each pixel through a deep neural network.
arXiv Detail & Related papers (2020-04-21T08:33:29Z) - BANet: Bidirectional Aggregation Network with Occlusion Handling for
Panoptic Segmentation [30.008473359758632]
Panoptic segmentation aims to perform instance segmentation for foreground instances and semantic segmentation for background stuff simultaneously.
We propose a novel deep panoptic segmentation scheme based on a bidirectional learning pipeline.
The experimental results on COCO panoptic benchmark validate the effectiveness of our proposed method.
arXiv Detail & Related papers (2020-03-31T08:57:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.