Linear Object Detection in Document Images using Multiple Object
Tracking
- URL: http://arxiv.org/abs/2305.16968v1
- Date: Fri, 26 May 2023 14:22:03 GMT
- Title: Linear Object Detection in Document Images using Multiple Object
Tracking
- Authors: Philippe Bernet (1), Joseph Chazalon (1), Edwin Carlinet (1),
Alexandre Bourquelot (1), Elodie Puybareau (1) ((1) EPITA Research Lab.)
- Abstract summary: Linear objects convey substantial information about document structure.
Many approaches can recover some vector representation, but only one closed-source technique introduced in 1994.
We propose a framework for accurate instance segmentation of linear objects in document images using Multiple Object Tracking.
- Score: 58.720142291102135
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Linear objects convey substantial information about document structure, but
are challenging to detect accurately because of degradation (curved, erased) or
decoration (doubled, dashed). Many approaches can recover some vector
representation, but only one closed-source technique introduced in 1994, based
on Kalman filters (a particular case of Multiple Object Tracking algorithm),
can perform a pixel-accurate instance segmentation of linear objects and enable
to selectively remove them from the original image. We aim at re-popularizing
this approach and propose: 1. a framework for accurate instance segmentation of
linear objects in document images using Multiple Object Tracking (MOT); 2.
document image datasets and metrics which enable both vector- and pixel-based
evaluation of linear object detection; 3. performance measures of MOT
approaches against modern segment detectors; 4. performance measures of various
tracking strategies, exhibiting alternatives to the original Kalman filters
approach; and 5. an open-source implementation of a detector which can
discriminate instances of curved, erased, dashed, intersecting and/or
overlapping linear objects.
Related papers
- Object-Centric Multiple Object Tracking [124.30650395969126]
This paper proposes a video object-centric model for multiple-object tracking pipelines.
It consists of an index-merge module that adapts the object-centric slots into detection outputs and an object memory module.
Benefited from object-centric learning, we only require sparse detection labels for object localization and feature binding.
arXiv Detail & Related papers (2023-09-01T03:34:12Z) - Image Segmentation-based Unsupervised Multiple Objects Discovery [1.7674345486888503]
Unsupervised object discovery aims to localize objects in images.
We propose a fully unsupervised, bottom-up approach, for multiple objects discovery.
We provide state-of-the-art results for both unsupervised class-agnostic object detection and unsupervised image segmentation.
arXiv Detail & Related papers (2022-12-20T09:48:24Z) - Deformable One-Dimensional Object Detection for Routing and Manipulation [8.860083597706502]
This paper proposes an approach for detecting deformable one-dimensional objects which can handle crossings and occlusions.
Our algorithm takes an image containing a deformable object and outputs a chain of fixed-length cylindrical segments connected with passive spherical joints.
Our tests and experiments have shown that the method can correctly detect deformable one-dimensional objects in various complex conditions.
arXiv Detail & Related papers (2022-01-18T07:19:17Z) - Ensembling object detectors for image and video data analysis [98.26061123111647]
We propose a method for ensembling the outputs of multiple object detectors for improving detection performance and precision of bounding boxes on image data.
We extend it to video data by proposing a two-stage tracking-based scheme for detection refinement.
arXiv Detail & Related papers (2021-02-09T12:38:16Z) - FMODetect: Robust Detection and Trajectory Estimation of Fast Moving
Objects [110.29738581961955]
We propose the first learning-based approach for detection and trajectory estimation of fast moving objects.
The proposed method first detects all fast moving objects as a truncated distance function to the trajectory.
For the sharp appearance estimation, we propose an energy minimization based deblurring.
arXiv Detail & Related papers (2020-12-15T11:05:34Z) - Slender Object Detection: Diagnoses and Improvements [74.40792217534]
In this paper, we are concerned with the detection of a particular type of objects with extreme aspect ratios, namely textbfslender objects.
For a classical object detection method, a drastic drop of $18.9%$ mAP on COCO is observed, if solely evaluated on slender objects.
arXiv Detail & Related papers (2020-11-17T09:39:42Z) - End-to-End Multi-Object Tracking with Global Response Map [23.755882375664875]
We present a completely end-to-end approach that takes image-sequence/video as input and outputs directly the located and tracked objects of learned types.
Specifically, with our introduced multi-object representation strategy, a global response map can be accurately generated over frames.
Experimental results based on the MOT16 and MOT17 benchmarks show that our proposed on-line tracker achieved state-of-the-art performance on several tracking metrics.
arXiv Detail & Related papers (2020-07-13T12:30:49Z) - Black-box Explanation of Object Detectors via Saliency Maps [66.745167677293]
We propose D-RISE, a method for generating visual explanations for the predictions of object detectors.
We show that D-RISE can be easily applied to different object detectors including one-stage detectors such as YOLOv3 and two-stage detectors such as Faster-RCNN.
arXiv Detail & Related papers (2020-06-05T02:13:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.