OsmLocator: locating overlapping scatter marks with a non-training
generative perspective
- URL: http://arxiv.org/abs/2312.11146v2
- Date: Fri, 22 Dec 2023 15:44:10 GMT
- Title: OsmLocator: locating overlapping scatter marks with a non-training
generative perspective
- Authors: Yuming Qiu, Aleksandra Pizurica, Qi Ming, Nicolas Nadisic
- Abstract summary: Locating overlapping marks faces many difficulties such as no texture, less contextual information, hallow shape and tiny size.
Here, we formulate it as a optimization problem on clustering-based re-visualization from a non-training generative perspective.
We especially built a dataset named 2023 containing hundreds of scatter images with different markers and various levels of overlapping severity, and tested the proposed method and compared it to existing methods.
- Score: 48.50108853199417
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Automated mark localization in scatter images, greatly helpful for
discovering knowledge and understanding enormous document images and reasoning
in visual question answering AI systems, is a highly challenging problem
because of the ubiquity of overlapping marks. Locating overlapping marks faces
many difficulties such as no texture, less contextual information, hallow shape
and tiny size. Here, we formulate it as a combinatorial optimization problem on
clustering-based re-visualization from a non-training generative perspective,
to locate scatter marks by finding the status of multi-variables when an
objective function reaches a minimum. The objective function is constructed on
difference between binarized scatter images and corresponding generated
re-visualization based on their clustering. Fundamentally, re-visualization
tries to generate a new scatter graph only taking a rasterized scatter image as
an input, and clustering is employed to provide the information for such
re-visualization. This method could stably locate severely-overlapping,
variable-size and variable-shape marks in scatter images without dependence of
any training dataset or reference. Meanwhile, we propose an adaptive variant of
simulated annealing which can works on various connected regions. In addition,
we especially built a dataset named SML2023 containing hundreds of scatter
images with different markers and various levels of overlapping severity, and
tested the proposed method and compared it to existing methods. The results
show that it can accurately locate most marks in scatter images with different
overlapping severity and marker types, with about 0.3 absolute increase on an
assignment-cost-based metric in comparison with state-of-the-art methods. This
work is of value to data mining on massive web pages and literatures, and
shedding new light on image measurement such as bubble counting.
Related papers
- Semi-supervised segmentation of land cover images using nonlinear
canonical correlation analysis with multiple features and t-SNE [1.7000283696243563]
Image segmentation is a clustering task whereby each pixel is assigned a cluster label.
In this work, by resorting to label only a small quantity of pixels, a new semi-supervised segmentation approach is proposed.
The proposed semi-supervised RBF-CCA algorithm has been implemented on several remotely sensed multispectral images.
arXiv Detail & Related papers (2024-01-22T17:56:07Z) - A Locality-based Neural Solver for Optical Motion Capture [37.28597049192196]
Given noisy marker data, we propose a new heterogeneous graph neural network which treats markers and joints as different types of nodes.
We show that our method outperforms state-of-the-art methods in terms of prediction accuracy of occluded marker position error.
arXiv Detail & Related papers (2023-09-01T12:40:17Z) - Fine-grained Recognition with Learnable Semantic Data Augmentation [68.48892326854494]
Fine-grained image recognition is a longstanding computer vision challenge.
We propose diversifying the training data at the feature-level to alleviate the discriminative region loss problem.
Our method significantly improves the generalization performance on several popular classification networks.
arXiv Detail & Related papers (2023-09-01T11:15:50Z) - Self-similarity Driven Scale-invariant Learning for Weakly Supervised
Person Search [66.95134080902717]
We propose a novel one-step framework, named Self-similarity driven Scale-invariant Learning (SSL)
We introduce a Multi-scale Exemplar Branch to guide the network in concentrating on the foreground and learning scale-invariant features.
Experiments on PRW and CUHK-SYSU databases demonstrate the effectiveness of our method.
arXiv Detail & Related papers (2023-02-25T04:48:11Z) - Correlation Verification for Image Retrieval [15.823918683848877]
We propose a novel image retrieval re-ranking network named Correlation Verification Networks (CVNet)
CVNet compresses dense feature correlation into image similarity while learning diverse geometric matching patterns from various image pairs.
Our proposed network shows state-of-the-art performance on several retrieval benchmarks with a significant margin.
arXiv Detail & Related papers (2022-04-04T13:18:49Z) - Digging Into Self-Supervised Learning of Feature Descriptors [14.47046413243358]
We propose a set of improvements that combined lead to powerful feature descriptors.
We show that increasing the search space from in-pair to in-batch for hard negative mining brings consistent improvement.
We demonstrate that a combination of synthetic homography transformation, color augmentation, and photorealistic image stylization produces useful representations.
arXiv Detail & Related papers (2021-10-10T12:22:44Z) - Spatial-spectral Hyperspectral Image Classification via Multiple Random
Anchor Graphs Ensemble Learning [88.60285937702304]
This paper proposes a novel spatial-spectral HSI classification method via multiple random anchor graphs ensemble learning (RAGE)
Firstly, the local binary pattern is adopted to extract the more descriptive features on each selected band, which preserves local structures and subtle changes of a region.
Secondly, the adaptive neighbors assignment is introduced in the construction of anchor graph, to reduce the computational complexity.
arXiv Detail & Related papers (2021-03-25T09:31:41Z) - Data Augmentation for Object Detection via Differentiable Neural
Rendering [71.00447761415388]
It is challenging to train a robust object detector when annotated data is scarce.
Existing approaches to tackle this problem include semi-supervised learning that interpolates labeled data from unlabeled data.
We introduce an offline data augmentation method for object detection, which semantically interpolates the training data with novel views.
arXiv Detail & Related papers (2021-03-04T06:31:06Z) - Multi-temporal and multi-source remote sensing image classification by
nonlinear relative normalization [17.124438150480326]
We study a methodology that aligns data from different domains in a nonlinear way through em kernelization.
We successfully test KEMA in multi-temporal and multi-source very high resolution classification tasks, as well as on the task of making a model invariant to shadowing for hyperspectral imaging.
arXiv Detail & Related papers (2020-12-07T08:46:11Z) - High-Order Information Matters: Learning Relation and Topology for
Occluded Person Re-Identification [84.43394420267794]
We propose a novel framework by learning high-order relation and topology information for discriminative features and robust alignment.
Our framework significantly outperforms state-of-the-art by6.5%mAP scores on Occluded-Duke dataset.
arXiv Detail & Related papers (2020-03-18T12:18:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.