Weakly Supervised Instance Attention for Multisource Fine-Grained Object
Recognition with an Application to Tree Species Classification
- URL: http://arxiv.org/abs/2105.10983v2
- Date: Tue, 25 May 2021 20:35:38 GMT
- Title: Weakly Supervised Instance Attention for Multisource Fine-Grained Object
Recognition with an Application to Tree Species Classification
- Authors: Bulut Aygunes, Ramazan Gokberk Cinbis, Selim Aksoy
- Abstract summary: We propose a multisource method to classify relatively small objects.
The proposed method uses a single-source deep instance attention model with parallel branches for joint localization and classification of objects.
We show that all levels of fusion provide higher accuracies compared to the state-of-the-art, with the best performing method of feature-level fusion resulting in 53% accuracy for the recognition of 40 different types of trees.
- Score: 9.668407688201361
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Multisource image analysis that leverages complementary spectral, spatial,
and structural information benefits fine-grained object recognition that aims
to classify an object into one of many similar subcategories. However, for
multisource tasks that involve relatively small objects, even the smallest
registration errors can introduce high uncertainty in the classification
process. We approach this problem from a weakly supervised learning perspective
in which the input images correspond to larger neighborhoods around the
expected object locations where an object with a given class label is present
in the neighborhood without any knowledge of its exact location. The proposed
method uses a single-source deep instance attention model with parallel
branches for joint localization and classification of objects, and extends this
model into a multisource setting where a reference source that is assumed to
have no location uncertainty is used to aid the fusion of multiple sources in
four different levels: probability level, logit level, feature level, and pixel
level. We show that all levels of fusion provide higher accuracies compared to
the state-of-the-art, with the best performing method of feature-level fusion
resulting in 53% accuracy for the recognition of 40 different types of trees,
corresponding to an improvement of 5.7% over the best performing baseline when
RGB, multispectral, and LiDAR data are used. We also provide an in-depth
comparison by evaluating each model at various parameter complexity settings,
where the increased model capacity results in a further improvement of 6.3%
over the default capacity setting.
Related papers
- Enhanced Semantic Segmentation for Large-Scale and Imbalanced Point Clouds [6.253217784798542]
Small-sized objects are prone to be under-sampled or misclassified due to their low occurrence frequency.
We propose the Multilateral Cascading Network (MCNet) for large-scale and sample-imbalanced point cloud scenes.
arXiv Detail & Related papers (2024-09-21T02:23:01Z) - GLCONet: Learning Multi-source Perception Representation for Camouflaged Object Detection [23.872633359324098]
We propose a novel Global-Local Collaborative Optimization Network, called GLCONet.
In this paper, we first design a collaborative optimization strategy to simultaneously model the local details and global long-range relationships.
Experiments demonstrate that the proposed GLCONet method with different backbones can effectively activate potentially significant pixels in an image.
arXiv Detail & Related papers (2024-09-15T02:26:17Z) - Hierarchical Attention and Parallel Filter Fusion Network for Multi-Source Data Classification [33.26466989592473]
We propose a hierarchical attention and parallel filter fusion network for multi-source data classification.
Our proposed method achieves 91.44% and 80.51% of overall accuracy (OA) on the respective datasets.
arXiv Detail & Related papers (2024-08-22T23:14:22Z) - Few-shot Object Localization [37.347898735345574]
This paper defines a novel task named Few-Shot Object localization (FSOL)
It aims to achieve precise localization with limited samples.
This task achieves generalized object localization by leveraging a small number of labeled support samples to query the positional information of objects within corresponding images.
Experimental results demonstrate a significant performance improvement of our approach in the FSOL task, establishing an efficient benchmark for further research.
arXiv Detail & Related papers (2024-03-19T05:50:48Z) - Weakly-supervised Contrastive Learning for Unsupervised Object Discovery [52.696041556640516]
Unsupervised object discovery is promising due to its ability to discover objects in a generic manner.
We design a semantic-guided self-supervised learning model to extract high-level semantic features from images.
We introduce Principal Component Analysis (PCA) to localize object regions.
arXiv Detail & Related papers (2023-07-07T04:03:48Z) - Auto-Transfer: Learning to Route Transferrable Representations [77.30427535329571]
We propose a novel adversarial multi-armed bandit approach which automatically learns to route source representations to appropriate target representations.
We see upwards of 5% accuracy improvements compared with the state-of-the-art knowledge transfer methods.
arXiv Detail & Related papers (2022-02-02T13:09:27Z) - Multi-Source Domain Adaptation for Object Detection [52.87890831055648]
We propose a unified Faster R-CNN based framework, termed Divide-and-Merge Spindle Network (DMSN)
DMSN can simultaneously enhance domain innative and preserve discriminative power.
We develop a novel pseudo learning algorithm to approximate optimal parameters of pseudo target subset.
arXiv Detail & Related papers (2021-06-30T03:17:20Z) - Multi-scale Interactive Network for Salient Object Detection [91.43066633305662]
We propose the aggregate interaction modules to integrate the features from adjacent levels.
To obtain more efficient multi-scale features, the self-interaction modules are embedded in each decoder unit.
Experimental results on five benchmark datasets demonstrate that the proposed method without any post-processing performs favorably against 23 state-of-the-art approaches.
arXiv Detail & Related papers (2020-07-17T15:41:37Z) - Pairwise Similarity Knowledge Transfer for Weakly Supervised Object
Localization [53.99850033746663]
We study the problem of learning localization model on target classes with weakly supervised image labels.
In this work, we argue that learning only an objectness function is a weak form of knowledge transfer.
Experiments on the COCO and ILSVRC 2013 detection datasets show that the performance of the localization model improves significantly with the inclusion of pairwise similarity function.
arXiv Detail & Related papers (2020-03-18T17:53:33Z) - Improving Few-shot Learning by Spatially-aware Matching and
CrossTransformer [116.46533207849619]
We study the impact of scale and location mismatch in the few-shot learning scenario.
We propose a novel Spatially-aware Matching scheme to effectively perform matching across multiple scales and locations.
arXiv Detail & Related papers (2020-01-06T14:10:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.