Multi-Attention-Based Soft Partition Network for Vehicle
Re-Identification
- URL: http://arxiv.org/abs/2104.10401v2
- Date: Wed, 2 Aug 2023 07:58:00 GMT
- Title: Multi-Attention-Based Soft Partition Network for Vehicle
Re-Identification
- Authors: Sangrok Lee, Taekang Woo, Sang Hun Lee
- Abstract summary: Vehicle re-identification helps in distinguishing between images of the same and other vehicles.
We propose a new vehicle re-identification network based on a multiple soft attention mechanism for capturing various discriminative regions.
Our proposed model achieved a state-of-the-art performance among the attention-based methods without metadata.
- Score: 12.319699929810355
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Vehicle re-identification helps in distinguishing between images of the same
and other vehicles. It is a challenging process because of significant
intra-instance differences between identical vehicles from different views and
subtle inter-instance differences between similar vehicles. To solve this
issue, researchers have extracted view-aware or part-specific features via
spatial attention mechanisms, which usually result in noisy attention maps or
otherwise require expensive additional annotation for metadata, such as key
points, to improve the quality. Meanwhile, based on the researchers' insights,
various handcrafted multi-attention architectures for specific viewpoints or
vehicle parts have been proposed. However, this approach does not guarantee
that the number and nature of attention branches will be optimal for real-world
re-identification tasks. To address these problems, we proposed a new vehicle
re-identification network based on a multiple soft attention mechanism for
capturing various discriminative regions from different viewpoints more
efficiently. Furthermore, this model can significantly reduce the noise in
spatial attention maps by devising a new method for creating an attention map
for insignificant regions and then excluding it from generating the final
result. We also combined a channel-wise attention mechanism with a spatial
attention mechanism for the efficient selection of important semantic
attributes for vehicle re-identification. Our experiments showed that our
proposed model achieved a state-of-the-art performance among the
attention-based methods without metadata and was comparable to the approaches
using metadata for the VehicleID and VERI-Wild datasets.
Related papers
- Invariant Causal Mechanisms through Distribution Matching [86.07327840293894]
In this work we provide a causal perspective and a new algorithm for learning invariant representations.
Empirically we show that this algorithm works well on a diverse set of tasks and in particular we observe state-of-the-art performance on domain generalization.
arXiv Detail & Related papers (2022-06-23T12:06:54Z) - Discriminative-Region Attention and Orthogonal-View Generation Model for
Vehicle Re-Identification [7.5366501970852955]
Multiple challenges hamper the applications of vision-based vehicle Re-ID methods.
The proposed DRA model can automatically extract the discriminative region features, which can distinguish similar vehicles.
And the OVG model can generate multi-view features based on the input view features to reduce the impact of viewpoint mismatches.
arXiv Detail & Related papers (2022-04-28T07:46:03Z) - Pluggable Weakly-Supervised Cross-View Learning for Accurate Vehicle
Re-Identification [53.6218051770131]
Cross-view consistent feature representation is key for accurate vehicle ReID.
Existing approaches resort to supervised cross-view learning using extensive extra viewpoints annotations.
We present a pluggable Weakly-supervised Cross-View Learning (WCVL) module for vehicle ReID.
arXiv Detail & Related papers (2021-03-09T11:51:09Z) - Detecting 32 Pedestrian Attributes for Autonomous Vehicles [103.87351701138554]
In this paper, we address the problem of jointly detecting pedestrians and recognizing 32 pedestrian attributes.
We introduce a Multi-Task Learning (MTL) model relying on a composite field framework, which achieves both goals in an efficient way.
We show competitive detection and attribute recognition results, as well as a more stable MTL training.
arXiv Detail & Related papers (2020-12-04T15:10:12Z) - Discriminative Feature Representation with Spatio-temporal Cues for
Vehicle Re-identification [0.0]
Vehicle-identification (re-ID) aims to discover and match the target vehicles from a gallery image set taken by different cameras on a wide range of road networks.
We propose a feature representation with novel clues (DFR-ST) for vehicle re-ID.
It is capable of building robust features in the embedding space by involving appearance and retemporal information.
arXiv Detail & Related papers (2020-11-13T10:50:21Z) - Viewpoint-Aware Channel-Wise Attentive Network for Vehicle
Re-Identification [30.797140440568455]
Vehicle re-identification (re-ID) matches images of the same vehicle across different cameras.
It is fundamentally challenging because the dramatically different appearance caused by different viewpoints would make the framework fail to match two vehicles of the same identity.
We propose Viewpoint-aware Channel-wise Attention Mechanism (VCAM) by observing the attention mechanism from a different aspect.
arXiv Detail & Related papers (2020-10-12T16:05:41Z) - Orientation-aware Vehicle Re-identification with Semantics-guided Part
Attention Network [33.712450134663236]
We propose a dedicated Semantics-guided Part Attention Network (SPAN) to robustly predict part attention masks for different views of vehicles.
With the help of part attention masks, we can extract discriminative features in each part separately.
Then we introduce Co-occurrence Part-attentive Distance Metric (CPDM) which places greater emphasis on co-occurrence vehicle parts.
arXiv Detail & Related papers (2020-08-26T07:33:09Z) - The Devil is in the Details: Self-Supervised Attention for Vehicle
Re-Identification [75.3310894042132]
Self-supervised Attention for Vehicle Re-identification (SAVER) is a novel approach to effectively learn vehicle-specific discriminative features.
We show that SAVER improves upon the state-of-the-art on challenging VeRi, VehicleID, Vehicle-1M and VERI-Wild datasets.
arXiv Detail & Related papers (2020-04-14T02:24:47Z) - Parsing-based View-aware Embedding Network for Vehicle Re-Identification [138.11983486734576]
We propose a parsing-based view-aware embedding network (PVEN) to achieve the view-aware feature alignment and enhancement for vehicle ReID.
The experiments conducted on three datasets show that our model outperforms state-of-the-art methods by a large margin.
arXiv Detail & Related papers (2020-04-10T13:06:09Z) - Co-Saliency Spatio-Temporal Interaction Network for Person
Re-Identification in Videos [85.6430597108455]
We propose a novel Co-Saliency Spatio-Temporal Interaction Network (CSTNet) for person re-identification in videos.
It captures the common salient foreground regions among video frames and explores the spatial-temporal long-range context interdependency from such regions.
Multiple spatialtemporal interaction modules within CSTNet are proposed, which exploit the spatial and temporal long-range context interdependencies on such features and spatial-temporal information correlation.
arXiv Detail & Related papers (2020-04-10T10:23:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.