Learning Multi-Attention Context Graph for Group-Based Re-Identification
- URL: http://arxiv.org/abs/2104.14236v1
- Date: Thu, 29 Apr 2021 09:57:47 GMT
- Title: Learning Multi-Attention Context Graph for Group-Based Re-Identification
- Authors: Yichao Yan, Jie Qin, Bingbing Ni, Jiaxin Chen, Li Liu, Fan Zhu,
Wei-Shi Zheng, Xiaokang Yang, Ling Shao
- Abstract summary: Learning to re-identify or retrieve a group of people across non-overlapped camera systems has important applications in video surveillance.
In this work, we consider employing context information for identifying groups of people, i.e., group re-id.
We propose a novel unified framework based on graph neural networks to simultaneously address the group-based re-id tasks.
- Score: 214.84551361855443
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Learning to re-identify or retrieve a group of people across non-overlapped
camera systems has important applications in video surveillance. However, most
existing methods focus on (single) person re-identification (re-id), ignoring
the fact that people often walk in groups in real scenarios. In this work, we
take a step further and consider employing context information for identifying
groups of people, i.e., group re-id. We propose a novel unified framework based
on graph neural networks to simultaneously address the group-based re-id tasks,
i.e., group re-id and group-aware person re-id. Specifically, we construct a
context graph with group members as its nodes to exploit dependencies among
different people. A multi-level attention mechanism is developed to formulate
both intra-group and inter-group context, with an additional self-attention
module for robust graph-level representations by attentively aggregating
node-level features. The proposed model can be directly generalized to tackle
group-aware person re-id using node-level representations. Meanwhile, to
facilitate the deployment of deep learning models on these tasks, we build a
new group re-id dataset that contains more than 3.8K images with 1.5K annotated
groups, an order of magnitude larger than existing group re-id datasets.
Extensive experiments on the novel dataset as well as three existing datasets
clearly demonstrate the effectiveness of the proposed framework for both
group-based re-id tasks. The code is available at
https://github.com/daodaofr/group_reid.
Related papers
- The Research of Group Re-identification from Multiple Cameras [0.4955551943523977]
Group re-identification is very challenging since it is not only interfered by view-point and human pose variations in the traditional re-identification tasks.
This paper introduces a novel approach which leverages the multi-granularity information inside groups to facilitate group re-identification.
arXiv Detail & Related papers (2024-07-19T18:28:13Z) - Towards More Practical Group Activity Detection: A New Benchmark and Model [61.39427407758131]
Group activity detection (GAD) is the task of identifying members of each group and classifying the activity of the group at the same time in a video.
We present a new dataset, dubbed Caf'e, which presents more practical scenarios and metrics.
We also propose a new GAD model that deals with an unknown number of groups and latent group members efficiently and effectively.
arXiv Detail & Related papers (2023-12-05T16:48:17Z) - Ranking-based Group Identification via Factorized Attention on Social
Tripartite Graph [68.08590487960475]
We propose a novel GNN-based framework named Contextualized Factorized Attention for Group identification (CFAG)
We devise tripartite graph convolution layers to aggregate information from different types of neighborhoods among users, groups, and items.
To cope with the data sparsity issue, we devise a novel propagation augmentation layer, which is based on our proposed factorized attention mechanism.
arXiv Detail & Related papers (2022-11-02T01:42:20Z) - AggNet: Learning to Aggregate Faces for Group Membership Verification [20.15673797674449]
In some face recognition applications, we are interested to verify whether an individual is a member of a group, without revealing their identity.
Some existing methods, propose a mechanism for quantizing precomputed face descriptors into discrete embeddings and aggregating them into one group representation.
We propose a deep architecture that jointly learns face descriptors and the aggregation mechanism for better end-to-end performances.
arXiv Detail & Related papers (2022-06-17T10:48:34Z) - Towards Group Robustness in the presence of Partial Group Labels [61.33713547766866]
spurious correlations between input samples and the target labels wrongly direct the neural network predictions.
We propose an algorithm that optimize for the worst-off group assignments from a constraint set.
We show improvements in the minority group's performance while preserving overall aggregate accuracy across groups.
arXiv Detail & Related papers (2022-01-10T22:04:48Z) - Graph Neural Netwrok with Interaction Pattern for Group Recommendation [1.066048003460524]
We propose the model GIP4GR (Graph Neural Network with Interaction Pattern For Group Recommendation)
Specifically, our model use the graph neural network framework with powerful representation capabilities to represent the interaction between group-user-items in the topological structure of the graph.
We conducted a lot of experiments on two real-world datasets to illustrate the superior performance of our model.
arXiv Detail & Related papers (2021-09-21T13:42:46Z) - Double-Scale Self-Supervised Hypergraph Learning for Group
Recommendation [35.841350982832545]
Group recommendation suffers seriously from the problem of data sparsity.
We propose a self-supervised hypergraph learning framework for group recommendation to achieve two goals.
arXiv Detail & Related papers (2021-09-09T12:19:49Z) - GroupFormer: Group Activity Recognition with Clustered Spatial-Temporal
Transformer [16.988878921451484]
GroupFormer captures spatial-temporal contextual information jointly to augment the individual and group representations.
The proposed framework outperforms state-of-the-art methods on the Volleyball dataset and Collective Activity dataset.
arXiv Detail & Related papers (2021-08-28T11:24:36Z) - Learning Spatial Context with Graph Neural Network for Multi-Person Pose
Grouping [71.59494156155309]
Bottom-up approaches for image-based multi-person pose estimation consist of two stages: keypoint detection and grouping.
In this work, we formulate the grouping task as a graph partitioning problem, where we learn the affinity matrix with a Graph Neural Network (GNN)
The learned geometry-based affinity is further fused with appearance-based affinity to achieve robust keypoint association.
arXiv Detail & Related papers (2021-04-06T09:21:14Z) - Overcoming Data Sparsity in Group Recommendation [52.00998276970403]
Group recommender systems should be able to accurately learn not only users' personal preferences but also preference aggregation strategy.
In this paper, we take Bipartite Graphding Model (BGEM), the self-attention mechanism and Graph Convolutional Networks (GCNs) as basic building blocks to learn group and user representations in a unified way.
arXiv Detail & Related papers (2020-10-02T07:11:19Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.