Multi-Granularity Graph Pooling for Video-based Person Re-Identification
- URL: http://arxiv.org/abs/2209.11584v1
- Date: Fri, 23 Sep 2022 13:26:05 GMT
- Title: Multi-Granularity Graph Pooling for Video-based Person Re-Identification
- Authors: Honghu Pan, Yongyong Chen, Zhenyu He
- Abstract summary: graph neural networks (GNNs) are introduced to aggregate temporal and spatial features of video samples.
Existing graph-based models, like STGCN, perform the textitmean/textitmax pooling on node features to obtain the graph representation.
We propose the graph pooling network (GPNet) to learn the multi-granularity graph representation for the video retrieval.
- Score: 14.943835935921296
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The video-based person re-identification (ReID) aims to identify the given
pedestrian video sequence across multiple non-overlapping cameras.
To aggregate the temporal and spatial features of the video samples, the
graph neural networks (GNNs) are introduced.
However, existing graph-based models, like STGCN, perform the
\textit{mean}/\textit{max pooling} on node features to obtain the graph
representation, which neglect the graph topology and node importance.
In this paper, we propose the graph pooling network (GPNet) to learn the
multi-granularity graph representation for the video retrieval, where the
\textit{graph pooling layer} is implemented to downsample the graph.
We first construct a multi-granular graph, whose node features denote image
embedding learned by backbone, and edges are established between the temporal
and Euclidean neighborhood nodes.
We then implement multiple graph convolutional layers to perform the
neighborhood aggregation on the graphs.
To downsample the graph, we propose a multi-head full attention graph pooling
(MHFAPool) layer, which integrates the advantages of existing node clustering
and node selection pooling methods.
Specifically, MHFAPool takes the main eigenvector of full attention matrix as
the aggregation coefficients to involve the global graph information in each
pooled nodes.
Extensive experiments demonstrate that our GPNet achieves the competitive
results on four widely-used datasets, i.e., MARS, DukeMTMC-VideoReID, iLIDS-VID
and PRID-2011.
Related papers
- Graph Parsing Networks [64.5041886737007]
We propose an efficient graph parsing algorithm to infer the pooling structure, which then drives graph pooling.
The resulting Graph Parsing Network (GPN) adaptively learns personalized pooling structure for each individual graph.
arXiv Detail & Related papers (2024-02-22T09:08:36Z) - Saliency-Aware Regularized Graph Neural Network [39.82009838086267]
We propose the Saliency-Aware Regularized Graph Neural Network (SAR-GNN) for graph classification.
We first estimate the global node saliency by measuring the semantic similarity between the compact graph representation and node features.
Then the learned saliency distribution is leveraged to regularize the neighborhood aggregation of the backbone.
arXiv Detail & Related papers (2024-01-01T13:44:16Z) - MPool: Motif-Based Graph Pooling [0.0]
Graph Neural networks (GNNs) have recently become a powerful technique for many graph-related tasks including graph classification.
We propose a multi-channel Motif-based Graph Pooling method named (MPool)
As the first channel, we develop node selection-based graph pooling by designing a node ranking model considering the motif adjacency of nodes.
As the second channel, we develop cluster-based graph pooling by designing a spectral clustering model using motif adjacency.
As the final layer, the result of each channel is aggregated into the final graph representation.
arXiv Detail & Related papers (2023-03-07T05:21:15Z) - LiftPool: Lifting-based Graph Pooling for Hierarchical Graph
Representation Learning [53.176603566951016]
We propose an enhanced three-stage method via lifting, named LiftPool, to improve hierarchical graph representation.
For each node to be removed, its local information is obtained by subtracting the global information aggregated from its neighboring preserved nodes.
Evaluations on benchmark graph datasets show that LiftPool substantially outperforms the state-of-the-art graph pooling methods in the task of graph classification.
arXiv Detail & Related papers (2022-04-27T12:38:02Z) - Edge but not Least: Cross-View Graph Pooling [76.71497833616024]
This paper presents a cross-view graph pooling (Co-Pooling) method to better exploit crucial graph structure information.
Through cross-view interaction, edge-view pooling and node-view pooling seamlessly reinforce each other to learn more informative graph-level representations.
arXiv Detail & Related papers (2021-09-24T08:01:23Z) - Accurate Learning of Graph Representations with Graph Multiset Pooling [45.72542969364438]
We propose a Graph Multiset Transformer (GMT) that captures the interaction between nodes according to their structural dependencies.
Our experimental results show that GMT significantly outperforms state-of-the-art graph pooling methods on graph classification benchmarks.
arXiv Detail & Related papers (2021-02-23T07:45:58Z) - Second-Order Pooling for Graph Neural Networks [62.13156203025818]
We propose to use second-order pooling as graph pooling, which naturally solves the above challenges.
We show that direct use of second-order pooling with graph neural networks leads to practical problems.
We propose two novel global graph pooling methods based on second-order pooling; namely, bilinear mapping and attentional second-order pooling.
arXiv Detail & Related papers (2020-07-20T20:52:36Z) - Graph Pooling with Node Proximity for Hierarchical Representation
Learning [80.62181998314547]
We propose a novel graph pooling strategy that leverages node proximity to improve the hierarchical representation learning of graph data with their multi-hop topology.
Results show that the proposed graph pooling strategy is able to achieve state-of-the-art performance on a collection of public graph classification benchmark datasets.
arXiv Detail & Related papers (2020-06-19T13:09:44Z) - Structural Temporal Graph Neural Networks for Anomaly Detection in
Dynamic Graphs [54.13919050090926]
We propose an end-to-end structural temporal Graph Neural Network model for detecting anomalous edges in dynamic graphs.
In particular, we first extract the $h$-hop enclosing subgraph centered on the target edge and propose the node labeling function to identify the role of each node in the subgraph.
Based on the extracted features, we utilize Gated recurrent units (GRUs) to capture the temporal information for anomaly detection.
arXiv Detail & Related papers (2020-05-15T09:17:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.