Query-Guided Networks for Few-shot Fine-grained Classification and
Person Search
- URL: http://arxiv.org/abs/2209.10250v1
- Date: Wed, 21 Sep 2022 10:25:32 GMT
- Title: Query-Guided Networks for Few-shot Fine-grained Classification and
Person Search
- Authors: Bharti Munjal and Alessandro Flaborea and Sikandar Amin and Federico
Tombari and Fabio Galasso
- Abstract summary: Few-shot fine-grained classification and person search appear as distinct tasks and literature has treated them separately.
We propose a novel unified Query-Guided Network (QGN) applicable to both tasks.
QGN improves on a few recent few-shot fine-grained datasets, outperforming other techniques on CUB by a large margin.
- Score: 93.80556485668731
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Few-shot fine-grained classification and person search appear as distinct
tasks and literature has treated them separately. But a closer look unveils
important similarities: both tasks target categories that can only be
discriminated by specific object details; and the relevant models should
generalize to new categories, not seen during training.
We propose a novel unified Query-Guided Network (QGN) applicable to both
tasks. QGN consists of a Query-guided Siamese-Squeeze-and-Excitation subnetwork
which re-weights both the query and gallery features across all network layers,
a Query-guided Region Proposal subnetwork for query-specific localisation, and
a Query-guided Similarity subnetwork for metric learning.
QGN improves on a few recent few-shot fine-grained datasets, outperforming
other techniques on CUB by a large margin. QGN also performs competitively on
the person search CUHK-SYSU and PRW datasets, where we perform in-depth
analysis.
Related papers
- Multi-Grained Query-Guided Set Prediction Network for Grounded Multimodal Named Entity Recognition [9.506482334842293]
Grounded Multimodal Named Entity Recognition (GMNER) is an emerging information extraction (IE) task.
Recent unified methods employing machine reading comprehension or sequence generation-based frameworks show limitations in this difficult task.
We propose a novel unified framework named Multi-grained Query-guided Set Prediction Network (MQSPN) to learn appropriate relationships at intra-entity and inter-entity levels.
arXiv Detail & Related papers (2024-07-17T05:42:43Z) - Building Interpretable and Reliable Open Information Retriever for New
Domains Overnight [67.03842581848299]
Information retrieval is a critical component for many down-stream tasks such as open-domain question answering (QA)
We propose an information retrieval pipeline that uses entity/event linking model and query decomposition model to focus more accurately on different information units of the query.
We show that, while being more interpretable and reliable, our proposed pipeline significantly improves passage coverages and denotation accuracies across five IR and QA benchmarks.
arXiv Detail & Related papers (2023-08-09T07:47:17Z) - UniKGQA: Unified Retrieval and Reasoning for Solving Multi-hop Question
Answering Over Knowledge Graph [89.98762327725112]
Multi-hop Question Answering over Knowledge Graph(KGQA) aims to find the answer entities that are multiple hops away from the topic entities mentioned in a natural language question.
We propose UniKGQA, a novel approach for multi-hop KGQA task, by unifying retrieval and reasoning in both model architecture and parameter learning.
arXiv Detail & Related papers (2022-12-02T04:08:09Z) - Knowledge Base Question Answering by Case-based Reasoning over Subgraphs [81.22050011503933]
We show that our model answers queries requiring complex reasoning patterns more effectively than existing KG completion algorithms.
The proposed model outperforms or performs competitively with state-of-the-art models on several KBQA benchmarks.
arXiv Detail & Related papers (2022-02-22T01:34:35Z) - Temporal Query Networks for Fine-grained Video Understanding [88.9877174286279]
We cast this into a query-response mechanism, where each query addresses a particular question, and has its own response label set.
We evaluate the method extensively on the FineGym and Diving48 benchmarks for fine-grained action classification and surpass the state-of-the-art using only RGB features.
arXiv Detail & Related papers (2021-04-19T17:58:48Z) - Hierarchical Representation based Query-Specific Prototypical Network
for Few-Shot Image Classification [5.861206243996454]
Few-shot image classification aims at recognizing unseen categories with a small number of labeled training data.
Recent metric-based frameworks tend to represent a support class by a fixed prototype (e.g., the mean of the support category) and make classification according to the similarities between query instances and support prototypes.
We propose a Hierarchical Representation based Query-Specific Prototypical Network (QPN) to tackle the limitations by generating a region-level prototype for each query sample.
arXiv Detail & Related papers (2021-03-21T12:50:05Z) - Graph Prototypical Networks for Few-shot Learning on Attributed Networks [72.31180045017835]
We propose a graph meta-learning framework -- Graph Prototypical Networks (GPN)
GPN is able to perform textitmeta-learning on an attributed network and derive a highly generalizable model for handling the target classification task.
arXiv Detail & Related papers (2020-06-23T04:13:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.