A Modular Unsupervised Framework for Attribute Recognition from Unstructured Text
- URL: http://arxiv.org/abs/2507.03949v1
- Date: Sat, 05 Jul 2025 08:22:52 GMT
- Title: A Modular Unsupervised Framework for Attribute Recognition from Unstructured Text
- Authors: KMA Solaiman,
- Abstract summary: POSID is a framework for extracting structured attribute-based properties from unstructured text.<n>We demonstrate its effectiveness on a missing person use case using the InciText dataset.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: We propose POSID, a modular, lightweight and on-demand framework for extracting structured attribute-based properties from unstructured text without task-specific fine-tuning. While the method is designed to be adaptable across domains, in this work, we evaluate it on human attribute recognition in incident reports. POSID combines lexical and semantic similarity techniques to identify relevant sentences and extract attributes. We demonstrate its effectiveness on a missing person use case using the InciText dataset, achieving effective attribute extraction without supervised training.
Related papers
- Dynamic Context Adaptation for Consistent Role-Playing Agents with Retrieval-Augmented Generations [0.3524869467682149]
AMADEUS is composed of Adaptive Context-aware Text Splitter (ACTS), Guided Selection (GS), and Attribute Extractor (AE)<n>AE identifies a character's general attributes from the chunks retrieved by GS and uses these attributes as a final context to maintain robust persona consistency even when answering out of knowledge questions.<n>CharacterRAG consists of persona documents for 15 distinct fictional characters totaling 976K written characters, and 450 question and answer pairs.
arXiv Detail & Related papers (2025-08-04T03:27:05Z) - FOCUS: Fine-grained Optimization with Semantic Guided Understanding for Pedestrian Attributes Recognition [40.85042685914472]
Pedestrian attribute recognition is a fundamental perception task in intelligent transportation and security.<n>To tackle this fine-grained task, most existing methods focus on extracting regional features to enrich attribute information.<n>We propose the textbfFine-grained textbfOptimization with semantitextbfC gtextbfUided undertextbfStanding (FOCUS) approach for PAR.
arXiv Detail & Related papers (2025-06-28T10:38:54Z) - LATex: Leveraging Attribute-based Text Knowledge for Aerial-Ground Person Re-Identification [63.07563443280147]
We propose a novel framework named LATex for AG-ReID.<n>It adopts prompt-tuning strategies to leverage attribute-based text knowledge.<n>Our framework can fully leverage attribute-based text knowledge to improve the AG-ReID.
arXiv Detail & Related papers (2025-03-31T04:47:05Z) - Spatio-Temporal Side Tuning Pre-trained Foundation Models for Video-based Pedestrian Attribute Recognition [58.79807861739438]
Existing pedestrian recognition (PAR) algorithms are mainly developed based on a static image.
We propose to understand human attributes using video frames that can fully use temporal information.
arXiv Detail & Related papers (2024-04-27T14:43:32Z) - Text Attribute Control via Closed-Loop Disentanglement [72.2786244367634]
We propose a novel approach to achieve a robust control of attributes while enhancing content preservation.
In this paper, we use a semi-supervised contrastive learning method to encourage the disentanglement of attributes in latent spaces.
We conducted experiments on three text datasets, including the Yelp Service review dataset, the Amazon Product review dataset, and the GoEmotions dataset.
arXiv Detail & Related papers (2023-12-01T01:26:38Z) - Joint Event Extraction via Structural Semantic Matching [12.248124072173935]
Event Extraction (EE) is one of the essential tasks in information extraction.
This paper encodes the semantic features of event types and makes structural matching with target text.
arXiv Detail & Related papers (2023-06-06T07:42:39Z) - TextFormer: A Query-based End-to-End Text Spotter with Mixed Supervision [61.186488081379]
We propose TextFormer, a query-based end-to-end text spotter with Transformer architecture.
TextFormer builds upon an image encoder and a text decoder to learn a joint semantic understanding for multi-task modeling.
It allows for mutual training and optimization of classification, segmentation, and recognition branches, resulting in deeper feature sharing.
arXiv Detail & Related papers (2023-06-06T03:37:41Z) - BERM: Training the Balanced and Extractable Representation for Matching
to Improve Generalization Ability of Dense Retrieval [54.66399120084227]
We propose a novel method to improve the generalization of dense retrieval via capturing matching signal called BERM.
Dense retrieval has shown promise in the first-stage retrieval process when trained on in-domain labeled datasets.
arXiv Detail & Related papers (2023-05-18T15:43:09Z) - Investigating the Role of Attribute Context in Vision-Language Models
for Object Recognition and Detection [33.77415850289717]
Methods are mostly evaluated in terms of how well object class names are learned, but captions also contain rich attribute context.
It is unclear how methods use this context in learning, as well as whether models succeed when tasks require attribute and object understanding.
Our results show that attribute context can be wasted when learning alignment for detection, attribute meaning is not adequately considered in embeddings, and describing classes by only their attributes is ineffective.
arXiv Detail & Related papers (2023-03-17T16:14:37Z) - Fine-grained Sentiment Controlled Text Generation [28.20006438705556]
Controlled text generation techniques aim to regulate specific attributes while preserving the attribute independent content.
We propose DE-VAE, a hierarchical framework which captures both information enriched entangled representation and attribute specific disentangled representation.
arXiv Detail & Related papers (2020-06-17T14:17:58Z) - ViTAA: Visual-Textual Attributes Alignment in Person Search by Natural
Language [36.319953919737245]
Person search by natural language aims at retrieving a specific person in a large-scale image pool that matches the given textual descriptions.
We propose an attribute-aligning perspective that allows grounding specific attribute phrases to the corresponding visual regions.
We achieve success as well as the performance boosting by a robust feature learning.
arXiv Detail & Related papers (2020-05-15T02:22:28Z) - Extractive Summarization as Text Matching [123.09816729675838]
This paper creates a paradigm shift with regard to the way we build neural extractive summarization systems.
We formulate the extractive summarization task as a semantic text matching problem.
We have driven the state-of-the-art extractive result on CNN/DailyMail to a new level (44.41 in ROUGE-1)
arXiv Detail & Related papers (2020-04-19T08:27:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.