A Federated Approach for Fine-Grained Classification of Fashion Apparel
- URL: http://arxiv.org/abs/2008.12350v1
- Date: Thu, 27 Aug 2020 19:44:43 GMT
- Title: A Federated Approach for Fine-Grained Classification of Fashion Apparel
- Authors: Tejaswini Mallavarapu, Luke Cranfill, Junggab Son, Eun Hye Kim, Reza
M. Parizi, and John Morris
- Abstract summary: This paper aims to enable an in-depth classification of fashion item attributes within the same category.
The proposed scheme is comprised of three major stages: (a) localization of a target item from an input image using semantic segmentation, (b) detection of human key points (e.g., point of shoulder) using a pre-trained CNN and a bounding box, and (c) three phases to classify the attributes using a combination of algorithmic approaches and deep neural networks.
- Score: 4.328969982631974
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: As online retail services proliferate and are pervasive in modern lives,
applications for classifying fashion apparel features from image data are
becoming more indispensable. Online retailers, from leading companies to
start-ups, can leverage such applications in order to increase profit margin
and enhance the consumer experience. Many notable schemes have been proposed to
classify fashion items, however, the majority of which focused upon classifying
basic-level categories, such as T-shirts, pants, skirts, shoes, bags, and so
forth. In contrast to most prior efforts, this paper aims to enable an in-depth
classification of fashion item attributes within the same category. Beginning
with a single dress, we seek to classify the type of dress hem, the hem length,
and the sleeve length. The proposed scheme is comprised of three major stages:
(a) localization of a target item from an input image using semantic
segmentation, (b) detection of human key points (e.g., point of shoulder) using
a pre-trained CNN and a bounding box, and (c) three phases to classify the
attributes using a combination of algorithmic approaches and deep neural
networks. The experimental results demonstrate that the proposed scheme is
highly effective, with all categories having average precision of above 93.02%,
and outperforms existing Convolutional Neural Networks (CNNs)-based schemes.
Related papers
- Dual-Modal Prompting for Sketch-Based Image Retrieval [76.12076969949062]
We propose a dual-modal CLIP (DP-CLIP) network, in which an adaptive prompting strategy is designed.
We employ a set of images within the target category and the textual category label to respectively construct a set of category-adaptive prompt tokens and channel scales.
Our DP-CLIP outperforms the state-of-the-art fine-grained zero-shot method by 7.3% in Acc.@1 on the Sketchy dataset.
arXiv Detail & Related papers (2024-04-29T13:43:49Z) - Semantic Embedded Deep Neural Network: A Generic Approach to Boost
Multi-Label Image Classification Performance [10.257208600853199]
We introduce a generic semantic-embedding deep neural network to apply the spatial awareness semantic feature.
We observed an Avg.relative improvement of 15.27% in terms of AUC score across all labels compared to the baseline approach.
arXiv Detail & Related papers (2023-05-09T07:44:52Z) - Data Efficient Training with Imbalanced Label Sample Distribution for
Fashion Detection [5.912870746288055]
We propose a state-of-the-art weighted objective function to boost the performance of deep neural networks (DNNs) for multi-label classification with long-tailed data distribution.
Our experiments involve image-based attribute classification of fashion apparels, and the results demonstrate favorable performance for the new weighting method.
arXiv Detail & Related papers (2023-05-07T21:25:09Z) - Design-time Fashion Popularity Forecasting in VR Environments [9.621518697689128]
We develop a computer vision pipeline fine tuned on fashion imagery to extract relevant visual features.
We propose MuQAR, a Multimodal Quasi-AutoRegressive neural network that forecasts the popularity of new garments.
Both the proposed HLS and MuQAR prove capable of surpassing the current state-of-the-art in key benchmark datasets.
arXiv Detail & Related papers (2022-12-14T12:30:03Z) - MMFL-Net: Multi-scale and Multi-granularity Feature Learning for
Cross-domain Fashion Retrieval [3.7045939497992917]
Cross-domain fashion retrieval is a difficult task due to a wide range of consumer-to-shop (C2S) domain discrepancies.
We propose a novel multi-scale and multi-granularity feature learning network (MMFL-Net), which can jointly learn global-local aggregation feature representations of clothing images.
Our proposed model also combines the multi-task attribute recognition and classification module with multi-label semantic attributes and product ID labels.
arXiv Detail & Related papers (2022-10-27T02:25:52Z) - FaD-VLP: Fashion Vision-and-Language Pre-training towards Unified
Retrieval and Captioning [66.38951790650887]
Multimodal tasks in the fashion domain have significant potential for e-commerce.
We propose a novel fashion-specific pre-training framework based on weakly-supervised triplets constructed from fashion image-text pairs.
We show the triplet-based tasks are an effective addition to standard multimodal pre-training tasks.
arXiv Detail & Related papers (2022-10-26T21:01:19Z) - Learning What Not to Segment: A New Perspective on Few-Shot Segmentation [63.910211095033596]
Recently few-shot segmentation (FSS) has been extensively developed.
This paper proposes a fresh and straightforward insight to alleviate the problem.
In light of the unique nature of the proposed approach, we also extend it to a more realistic but challenging setting.
arXiv Detail & Related papers (2022-03-15T03:08:27Z) - AF$_2$: Adaptive Focus Framework for Aerial Imagery Segmentation [86.44683367028914]
Aerial imagery segmentation has some unique challenges, the most critical one among which lies in foreground-background imbalance.
We propose Adaptive Focus Framework (AF$), which adopts a hierarchical segmentation procedure and focuses on adaptively utilizing multi-scale representations.
AF$ has significantly improved the accuracy on three widely used aerial benchmarks, as fast as the mainstream method.
arXiv Detail & Related papers (2022-02-18T10:14:45Z) - Grasp-Oriented Fine-grained Cloth Segmentation without Real Supervision [66.56535902642085]
This paper tackles the problem of fine-grained region detection in deformed clothes using only a depth image.
We define up to 6 semantic regions of varying extent, including edges on the neckline, sleeve cuffs, and hem, plus top and bottom grasping points.
We introduce a U-net based network to segment and label these parts.
We show that training our network solely with synthetic data and the proposed DA yields results competitive with models trained on real data.
arXiv Detail & Related papers (2021-10-06T16:31:20Z) - Learning Diverse Fashion Collocation by Neural Graph Filtering [78.9188246136867]
We propose a novel fashion collocation framework, Neural Graph Filtering, that models a flexible set of fashion items via a graph neural network.
By applying symmetric operations on the edge vectors, this framework allows varying numbers of inputs/outputs and is invariant to their ordering.
We evaluate the proposed approach on three popular benchmarks, the Polyvore dataset, the Polyvore-D dataset, and our reorganized Amazon Fashion dataset.
arXiv Detail & Related papers (2020-03-11T16:17:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.