Fashion Landmark Detection and Category Classification for Robotics
- URL: http://arxiv.org/abs/2003.11827v1
- Date: Thu, 26 Mar 2020 10:53:26 GMT
- Title: Fashion Landmark Detection and Category Classification for Robotics
- Authors: Thomas Ziegler, Judith Butepage, Michael C. Welle, Anastasiia Varava,
Tonci Novkovic and Danica Kragic
- Abstract summary: We focus on techniques that can generalize from large-scale fashion datasets to less structured, small datasets collected in a robotic lab.
Our experiments demonstrate that our approach outperforms stateof-the art models with respect to clothing category classification and fashion landmark detection when tested on previously unseen datasets.
- Score: 15.134184609780924
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Research on automated, image based identification of clothing categories and
fashion landmarks has recently gained significant interest due to its potential
impact on areas such as robotic clothing manipulation, automated clothes
sorting and recycling, and online shopping. Several public and annotated
fashion datasets have been created to facilitate research advances in this
direction. In this work, we make the first step towards leveraging the data and
techniques developed for fashion image analysis in vision-based robotic
clothing manipulation tasks. We focus on techniques that can generalize from
large-scale fashion datasets to less structured, small datasets collected in a
robotic lab. Specifically, we propose training data augmentation methods such
as elastic warping, and model adjustments such as rotation invariant
convolutions to make the model generalize better. Our experiments demonstrate
that our approach outperforms stateof-the art models with respect to clothing
category classification and fashion landmark detection when tested on
previously unseen datasets. Furthermore, we present experimental results on a
new dataset composed of images where a robot holds different garments,
collected in our lab.
Related papers
- SKT: Integrating State-Aware Keypoint Trajectories with Vision-Language Models for Robotic Garment Manipulation [82.61572106180705]
This paper presents a unified approach using vision-language models (VLMs) to improve keypoint prediction across various garment categories.
We created a large-scale synthetic dataset using advanced simulation techniques, allowing scalable training without extensive real-world data.
Experimental results indicate that the VLM-based method significantly enhances keypoint detection accuracy and task success rates.
arXiv Detail & Related papers (2024-09-26T17:26:16Z) - UniFolding: Towards Sample-efficient, Scalable, and Generalizable
Robotic Garment Folding [53.38503172679482]
UniFolding is a sample-efficient, scalable, and generalizable robotic system for unfolding and folding garments.
UniFolding employs the proposed UFONet neural network to integrate unfolding and folding decisions into a single policy model.
The system is tested on two garment types: long-sleeve and short-sleeve shirts.
arXiv Detail & Related papers (2023-11-02T14:25:10Z) - Data Efficient Training with Imbalanced Label Sample Distribution for
Fashion Detection [5.912870746288055]
We propose a state-of-the-art weighted objective function to boost the performance of deep neural networks (DNNs) for multi-label classification with long-tailed data distribution.
Our experiments involve image-based attribute classification of fashion apparels, and the results demonstrate favorable performance for the new weighting method.
arXiv Detail & Related papers (2023-05-07T21:25:09Z) - Automatically Classifying Emotions based on Text: A Comparative
Exploration of Different Datasets [0.0]
We focus on three datasets that were recently presented in the related literature.
We explore the performance of traditional as well as state-of-the-art deep learning models in the presence of different characteristics in the data.
Our experimental work shows that state-of-the-art models such as RoBERTa perform the best for all cases.
arXiv Detail & Related papers (2023-02-28T16:34:55Z) - RT-1: Robotics Transformer for Real-World Control at Scale [98.09428483862165]
We present a model class, dubbed Robotics Transformer, that exhibits promising scalable model properties.
We verify our conclusions in a study of different model classes and their ability to generalize as a function of the data size, model size, and data diversity based on a large-scale data collection on real robots performing real-world tasks.
arXiv Detail & Related papers (2022-12-13T18:55:15Z) - StyleGAN-Human: A Data-Centric Odyssey of Human Generation [96.7080874757475]
This work takes a data-centric perspective and investigates multiple critical aspects in "data engineering"
We collect and annotate a large-scale human image dataset with over 230K samples capturing diverse poses and textures.
We rigorously investigate three essential factors in data engineering for StyleGAN-based human generation, namely data size, data distribution, and data alignment.
arXiv Detail & Related papers (2022-04-25T17:55:08Z) - Using Artificial Intelligence to Analyze Fashion Trends [0.76146285961466]
This study proposes a data-driven quantitative abstracting approach using an artificial intelligence (A.I.) algorithm.
An A.I. model was trained on fashion images from a large-scale dataset under different scenarios.
It was found that the A.I. model can generate rich descriptions of detected regions and accurately bind the garments in the images.
arXiv Detail & Related papers (2020-05-03T04:46:12Z) - Learning Diverse Fashion Collocation by Neural Graph Filtering [78.9188246136867]
We propose a novel fashion collocation framework, Neural Graph Filtering, that models a flexible set of fashion items via a graph neural network.
By applying symmetric operations on the edge vectors, this framework allows varying numbers of inputs/outputs and is invariant to their ordering.
We evaluate the proposed approach on three popular benchmarks, the Polyvore dataset, the Polyvore-D dataset, and our reorganized Amazon Fashion dataset.
arXiv Detail & Related papers (2020-03-11T16:17:08Z) - A Strong Baseline for Fashion Retrieval with Person Re-Identification
Models [0.0]
Fashion retrieval is the challenging task of finding an exact match for fashion items contained within an image.
We introduce a simple baseline model for fashion retrieval, significantly outperforming previous state-of-the-art results.
We conduct in-depth experiments on Street2Shop and DeepFashion datasets and validate our results.
arXiv Detail & Related papers (2020-03-09T12:50:15Z) - Learning Predictive Models From Observation and Interaction [137.77887825854768]
Learning predictive models from interaction with the world allows an agent, such as a robot, to learn about how the world works.
However, learning a model that captures the dynamics of complex skills represents a major challenge.
We propose a method to augment the training set with observational data of other agents, such as humans.
arXiv Detail & Related papers (2019-12-30T01:10:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.