KaoKore: A Pre-modern Japanese Art Facial Expression Dataset
- URL: http://arxiv.org/abs/2002.08595v1
- Date: Thu, 20 Feb 2020 07:22:13 GMT
- Title: KaoKore: A Pre-modern Japanese Art Facial Expression Dataset
- Authors: Yingtao Tian, Chikahiko Suzuki, Tarin Clanuwat, Mikel Bober-Irizar,
Alex Lamb, Asanobu Kitamoto
- Abstract summary: We propose a new dataset KaoKore which consists of faces extracted from pre-modern Japanese artwork.
We demonstrate its value as both a dataset for image classification as well as a creative and artistic dataset, which we explore using generative models.
- Score: 8.987910033541239
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: From classifying handwritten digits to generating strings of text, the
datasets which have received long-time focus from the machine learning
community vary greatly in their subject matter. This has motivated a renewed
interest in building datasets which are socially and culturally relevant, so
that algorithmic research may have a more direct and immediate impact on
society. One such area is in history and the humanities, where better and
relevant machine learning models can accelerate research across various fields.
To this end, newly released benchmarks and models have been proposed for
transcribing historical Japanese cursive writing, yet for the field as a whole
using machine learning for historical Japanese artworks still remains largely
uncharted. To bridge this gap, in this work we propose a new dataset KaoKore
which consists of faces extracted from pre-modern Japanese artwork. We
demonstrate its value as both a dataset for image classification as well as a
creative and artistic dataset, which we explore using generative models.
Dataset available at https://github.com/rois-codh/kaokore
Related papers
- Unlocking Comics: The AI4VA Dataset for Visual Understanding [62.345344799258804]
This paper presents a novel dataset comprising Franco-Belgian comics from the 1950s annotated for tasks including depth estimation, semantic segmentation, saliency detection, and character identification.
It consists of two distinct and consistent styles and incorporates object concepts and labels taken from natural images.
By including such diverse information across styles, this dataset not only holds promise for computational creativity but also offers avenues for the digitization of art and storytelling innovation.
arXiv Detail & Related papers (2024-10-27T14:27:05Z) - Historical Printed Ornaments: Dataset and Tasks [12.309062405756686]
This paper aims to develop the study of historical printed ornaments with modern unsupervised computer vision.
We highlight three complex tasks that are of critical interest to book historians: clustering, element discovery, and unsupervised change localization.
Our Rey's Ornaments dataset is designed to be a representative example of a set of ornaments historians would be interested in.
arXiv Detail & Related papers (2024-08-16T09:54:12Z) - Open the Data! Chuvash Datasets [50.59120569845975]
We introduce four comprehensive datasets for the Chuvash language.
These datasets include a monolingual dataset, a parallel dataset with Russian, a parallel dataset with English, and an audio dataset.
arXiv Detail & Related papers (2024-05-31T07:51:19Z) - Prompt me a Dataset: An investigation of text-image prompting for
historical image dataset creation using foundation models [0.9065034043031668]
We present a pipeline for image extraction from historical documents using foundation models.
We evaluate text-image prompts and their effectiveness on humanities datasets of varying levels of complexity.
arXiv Detail & Related papers (2023-09-04T15:37:03Z) - ArcAid: Analysis of Archaeological Artifacts using Drawings [23.906975910478142]
Archaeology is an intriguing domain for computer vision.
It suffers not only from shortage in (labeled) data, but also from highly-challenging data, which is often extremely abraded and damaged.
This paper proposes a novel semi-supervised model for classification and retrieval of images of archaeological artifacts.
arXiv Detail & Related papers (2022-11-17T11:57:01Z) - Unsupervised Neural Stylistic Text Generation using Transfer learning
and Adapters [66.17039929803933]
We propose a novel transfer learning framework which updates only $0.3%$ of model parameters to learn style specific attributes for response generation.
We learn style specific attributes from the PERSONALITY-CAPTIONS dataset.
arXiv Detail & Related papers (2022-10-07T00:09:22Z) - Learning Co-segmentation by Segment Swapping for Retrieval and Discovery [67.6609943904996]
The goal of this work is to efficiently identify visually similar patterns from a pair of images.
We generate synthetic training pairs by selecting object segments in an image and copy-pasting them into another image.
We show our approach provides clear improvements for artwork details retrieval on the Brueghel dataset.
arXiv Detail & Related papers (2021-10-29T16:51:16Z) - REGRAD: A Large-Scale Relational Grasp Dataset for Safe and
Object-Specific Robotic Grasping in Clutter [52.117388513480435]
We present a new dataset named regrad to sustain the modeling of relationships among objects and grasps.
Our dataset is collected in both forms of 2D images and 3D point clouds.
Users are free to import their own object models for the generation of as many data as they want.
arXiv Detail & Related papers (2021-04-29T05:31:21Z) - Deep Learning for Text Style Transfer: A Survey [71.8870854396927]
Text style transfer is an important task in natural language generation, which aims to control certain attributes in the generated text.
We present a systematic survey of the research on neural text style transfer, spanning over 100 representative articles since the first neural text style transfer work in 2017.
We discuss the task formulation, existing datasets and subtasks, evaluation, as well as the rich methodologies in the presence of parallel and non-parallel data.
arXiv Detail & Related papers (2020-11-01T04:04:43Z) - Russian Natural Language Generation: Creation of a Language Modelling
Dataset and Evaluation with Modern Neural Architectures [0.0]
We provide a novel reference dataset for Russian language modeling.
We experiment with popular modern methods for text generation, namely variational autoencoders, and generative adversarial networks.
We evaluate the generated text regarding metrics such as perplexity, grammatical correctness and lexical diversity.
arXiv Detail & Related papers (2020-05-05T20:20:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.