ShufaNet: Classification method for calligraphers who have reached the
professional level
- URL: http://arxiv.org/abs/2111.11350v1
- Date: Mon, 22 Nov 2021 16:55:31 GMT
- Title: ShufaNet: Classification method for calligraphers who have reached the
professional level
- Authors: Ge Yunfei, Diao Changyu, Li Min, Yu Ruohan, Qiu Linshan and Xu
Duanqing
- Abstract summary: We propose a novel method, ShufaNet, to classify Chinese calligraphers' styles based on metric learning.
Our method achieved 65% accuracy rate in our data set for few-shot learning, surpassing resNet and other mainstream CNNs.
- Score: 0.0
- License: http://creativecommons.org/publicdomain/zero/1.0/
- Abstract: The authenticity of calligraphy is significant but difficult task in the
realm of art, where the key problem is the few-shot classification of
calligraphy. We propose a novel method, ShufaNet ("Shufa" is the pinyin of
Chinese calligraphy), to classify Chinese calligraphers' styles based on metric
learning in the case of few-shot, whose classification accuracy exceeds the
level of students majoring in calligraphy. We present a new network
architecture, including the unique expression of the style of handwriting fonts
called ShufaLoss and the calligraphy category information as prior knowledge.
Meanwhile, we modify the spatial attention module and create ShufaAttention for
handwriting fonts based on the traditional Chinese nine Palace thought. For the
training of the model, we build a calligraphers' data set. Our method achieved
65% accuracy rate in our data set for few-shot learning, surpassing resNet and
other mainstream CNNs. Meanwhile, we conducted battle for calligraphy major
students, and finally surpassed them. This is the first attempt of deep
learning in the field of calligrapher classification, and we expect to provide
ideas for subsequent research.
Related papers
- Moyun: A Diffusion-Based Model for Style-Specific Chinese Calligraphy Generation [10.7430517947254]
'Moyun' can effectively control the generation process and produce calligraphy in the specified style.
Even for calligraphy the calligrapher has not written, 'Moyun' can generate calligraphy that matches the style of the calligrapher.
arXiv Detail & Related papers (2024-10-10T05:14:03Z) - Few-shot Calligraphy Style Learning [0.0]
"Presidifussion" is a novel approach to learning and replicating the unique style of calligraphy of President Xu.
We introduce innovative techniques of font image conditioning and stroke information conditioning, enabling the model to capture the intricate structural elements of Chinese characters.
This work not only presents a breakthrough in the digital preservation of calligraphic art but also sets a new standard for data-efficient generative modeling in the domain of cultural heritage digitization.
arXiv Detail & Related papers (2024-04-26T07:17:09Z) - MetaScript: Few-Shot Handwritten Chinese Content Generation via
Generative Adversarial Networks [15.037121719502606]
We propose MetaScript, a novel content generation system designed to address the diminishing presence of personal handwriting styles in the digital representation of Chinese characters.
Our approach harnesses the power of few-shot learning to generate Chinese characters that retain the individual's unique handwriting style and maintain the efficiency of digital typing.
arXiv Detail & Related papers (2023-12-25T17:31:19Z) - Style Generation in Robot Calligraphy with Deep Generative Adversarial
Networks [15.199472080437527]
The number of Chinese characters is tens of thousands, which leads to difficulties in the generation of a style consistent Chinese calligraphic font with over 6000 characters.
This paper proposes an automatic calligraphy generation model based on deep generative adversarial networks (deepGAN) that can generate style calligraphy fonts with professional standards.
arXiv Detail & Related papers (2023-12-15T10:35:30Z) - Calliffusion: Chinese Calligraphy Generation and Style Transfer with
Diffusion Modeling [1.856334276134661]
We propose Calliffusion, a system for generating high-quality Chinese calligraphy using diffusion models.
Our model architecture is based on DDPM (Denoising Diffusion Probabilistic Models)
arXiv Detail & Related papers (2023-05-30T15:34:45Z) - ALADIN-NST: Self-supervised disentangled representation learning of
artistic style through Neural Style Transfer [60.6863849241972]
We learn a representation of visual artistic style more strongly disentangled from the semantic content depicted in an image.
We show that strongly addressing the disentanglement of style and content leads to large gains in style-specific metrics.
arXiv Detail & Related papers (2023-04-12T10:33:18Z) - PART: Pre-trained Authorship Representation Transformer [64.78260098263489]
Authors writing documents imprint identifying information within their texts: vocabulary, registry, punctuation, misspellings, or even emoji usage.
Previous works use hand-crafted features or classification tasks to train their authorship models, leading to poor performance on out-of-domain authors.
We propose a contrastively trained model fit to learn textbfauthorship embeddings instead of semantics.
arXiv Detail & Related papers (2022-09-30T11:08:39Z) - Letter-level Online Writer Identification [86.13203975836556]
We focus on a novel problem, letter-level online writer-id, which requires only a few trajectories of written letters as identification cues.
A main challenge is that a person often writes a letter in different styles from time to time.
We refer to this problem as the variance of online writing styles (Var-O-Styles)
arXiv Detail & Related papers (2021-12-06T07:21:53Z) - Scalable Font Reconstruction with Dual Latent Manifolds [55.29525824849242]
We propose a deep generative model that performs typography analysis and font reconstruction.
Our approach enables us to massively scale up the number of character types we can effectively model.
We evaluate on the task of font reconstruction over various datasets representing character types of many languages.
arXiv Detail & Related papers (2021-09-10T20:37:43Z) - Font Completion and Manipulation by Cycling Between Multi-Modality
Representations [113.26243126754704]
We innovate to explore the generation of font glyphs as 2D graphic objects with the graph as an intermediate representation.
We formulate a cross-modality cycled image-to-image structure with a graph between an image encoder and an image.
Our model generates improved results than both image-to-image baseline and previous state-of-the-art methods for glyph completion.
arXiv Detail & Related papers (2021-08-30T02:43:29Z) - Grafit: Learning fine-grained image representations with coarse labels [114.17782143848315]
This paper tackles the problem of learning a finer representation than the one provided by training labels.
By jointly leveraging the coarse labels and the underlying fine-grained latent space, it significantly improves the accuracy of category-level retrieval methods.
arXiv Detail & Related papers (2020-11-25T19:06:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.