Preliminary experiments on automatic gender recognition based on online
capital letters
- URL: http://arxiv.org/abs/2203.06265v1
- Date: Fri, 11 Mar 2022 21:55:38 GMT
- Title: Preliminary experiments on automatic gender recognition based on online
capital letters
- Authors: Marcos Faundez-Zanuy, Enric Sesa-Nogueras
- Abstract summary: In this paper we present some experiments to automatically classify online handwritten text based on capital letters.
Although handwritten text is not as discriminative as face or voice, we still found some chance for gender classification based on handwritten text.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: In this paper we present some experiments to automatically classify online
handwritten text based on capital letters. Although handwritten text is not as
discriminative as face or voice, we still found some chance for gender
classification based on handwritten text. Accuracies are up to 74%, even in the
most challenging case of capital letters.
Related papers
- Is Human-Like Text Liked by Humans? Multilingual Human Detection and Preference Against AI [95.81924314159943]
We find that major gaps between human and machine text lie in concreteness, cultural nuances, and diversity.
We also find that humans do not always prefer human-written text, particularly when they cannot clearly identify its source.
arXiv Detail & Related papers (2025-02-17T09:56:46Z) - Improving the Generation Quality of Watermarked Large Language Models
via Word Importance Scoring [81.62249424226084]
Token-level watermarking inserts watermarks in the generated texts by altering the token probability distributions.
This watermarking algorithm alters the logits during generation, which can lead to a downgraded text quality.
We propose to improve the quality of texts generated by a watermarked language model by Watermarking with Importance Scoring (WIS)
arXiv Detail & Related papers (2023-11-16T08:36:00Z) - UIT-HWDB: Using Transferring Method to Construct A Novel Benchmark for
Evaluating Unconstrained Handwriting Image Recognition in Vietnamese [2.8360662552057323]
In Vietnamese, besides the modern Latin characters, there are accent and letter marks together with characters that draw confusion to state-of-the-art handwriting recognition methods.
As a low-resource language, there are not many datasets for researching handwriting recognition in Vietnamese.
Recent works evaluated offline handwriting recognition methods in Vietnamese using images from an online handwriting dataset constructed by connecting pen stroke coordinates without further processing.
This paper proposes the Transferring method to construct a handwriting image dataset that associates crucial natural attributes required for offline handwriting images.
arXiv Detail & Related papers (2022-11-10T08:23:54Z) - PART: Pre-trained Authorship Representation Transformer [64.78260098263489]
Authors writing documents imprint identifying information within their texts: vocabulary, registry, punctuation, misspellings, or even emoji usage.
Previous works use hand-crafted features or classification tasks to train their authorship models, leading to poor performance on out-of-domain authors.
We propose a contrastively trained model fit to learn textbfauthorship embeddings instead of semantics.
arXiv Detail & Related papers (2022-09-30T11:08:39Z) - Gender classification by means of online uppercase handwriting: A
text-dependent allographic approach [0.0]
This paper presents a gender classification schema based on online handwriting.
Using samples acquired with a digital tablet that captures the dynamics of the writing, it classifies the writer as a male or a female.
arXiv Detail & Related papers (2022-03-18T10:37:19Z) - Writer Recognition Using Off-line Handwritten Single Block Characters [59.17685450892182]
We use personal identity numbers consisting of the six digits of the date of birth, DoB.
We evaluate two recognition approaches, one based on handcrafted features that compute directional measurements, and another based on deep features from a ResNet50 model.
Results show the presence of identity-related information in a piece of handwritten information as small as six digits with the DoB.
arXiv Detail & Related papers (2022-01-25T23:04:10Z) - Augmentation of base classifier performance via HMMs on a handwritten
character data set [0.0]
This paper presents results of a study of the performance of several base classifiers for recognition of handwritten characters of the modern Latin alphabet.
The best classification performance after correction was 89.8%, and the average was 68.1%.
arXiv Detail & Related papers (2021-11-17T15:22:47Z) - KOHTD: Kazakh Offline Handwritten Text Dataset [0.0]
We propose an extensive Kazakh offline Handwritten Text dataset (KOHTD)
KOHTD has 3000 handwritten exam papers and more than 140335 segmented images and there are approximately 922010 symbols.
We used a variety of popular text recognition methods for word and line recognition in our studies, including CTC-based and attention-based methods.
arXiv Detail & Related papers (2021-09-22T16:19:38Z) - Support Vector Machine for Handwritten Character Recognition [0.0]
A database of 10,000 character samples of 44 basic Malayalam characters is used in this work.
A discriminate feature set of 64 local and 4 global features are used to train and test SVM classifier and achieved 92.24% accuracy.
arXiv Detail & Related papers (2021-09-07T13:36:12Z) - SmartPatch: Improving Handwritten Word Imitation with Patch
Discriminators [67.54204685189255]
We propose SmartPatch, a new technique increasing the performance of current state-of-the-art methods.
We combine the well-known patch loss with information gathered from the parallel trained handwritten text recognition system.
This leads to a more enhanced local discriminator and results in more realistic and higher-quality generated handwritten words.
arXiv Detail & Related papers (2021-05-21T18:34:21Z) - Multi-Dimensional Gender Bias Classification [67.65551687580552]
Machine learning models can inadvertently learn socially undesirable patterns when training on gender biased text.
We propose a general framework that decomposes gender bias in text along several pragmatic and semantic dimensions.
Using this fine-grained framework, we automatically annotate eight large scale datasets with gender information.
arXiv Detail & Related papers (2020-05-01T21:23:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.