On the Inference of Soft Biometrics from Typing Patterns Collected in a
Multi-device Environment
- URL: http://arxiv.org/abs/2006.09501v1
- Date: Tue, 16 Jun 2020 20:25:58 GMT
- Title: On the Inference of Soft Biometrics from Typing Patterns Collected in a
Multi-device Environment
- Authors: Vishaal Udandarao and Mohit Agrawal and Rajesh Kumar and Rajiv Ratn
Shah
- Abstract summary: In this paper, we study the inference of gender, major/minor, typing style, age, and height from the typing patterns collected from 117 individuals in a multi-device environment.
For classification tasks, we benchmark the performance of six classical machine learning (ML) and four deep learning (DL) classifiers.
The results are promising considering the variety of application scenarios that we have listed in this work.
- Score: 47.37893297206786
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this paper, we study the inference of gender, major/minor (computer
science, non-computer science), typing style, age, and height from the typing
patterns collected from 117 individuals in a multi-device environment. The
inference of the first three identifiers was considered as classification
tasks, while the rest as regression tasks. For classification tasks, we
benchmark the performance of six classical machine learning (ML) and four deep
learning (DL) classifiers. On the other hand, for regression tasks, we
evaluated three ML and four DL-based regressors. The overall experiment
consisted of two text-entry (free and fixed) and four device (Desktop, Tablet,
Phone, and Combined) configurations. The best arrangements achieved accuracies
of 96.15%, 93.02%, and 87.80% for typing style, gender, and major/minor,
respectively, and mean absolute errors of 1.77 years and 2.65 inches for age
and height, respectively. The results are promising considering the variety of
application scenarios that we have listed in this work.
Related papers
- Predictor-Corrector Enhanced Transformers with Exponential Moving Average Coefficient Learning [73.73967342609603]
We introduce a predictor-corrector learning framework to minimize truncation errors.
We also propose an exponential moving average-based coefficient learning method to strengthen our higher-order predictor.
Our model surpasses a robust 3.8B DeepNet by an average of 2.9 SacreBLEU, using only 1/3 parameters.
arXiv Detail & Related papers (2024-11-05T12:26:25Z) - Evaluating Large Language Models on Spatial Tasks: A Multi-Task Benchmarking Study [4.80612909282198]
This study introduces a novel multi-task spatial evaluation dataset.
The dataset encompasses twelve distinct task types, including spatial understanding and path planning.
The study highlights the impact of prompt strategies on model performance in specific tasks.
arXiv Detail & Related papers (2024-08-26T17:25:16Z) - AISPACE at SemEval-2024 task 8: A Class-balanced Soft-voting System for Detecting Multi-generator Machine-generated Text [0.0]
SemEval-2024 Task 8 provides a challenge to detect human-written and machine-generated text.
This paper proposes a system that mainly deals with Subtask B.
It aims to detect if given full text is written by human or is generated by a specific Large Language Model (LLM), which is actually a multi-class text classification task.
arXiv Detail & Related papers (2024-04-01T06:25:47Z) - Dual Pose-invariant Embeddings: Learning Category and Object-specific
Discriminative Representations for Recognition and Retrieval [0.7770029179741429]
We show that it is possible to achieve significant improvements in performance if both the category-based and the object-identity-based embeddings are learned simultaneously during training.
This paper presents an attention-based dual-encoder architecture with specially designed loss functions that optimize the inter- and intra-class distances.
We demonstrate the power of our approach with three challenging multi-view datasets.
arXiv Detail & Related papers (2024-03-01T04:20:13Z) - Large Language Models in the Workplace: A Case Study on Prompt
Engineering for Job Type Classification [58.720142291102135]
This case study investigates the task of job classification in a real-world setting.
The goal is to determine whether an English-language job posting is appropriate for a graduate or entry-level position.
arXiv Detail & Related papers (2023-03-13T14:09:53Z) - HFL at SemEval-2022 Task 8: A Linguistics-inspired Regression Model with
Data Augmentation for Multilingual News Similarity [16.454545004093735]
This paper describes our system designed for SemEval-2022 Task 8: Multilingual News Article Similarity.
We proposed a linguistics-inspired model trained with a few task-specific strategies.
Our system ranked 1st on the leaderboard while achieving a Pearson's Correlation Coefficient of 0.818 on the official evaluation set.
arXiv Detail & Related papers (2022-04-11T03:08:37Z) - Few-shot Learning with Multilingual Language Models [66.49496434282564]
We train multilingual autoregressive language models on a balanced corpus covering a diverse set of languages.
Our largest model sets new state of the art in few-shot learning in more than 20 representative languages.
We present a detailed analysis of where the model succeeds and fails, showing in particular that it enables cross-lingual in-context learning.
arXiv Detail & Related papers (2021-12-20T16:52:35Z) - Meta-Generating Deep Attentive Metric for Few-shot Classification [53.07108067253006]
We present a novel deep metric meta-generation method to generate a specific metric for a new few-shot learning task.
In this study, we structure the metric using a three-layer deep attentive network that is flexible enough to produce a discriminative metric for each task.
We gain surprisingly obvious performance improvement over state-of-the-art competitors, especially in the challenging cases.
arXiv Detail & Related papers (2020-12-03T02:07:43Z) - Competence-Level Prediction and Resume & Job Description Matching Using
Context-Aware Transformer Models [13.302702823447476]
A total of 6,492 resumes are extracted from 24,933 job applications for 252 positions designated into four levels of experience for Clinical Research Coordinators.
A high Kappa score of 61% is achieved for inter-annotator agreement.
arXiv Detail & Related papers (2020-11-05T17:47:03Z) - Device-Robust Acoustic Scene Classification Based on Two-Stage
Categorization and Data Augmentation [63.98724740606457]
We present a joint effort of four groups, namely GT, USTC, Tencent, and UKE, to tackle Task 1 - Acoustic Scene Classification (ASC) in the DCASE 2020 Challenge.
Task 1a focuses on ASC of audio signals recorded with multiple (real and simulated) devices into ten different fine-grained classes.
Task 1b concerns with classification of data into three higher-level classes using low-complexity solutions.
arXiv Detail & Related papers (2020-07-16T15:07:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.