Sigma-lognormal modeling of speech
- URL: http://arxiv.org/abs/2401.17320v1
- Date: Sat, 27 Jan 2024 18:00:20 GMT
- Title: Sigma-lognormal modeling of speech
- Authors: C. Carmona-Duarte, M.A.Ferrer, R. Plamondon, A. Gomez-Rodellar, P.
Gomez-Vilda
- Abstract summary: This work presents a speech kinematics based model that can be used to study, analyze, and complex speech kinematics in a simplified manner.
A method based on the kinematic theory of rapid human movements and its associated Sigma lognormal model are applied to describe the impulse response of the neuromuscular networks involved in speech.
Experiments carried out with the (English) VTR TIMIT database and the (German) Saarbrucken Voice Database, including people of different ages, corroborate the link between the extracted parameters and aging.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Human movement studies and analyses have been fundamental in many scientific
domains, ranging from neuroscience to education, pattern recognition to
robotics, health care to sports, and beyond. Previous speech motor models were
proposed to understand how speech movement is produced and how the resulting
speech varies when some parameters are changed. However, the inverse approach,
in which the muscular response parameters and the subject's age are derived
from real continuous speech, is not possible with such models. Instead, in the
handwriting field, the kinematic theory of rapid human movements and its
associated Sigma-lognormal model have been applied successfully to obtain the
muscular response parameters. This work presents a speech kinematics based
model that can be used to study, analyze, and reconstruct complex speech
kinematics in a simplified manner. A method based on the kinematic theory of
rapid human movements and its associated Sigma lognormal model are applied to
describe and to parameterize the asymptotic impulse response of the
neuromuscular networks involved in speech as a response to a neuromotor
command. The method used to carry out transformations from formants to a
movement observation is also presented. Experiments carried out with the
(English) VTR TIMIT database and the (German) Saarbrucken Voice Database,
including people of different ages, with and without laryngeal pathologies,
corroborate the link between the extracted parameters and aging, on the one
hand, and the proportion between the first and second formants required in
applying the kinematic theory of rapid human movements, on the other. The
results should drive innovative developments in the modeling and understanding
of speech kinematics.
Related papers
- Discovering dynamical laws for speech gestures [0.0]
We discover models in the form of symbolic equations that govern articulatory gestures during speech.
A sparse symbolic regression algorithm is used to discover models from kinematic data on the tongue and lips.
arXiv Detail & Related papers (2025-04-07T09:03:32Z) - Reinforcement learning-based motion imitation for physiologically plausible musculoskeletal motor control [47.423243831156285]
We present a model-free motion imitation framework (KINESIS) to advance the understanding of muscle-based motor control.
We demonstrate that KINESIS achieves strong imitation performance on 1.9 hours of motion capture data.
KINESIS generates muscle activity patterns that correlate well with human EMG activity.
arXiv Detail & Related papers (2025-03-18T18:37:49Z) - Continual Learning from Simulated Interactions via Multitask Prospective Rehearsal for Bionic Limb Behavior Modeling [0.7922558880545526]
We introduce a model for human behavior in the context of bionic prosthesis control.
We propose a multitasking, continually adaptive model that anticipates and refines movements over time.
We validate our model through experiments on real-world human gait datasets, including transtibial amputees.
arXiv Detail & Related papers (2024-05-02T09:22:54Z) - MindBridge: A Cross-Subject Brain Decoding Framework [60.58552697067837]
Brain decoding aims to reconstruct stimuli from acquired brain signals.
Currently, brain decoding is confined to a per-subject-per-model paradigm.
We present MindBridge, that achieves cross-subject brain decoding by employing only one model.
arXiv Detail & Related papers (2024-04-11T15:46:42Z) - The bionic neural network for external simulation of human locomotor
system [2.6311880922890842]
This paper proposes a physics-informed deep learning method based on musculoskeletal (MSK) modeling to predict joint motion and muscle forces.
The method can effectively identify subject-specific MSK physiological parameters and the trained physics-informed forward-dynamics surrogate yields accurate motion and muscle forces predictions.
arXiv Detail & Related papers (2023-09-11T23:02:56Z) - Persistent-Transient Duality: A Multi-mechanism Approach for Modeling
Human-Object Interaction [58.67761673662716]
Humans are highly adaptable, swiftly switching between different modes to handle different tasks, situations and contexts.
In Human-object interaction (HOI) activities, these modes can be attributed to two mechanisms: (1) the large-scale consistent plan for the whole activity and (2) the small-scale children interactive actions that start and end along the timeline.
This work proposes to model two concurrent mechanisms that jointly control human motion.
arXiv Detail & Related papers (2023-07-24T12:21:33Z) - A Physics-Informed Low-Shot Learning For sEMG-Based Estimation of Muscle
Force and Joint Kinematics [4.878073267556235]
Muscle force and joint kinematics estimation from surface electromyography (sEMG) are essential for real-time biomechanical analysis.
Recent advances in deep neural networks (DNNs) have shown the potential to improve biomechanical analysis in a fully automated and reproducible manner.
This paper presents a novel physics-informed low-shot learning method for sEMG-based estimation of muscle force and joint kinematics.
arXiv Detail & Related papers (2023-07-08T23:01:12Z) - Modelling Human Visual Motion Processing with Trainable Motion Energy
Sensing and a Self-attention Network [1.9458156037869137]
We propose an image-computable model of human motion perception by bridging the gap between biological and computer vision models.
This model architecture aims to capture the computations in V1-MT, the core structure for motion perception in the biological visual system.
In silico neurophysiology reveals that our model's unit responses are similar to mammalian neural recordings regarding motion pooling and speed tuning.
arXiv Detail & Related papers (2023-05-16T04:16:07Z) - Deep state-space modeling for explainable representation, analysis, and
generation of professional human poses [0.0]
This paper introduces three novel methods for creating explainable representations of human movement.
The trained models are used for the full-body dexterity analysis of expert professionals.
arXiv Detail & Related papers (2023-04-13T08:13:10Z) - Skeleton2Humanoid: Animating Simulated Characters for
Physically-plausible Motion In-betweening [59.88594294676711]
Modern deep learning based motion synthesis approaches barely consider the physical plausibility of synthesized motions.
We propose a system Skeleton2Humanoid'' which performs physics-oriented motion correction at test time.
Experiments on the challenging LaFAN1 dataset show our system can outperform prior methods significantly in terms of both physical plausibility and accuracy.
arXiv Detail & Related papers (2022-10-09T16:15:34Z) - Freeform Body Motion Generation from Speech [53.50388964591343]
Body motion generation from speech is inherently difficult due to the non-deterministic mapping from speech to body motions.
We introduce a novel freeform motion generation model (FreeMo) by equipping a two-stream architecture.
Experiments demonstrate the superior performance against several baselines.
arXiv Detail & Related papers (2022-03-04T13:03:22Z) - From Motion to Muscle [0.0]
We show that muscle activity can be artificially generated based on motion features such as position, velocity, and acceleration.
The model achieves remarkable precision for previously trained movements and maintains significantly high precision for new movements that have not been previously trained.
arXiv Detail & Related papers (2022-01-27T13:30:17Z) - Model-based analysis of brain activity reveals the hierarchy of language
in 305 subjects [82.81964713263483]
A popular approach to decompose the neural bases of language consists in correlating, across individuals, the brain responses to different stimuli.
Here, we show that a model-based approach can reach equivalent results within subjects exposed to natural stimuli.
arXiv Detail & Related papers (2021-10-12T15:30:21Z) - Continuous Learning and Adaptation with Membrane Potential and
Activation Threshold Homeostasis [91.3755431537592]
This paper presents the Membrane Potential and Activation Threshold Homeostasis (MPATH) neuron model.
The model allows neurons to maintain a form of dynamic equilibrium by automatically regulating their activity when presented with input.
Experiments demonstrate the model's ability to adapt to and continually learn from its input.
arXiv Detail & Related papers (2021-04-22T04:01:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.