ToddlerAct: A Toddler Action Recognition Dataset for Gross Motor Development Assessment
- URL: http://arxiv.org/abs/2409.00349v1
- Date: Sat, 31 Aug 2024 04:31:46 GMT
- Title: ToddlerAct: A Toddler Action Recognition Dataset for Gross Motor Development Assessment
- Authors: Hsiang-Wei Huang, Jiacheng Sun, Cheng-Yen Yang, Zhongyu Jiang, Li-Yu Huang, Jenq-Neng Hwang, Yu-Ching Yeh,
- Abstract summary: ToddlerAct is a toddler gross motor action recognition dataset.
We describe the data collection process, annotation methodology, and dataset characteristics.
Our findings highlight the importance of domain-specific datasets for accurate assessment of gross motor development in toddlers.
- Score: 26.16139407666899
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Assessing gross motor development in toddlers is crucial for understanding their physical development and identifying potential developmental delays or disorders. However, existing datasets for action recognition primarily focus on adults, lacking the diversity and specificity required for accurate assessment in toddlers. In this paper, we present ToddlerAct, a toddler gross motor action recognition dataset, aiming to facilitate research in early childhood development. The dataset consists of video recordings capturing a variety of gross motor activities commonly observed in toddlers aged under three years old. We describe the data collection process, annotation methodology, and dataset characteristics. Furthermore, we benchmarked multiple state-of-the-art methods including image-based and skeleton-based action recognition methods on our datasets. Our findings highlight the importance of domain-specific datasets for accurate assessment of gross motor development in toddlers and lay the foundation for future research in this critical area. Our dataset will be available at https://github.com/ipl-uw/ToddlerAct.
Related papers
- Neural Lineage [56.34149480207817]
We introduce a novel task known as neural lineage detection, aiming at discovering lineage relationships between parent and child models.
For practical convenience, we introduce a learning-free approach, which integrates an approximation of the finetuning process into the neural network representation similarity metrics.
For the pursuit of accuracy, we introduce a learning-based lineage detector comprising encoders and a transformer detector.
arXiv Detail & Related papers (2024-06-17T01:11:53Z) - The BabyView dataset: High-resolution egocentric videos of infants' and young children's everyday experiences [8.952954042940368]
We release the largest developmental egocentric video dataset to date -- the BabyView dataset.
This 493 hour dataset includes egocentric videos from children spanning 6 months - 5 years of age in both longitudinal, at-home contexts and in a preschool environment.
We train self-supervised language and vision models and evaluate their transfer to out-of-distribution tasks including syntactic structure learning, object recognition, depth estimation, and image segmentation.
arXiv Detail & Related papers (2024-06-14T23:52:27Z) - Data Augmentation in Human-Centric Vision [54.97327269866757]
This survey presents a comprehensive analysis of data augmentation techniques in human-centric vision tasks.
It delves into a wide range of research areas including person ReID, human parsing, human pose estimation, and pedestrian detection.
Our work categorizes data augmentation methods into two main types: data generation and data perturbation.
arXiv Detail & Related papers (2024-03-13T16:05:18Z) - Modeling 3D Infant Kinetics Using Adaptive Graph Convolutional Networks [2.2279946664123664]
Spontaneous motor activity, orkinetics', is shown to provide a powerful surrogate measure of upcoming neurodevelopment.
Here, we follow an alternative approach, predicting infants' maturation based on data-driven evaluation of individual motor patterns.
arXiv Detail & Related papers (2024-02-22T09:34:48Z) - Challenges in Video-Based Infant Action Recognition: A Critical
Examination of the State of the Art [9.327466428403916]
We introduce a groundbreaking dataset called InfActPrimitive'', encompassing five significant infant milestone action categories.
We conduct an extensive comparative analysis employing cutting-edge skeleton-based action recognition models.
Our findings reveal that, although the PoseC3D model achieves the highest accuracy at approximately 71%, the remaining models struggle to accurately capture the dynamics of infant actions.
arXiv Detail & Related papers (2023-11-21T02:36:47Z) - Towards early prediction of neurodevelopmental disorders: Computational
model for Face Touch and Self-adaptors in Infants [0.0]
evaluating a baby's movements is key to understanding possible risks of developmental disorders in their growth.
Previous research in psychology has shown that measuring specific movements or gestures such as face touches in babies is essential to analyse how babies understand themselves and their context.
This research proposes the first automatic approach that detects face touches from video recordings by tracking infants' movements and gestures.
arXiv Detail & Related papers (2023-01-07T18:08:43Z) - NeuroExplainer: Fine-Grained Attention Decoding to Uncover Cortical
Development Patterns of Preterm Infants [73.85768093666582]
We propose an explainable geometric deep network dubbed NeuroExplainer.
NeuroExplainer is used to uncover altered infant cortical development patterns associated with preterm birth.
arXiv Detail & Related papers (2023-01-01T12:48:12Z) - Voxel-level Importance Maps for Interpretable Brain Age Estimation [70.5330922395729]
We focus on the task of brain age regression from 3D brain Magnetic Resonance (MR) images using a Convolutional Neural Network, termed prediction model.
We implement a noise model which aims to add as much noise as possible to the input without harming the performance of the prediction model.
We test our method on 13,750 3D brain MR images from the UK Biobank, and our findings are consistent with the existing neuropathology literature.
arXiv Detail & Related papers (2021-08-11T18:08:09Z) - FP-Age: Leveraging Face Parsing Attention for Facial Age Estimation in
the Wild [50.8865921538953]
We propose a method to explicitly incorporate facial semantics into age estimation.
We design a face parsing-based network to learn semantic information at different scales.
We show that our method consistently outperforms all existing age estimation methods.
arXiv Detail & Related papers (2021-06-21T14:31:32Z) - Child-Computer Interaction: Recent Works, New Dataset, and Age Detection [6.061943386819384]
ChildCI aims to generate a better understanding of the cognitive and neuromotor development of children while interacting with mobile devices.
In our framework children interact with a tablet device, using both a pen stylus and the finger, performing different tasks that require different levels of neuromotor and cognitive skills.
ChildCIdb comprises more than 400 children from 18 months to 8 years old, considering therefore the first three development stages of the Piaget's theory.
arXiv Detail & Related papers (2021-02-02T09:51:58Z) - A Developmental Neuro-Robotics Approach for Boosting the Recognition of
Handwritten Digits [91.3755431537592]
Recent evidence shows that a simulation of the children's embodied strategies can improve the machine intelligence too.
This article explores the application of embodied strategies to convolutional neural network models in the context of developmental neuro-robotics.
arXiv Detail & Related papers (2020-03-23T14:55:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.