Can Human Sex Be Learned Using Only 2D Body Keypoint Estimations?
- URL: http://arxiv.org/abs/2011.03104v2
- Date: Wed, 20 Apr 2022 07:05:43 GMT
- Title: Can Human Sex Be Learned Using Only 2D Body Keypoint Estimations?
- Authors: Kristijan Bartol and Tomislav Pribanic and David Bojanic and Tomislav Petkovic
- Abstract summary: We present a fully automated sex classification system using only 2D keypoints.
A keypoint set consists of 15 joints, and the keypoint estimations are obtained with the OpenPose 2D keypoint detector.
We train a deep learning model to distinguish males from females, using the keypoints as input and binary labels as output.
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this paper, we analyze the human male and female sex recognition problem and
present a fully automated classification system using only 2D keypoints. The
keypoints represent human joints. A keypoint set consists of 15 joints, and the
keypoint estimations are obtained using the OpenPose 2D keypoint detector. We
train a deep learning model to distinguish males from females, using the
keypoints as input and binary labels as output. We use two public datasets in
the experimental section: 3DPeople and PETA. On the PETA dataset, we report 77%
accuracy. We provide model performance details on both PETA and 3DPeople. To
measure the effect of noisy 2D keypoint detections on performance, we run
separate experiments on 3DPeople ground-truth and noisy keypoint data. Finally,
we identify a set of factors that affect classification accuracy and propose
future work. The advantage of the approach is that the input is small and the
architecture is simple, which enables us to run many experiments and keep
real-time performance at inference. The source code, with the experiment and
data preparation scripts, is available on GitHub
(https://github.com/kristijanbartol/human-sex-classifier).
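The pipeline described in the abstract (15 OpenPose 2D keypoints flattened into a small vector and fed to a simple deep model producing a binary label) can be sketched roughly as follows. This is an illustrative sketch, not the authors' architecture: the layer sizes, the mean-centering normalization, and the random initialization are all assumptions chosen for the demo.

```python
import numpy as np

rng = np.random.default_rng(0)

def normalize(keypoints):
    """Center keypoints on their mean and scale by the max absolute offset.
    (Assumed normalization; the paper's exact preprocessing may differ.)"""
    centered = keypoints - keypoints.mean(axis=0)
    scale = max(np.abs(centered).max(), 1e-8)
    return centered / scale

def init_weights(sizes=(30, 64, 32, 1)):
    """Small random weights for a hypothetical 2-hidden-layer MLP."""
    return [(rng.standard_normal((m, n)) * 0.1, np.zeros(n))
            for m, n in zip(sizes[:-1], sizes[1:])]

def mlp_forward(x, weights):
    """Forward pass: two ReLU hidden layers, sigmoid output = P(male)."""
    (w1, b1), (w2, b2), (w3, b3) = weights
    h1 = np.maximum(x @ w1 + b1, 0.0)
    h2 = np.maximum(h1 @ w2 + b2, 0.0)
    logit = h2 @ w3 + b3
    return 1.0 / (1.0 + np.exp(-logit))

# 15 joints with (x, y) coordinates -> flattened 30-dim input vector.
keypoints = rng.uniform(0.0, 1.0, size=(15, 2))  # stand-in for OpenPose output
x = normalize(keypoints).ravel()
prob = mlp_forward(x, init_weights())
print(prob.item())  # probability in (0, 1); untrained, so uninformative
```

The small input dimensionality (30 values) is what makes the abstract's point about running many experiments and keeping real-time inference plausible: the forward pass is a few tiny matrix multiplies.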
Related papers
- Robust Human Registration with Body Part Segmentation on Noisy Point Clouds [73.00876572870787]
We introduce a hybrid approach that incorporates body-part segmentation into the mesh fitting process.
Our method first assigns body part labels to individual points, which then guide a two-step SMPL-X fitting.
We demonstrate that the fitted human mesh can refine body part labels, leading to improved segmentation.
arXiv Detail & Related papers (2025-04-04T17:17:33Z)
- CameraHMR: Aligning People with Perspective [54.05758012879385]
We address the challenge of accurate 3D human pose and shape estimation from monocular images.
Existing training datasets containing real images with pseudo ground truth (pGT) use SMPLify to fit SMPL to sparse 2D joint locations.
We make two contributions that improve pGT accuracy.
arXiv Detail & Related papers (2024-11-12T19:12:12Z)
- VoxelKP: A Voxel-based Network Architecture for Human Keypoint Estimation in LiDAR Data [53.638818890966036]
VoxelKP is a novel fully sparse network architecture tailored for human keypoint estimation in LiDAR data.
We introduce sparse box-attention to focus on learning spatial correlations between keypoints within each human instance.
We incorporate a spatial encoding to leverage absolute 3D coordinates when projecting 3D voxels to a 2D grid encoding a bird's eye view.
arXiv Detail & Related papers (2023-12-11T23:50:14Z)
- 3D Human Keypoints Estimation From Point Clouds in the Wild Without Human Labels [78.69095161350059]
GC-KPL is an approach for learning 3D human joint locations from point clouds without human labels.
We show that by training on a large training set without any human-annotated keypoints, we achieve reasonable performance compared to the fully supervised approach.
arXiv Detail & Related papers (2023-06-07T19:46:30Z)
- Pedestrian Crossing Action Recognition and Trajectory Prediction with 3D Human Keypoints [25.550524178542833]
We propose a novel multi-task learning framework for pedestrian crossing action recognition and trajectory prediction.
We use 3D human keypoints extracted from raw sensor data to capture rich information on human pose and activity.
We show that our approach achieves state-of-the-art performance on a wide range of evaluation metrics.
arXiv Detail & Related papers (2023-06-01T18:27:48Z)
- 2D Human Pose Estimation with Explicit Anatomical Keypoints Structure Constraints [15.124606575017621]
We present a novel 2D human pose estimation method with explicit anatomical keypoints structure constraints.
Our proposed model can be plugged into most existing bottom-up or top-down human pose estimation methods.
Our method performs favorably against most existing bottom-up and top-down human pose estimation methods.
arXiv Detail & Related papers (2022-12-05T11:01:43Z)
- An Empirical Study of Pseudo-Labeling for Image-based 3D Object Detection [72.30883544352918]
We investigate whether pseudo-labels can provide effective supervision for the baseline models under varying settings.
We achieve 20.23 AP for moderate level on the KITTI-3D testing set without bells and whistles, improving the baseline model by 6.03 AP.
We hope this work can provide insights for the image-based 3D detection community under a semi-supervised setting.
arXiv Detail & Related papers (2022-08-15T12:17:46Z)
- Human keypoint detection for close proximity human-robot interaction [29.99153271571971]
We study the performance of state-of-the-art human keypoint detectors in the context of close proximity human-robot interaction.
The best performing whole-body keypoint detectors in close proximity were MMPose and AlphaPose, but both had difficulty with finger detection.
We propose a combination of MMPose or AlphaPose for the body and MediaPipe for the hands in a single framework providing the most accurate and robust detection.
arXiv Detail & Related papers (2022-07-15T20:33:29Z)
- Efficient Human Pose Estimation via 3D Event Point Cloud [10.628192454401553]
We are the first to estimate 2D human pose directly from a 3D event point cloud.
We propose a novel representation of events, the rasterized event point cloud, which aggregates events at the same position within a small time slice.
With a PointNet backbone and 2048-point input, our method achieves 82.46 mm MPJPE3D on the DHP19 dataset, with a latency of only 12.29 ms.
arXiv Detail & Related papers (2022-06-09T13:50:20Z)
- Vision-based Behavioral Recognition of Novelty Preference in Pigs [1.837722971703011]
Behavioral scoring of research data is crucial for extracting domain-specific metrics but is bottlenecked by the need to analyze enormous volumes of information with human labor.
Deep learning is widely viewed as a key advancement to relieve this bottleneck.
We identify one such domain, where deep learning can be leveraged to alleviate manual scoring.
arXiv Detail & Related papers (2021-06-23T06:10:34Z)
- Rel3D: A Minimally Contrastive Benchmark for Grounding Spatial Relations in 3D [71.11034329713058]
Existing datasets lack large-scale, high-quality 3D ground truth information.
Rel3D is the first large-scale, human-annotated dataset for grounding spatial relations in 3D.
We propose minimally contrastive data collection, a novel crowdsourcing method for reducing dataset bias.
arXiv Detail & Related papers (2020-12-03T01:51:56Z)
- KeypointNet: A Large-scale 3D Keypoint Dataset Aggregated from Numerous Human Annotations [56.34297279246823]
KeypointNet is the first large-scale and diverse 3D keypoint dataset.
It contains 103,450 keypoints and 8,234 3D models from 16 object categories.
Ten state-of-the-art methods are benchmarked on our proposed dataset.
arXiv Detail & Related papers (2020-02-28T12:58:56Z)
This list is automatically generated from the titles and abstracts of the papers on this site.
The site does not guarantee the quality of the information provided and is not responsible for any consequences arising from its use.