A deep-learning approach to early identification of suggested sexual
harassment from videos
- URL: http://arxiv.org/abs/2306.00856v1
- Date: Thu, 1 Jun 2023 16:14:17 GMT
- Title: A deep-learning approach to early identification of suggested sexual
harassment from videos
- Authors: Shreya Shetye, Anwita Maiti, Tannistha Maiti and Tarry Singh
- Abstract summary: Sexual harassment, sexual abuse, and sexual violence are prevalent problems in this day and age.
We have classified the three terms (harassment, abuse, and violence) based on the visual attributes present in images depicting these situations.
We identified that factors such as facial expression of the victim and perpetrator and unwanted touching had a direct link to identifying the scenes.
Based on these definitions and characteristics, we have developed a first-of-its-kind dataset from various Indian movie scenes.
- Score: 0.802904964931021
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Sexual harassment, sexual abuse, and sexual violence are prevalent problems
in this day and age. Women's safety is an important issue that needs to be
highlighted and addressed. Given this issue, we have studied each of these
concerns and the factors that affect it based on images generated from movies.
We have classified the three terms (harassment, abuse, and violence) based on
the visual attributes present in images depicting these situations. We
identified that factors such as facial expression of the victim and perpetrator
and unwanted touching had a direct link to identifying the scenes containing
sexual harassment, abuse and violence. We also studied and outlined how
state-of-the-art explicit content detectors such as Google Cloud Vision API and
Clarifai API fail to identify and categorise these images. Based on these
definitions and characteristics, we have developed a first-of-its-kind dataset
from various Indian movie scenes. These scenes are classified as sexual
harassment, sexual abuse, or sexual violence and exported in the PASCAL VOC 1.1
format. Our dataset is annotated on the identified relevant features and can be
used to develop and train a deep-learning computer vision model to identify
these issues. The dataset is publicly available for research and development.
Related papers
- Anonymization Prompt Learning for Facial Privacy-Preserving Text-to-Image Generation [56.46932751058042]
We train a learnable prompt prefix for text-to-image diffusion models, which forces the model to generate anonymized facial identities.
Experiments demonstrate the successful anonymization performance of APL, which anonymizes any specific individuals without compromising the quality of non-identity-specific image generation.
arXiv Detail & Related papers (2024-05-27T07:38:26Z) - "Did They F***ing Consent to That?": Safer Digital Intimacy via Proactive Protection Against Image-Based Sexual Abuse [12.424265801615322]
8 in 10 adults share intimate content such as nude or lewd images.
Stigmatizing attitudes and a lack of technological mitigations put those sharing such content at risk of sexual violence.
arXiv Detail & Related papers (2024-03-07T17:04:55Z) - It is not Sexually Suggestive, It is Educative. Separating Sex Education
from Suggestive Content on TikTok Videos [22.870334358353585]
SexTok is a dataset composed of TikTok videos labeled as sexually suggestive (from the annotator's point of view), sex-educational content, or neither.
Children's exposure to sexually suggestive videos has been shown to have adversarial effects on their development.
Virtual sex education, especially on subjects that are more relevant to the LGBTQIA+ community, is very valuable.
arXiv Detail & Related papers (2023-07-06T20:23:17Z) - Contrastive Language-Vision AI Models Pretrained on Web-Scraped
Multimodal Data Exhibit Sexual Objectification Bias [11.6727088473067]
We show that language-vision AI models trained on web scrapes learn biases of sexual objectification.
Images of female professionals are likely to be associated with sexual descriptions relative to images of male professionals.
arXiv Detail & Related papers (2022-12-21T18:54:19Z) - Fighting Malicious Media Data: A Survey on Tampering Detection and
Deepfake Detection [115.83992775004043]
Recent advances in deep learning, particularly deep generative models, open the doors for producing perceptually convincing images and videos at a low cost.
This paper provides a comprehensive review of the current media tampering detection approaches, and discusses the challenges and trends in this field for future research.
arXiv Detail & Related papers (2022-12-12T02:54:08Z) - CoSIm: Commonsense Reasoning for Counterfactual Scene Imagination [87.4797527628459]
We introduce a new task/dataset called Commonsense Reasoning for Counterfactual Scene Imagination (CoSIm)
CoSIm is designed to evaluate the ability of AI systems to reason about scene change imagination.
arXiv Detail & Related papers (2022-07-08T15:28:23Z) - Leveraging Real Talking Faces via Self-Supervision for Robust Forgery
Detection [112.96004727646115]
We develop a method to detect face-manipulated videos using real talking faces.
We show that our method achieves state-of-the-art performance on cross-manipulation generalisation and robustness experiments.
Our results suggest that leveraging natural and unlabelled videos is a promising direction for the development of more robust face forgery detectors.
arXiv Detail & Related papers (2022-01-18T17:14:54Z) - Reporting Revenge Porn: a Preliminary Expert Analysis [0.0]
We present a preliminary expert analysis of the process for reporting revenge porn abuses in selected content sharing platforms.
Among these, we included social networks, image hosting websites, video hosting platforms, forums, and pornographic sites.
arXiv Detail & Related papers (2021-06-23T08:08:59Z) - Enhancing Unsupervised Video Representation Learning by Decoupling the
Scene and the Motion [86.56202610716504]
Action categories are highly related with the scene where the action happens, making the model tend to degrade to a solution where only the scene information is encoded.
We propose to decouple the scene and the motion (DSM) with two simple operations, so that the model attention towards the motion information is better paid.
arXiv Detail & Related papers (2020-09-12T09:54:11Z) - Large image datasets: A pyrrhic win for computer vision? [2.627046865670577]
We investigate problematic practices and consequences of large scale vision datasets.
We examine broad issues such as the question of consent and justice as well as specific concerns such as the inclusion of verifiably pornographic images in datasets.
arXiv Detail & Related papers (2020-06-24T06:41:32Z) - #MeToo on Campus: Studying College Sexual Assault at Scale Using Data
Reported on Social Media [71.74529365205053]
We analyze the influence of the # trend on a pool of college followers.
The results show that the majority of topics embedded in those # tweets detail sexual harassment stories.
There exists a significant correlation between the prevalence of this trend and official reports on several major geographical regions.
arXiv Detail & Related papers (2020-01-16T18:05:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.