Including Everyone, Everywhere: Understanding Opportunities and
Challenges of Geographic Gender-Inclusion in OSS
- URL: http://arxiv.org/abs/2010.00822v2
- Date: Thu, 16 Sep 2021 03:35:06 GMT
- Title: Including Everyone, Everywhere: Understanding Opportunities and
Challenges of Geographic Gender-Inclusion in OSS
- Authors: Gede Artha Azriadi Prana, Denae Ford, Ayushi Rastogi, David Lo, Rahul
Purandare, Nachiappan Nagappan
- Abstract summary: This study presents a multi-region geographical analysis of gender inclusion on GitHub.
Gender diversity is low across all parts of the world, with no substantial difference across regions.
There has been statistically significant improvement in diversity worldwide since 2014, with certain regions such as Africa improving at faster pace.
- Score: 15.757897147034873
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The gender gap is a significant concern facing the software industry as the
development becomes more geographically distributed. Widely shared reports
indicate that gender differences may be specific to each region. However, how
complete can these reports be with little to no research reflective of the Open
Source Software (OSS) process and communities software is now commonly
developed in? Our study presents a multi-region geographical analysis of gender
inclusion on GitHub. This mixed-methods approach includes quantitatively
investigating differences in gender inclusion in projects across geographic
regions and investigate these trends over time using data from contributions to
21,456 project repositories. We also qualitatively understand the unique
experiences of developers contributing to these projects through a survey that
is strategically targeted to developers in various regions worldwide. Our
findings indicate that gender diversity is low across all parts of the world,
with no substantial difference across regions. However, there has been
statistically significant improvement in diversity worldwide since 2014, with
certain regions such as Africa improving at faster pace. We also find that most
motivations and barriers to contributions (e.g., lack of resources to
contribute and poor working environment) were shared across regions, however,
some insightful differences, such as how to make projects more inclusive, did
arise. From these findings, we derive and present implications for tools that
can foster inclusion in open source software communities and empower
contributions from everyone, everywhere.
Related papers
- Saliency-Based diversity and fairness Metric and FaceKeepOriginalAugment: A Novel Approach for Enhancing Fairness and Diversity [46.74201905814679]
We introduce an extension of the KeepOriginalAugment method, termed FaceKeepOriginalAugment, which explores various debiasing aspects-geographical, gender, and stereotypical biases-in computer vision models.
By maintaining a delicate balance between data diversity and information preservation, our approach empowers models to exploit both diverse salient and non-salient regions.
We quantify dataset diversity across a range of datasets, including Flickr Faces HQ (FFHQ), WIKI, IMDB, Labelled Faces in the Wild (LFW), UTK Faces, and Diverse dataset.
arXiv Detail & Related papers (2024-10-29T13:49:23Z) - Report on Female Participation in Informatics degrees in Europe [3.498239025413087]
This study aims to enrich and leverage data from the Informatics Europe Higher Education (IEHE) data portal.
The research examines the proportion of female students, first-year enrollments, and degrees awarded to women in the field.
arXiv Detail & Related papers (2024-10-15T09:33:16Z) - GenderCARE: A Comprehensive Framework for Assessing and Reducing Gender Bias in Large Language Models [73.23743278545321]
Large language models (LLMs) have exhibited remarkable capabilities in natural language generation, but have also been observed to magnify societal biases.
GenderCARE is a comprehensive framework that encompasses innovative Criteria, bias Assessment, Reduction techniques, and Evaluation metrics.
arXiv Detail & Related papers (2024-08-22T15:35:46Z) - Breaking the Global North Stereotype: A Global South-centric Benchmark Dataset for Auditing and Mitigating Biases in Facial Recognition Systems [7.790132091010725]
We propose a new face dataset composed of 6,579 unique male and female sportspersons from eight countries around the world.
More than 50% of the dataset comprises individuals from the Global South countries and is demographically diverse.
To aid adversarial audits and robust model training, each image has four adversarial variants, totaling over 40,000 images.
arXiv Detail & Related papers (2024-07-22T17:22:04Z) - Harnessing Hierarchical Label Distribution Variations in Test Agnostic Long-tail Recognition [114.96385572118042]
We argue that the variation in test label distributions can be broken down hierarchically into global and local levels.
We propose a new MoE strategy, $mathsfDirMixE$, which assigns experts to different Dirichlet meta-distributions of the label distribution.
We show that our proposed objective benefits from enhanced generalization by virtue of the variance-based regularization.
arXiv Detail & Related papers (2024-05-13T14:24:56Z) - A Survey of Neural Code Intelligence: Paradigms, Advances and Beyond [84.95530356322621]
This survey presents a systematic review of the advancements in code intelligence.
It covers over 50 representative models and their variants, more than 20 categories of tasks, and an extensive coverage of over 680 related works.
Building on our examination of the developmental trajectories, we further investigate the emerging synergies between code intelligence and broader machine intelligence.
arXiv Detail & Related papers (2024-03-21T08:54:56Z) - Assessing the Influence of Toxic and Gender Discriminatory Communication on Perceptible Diversity in OSS Projects [2.526146573337397]
The presence of toxic and gender-identity derogatory language in open-source software (OSS) communities has recently become a focal point for researchers.
This study aims to investigate how such content influences the gender, ethnicity, and tenure diversity of open-source software development teams.
arXiv Detail & Related papers (2024-03-12T22:48:21Z) - How diverse is the ACII community? Analysing gender, geographical and
business diversity of Affective Computing research [0.0]
ACII is the premier international forum for presenting the latest research on affective computing.
We measure diversity in terms of gender, geographic location and academia vs research centres vs industry, and consider three different actors: authors, keynote speakers and organizers.
Results raise awareness on the limited diversity in the field, in all studied facets, and compared to other AI conferences.
arXiv Detail & Related papers (2021-09-12T18:30:36Z) - MultiFair: Multi-Group Fairness in Machine Learning [52.24956510371455]
We study multi-group fairness in machine learning (MultiFair)
We propose a generic end-to-end algorithmic framework to solve it.
Our proposed framework is generalizable to many different settings.
arXiv Detail & Related papers (2021-05-24T02:30:22Z) - Please Don't Go -- A Comprehensive Approach to Increase Women's
Participation in Open Source Software [11.326760036768068]
Women represent less than 24% of employees in the software development industry.
Despite efforts to increase diversity and multi-gendered participation, women are even more underrepresented in Open Source Software (OSS) projects.
I will identify different OSS career pathways and develop a holistic view of women's motivations to join or leave OSS.
arXiv Detail & Related papers (2021-03-15T23:23:15Z) - CNN-based Density Estimation and Crowd Counting: A Survey [65.06491415951193]
This paper comprehensively studies the crowd counting models, mainly CNN-based density map estimation methods.
According to the evaluation metrics, we select the top three performers on their crowd counting datasets.
We expect to make reasonable inference and prediction for the future development of crowd counting.
arXiv Detail & Related papers (2020-03-28T13:17:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.