Diversifying the Genomic Data Science Research Community
- URL: http://arxiv.org/abs/2201.08443v2
- Date: Thu, 9 Jun 2022 13:49:35 GMT
- Title: Diversifying the Genomic Data Science Research Community
- Authors: The Genomic Data Science Community Network, Rosa Alcazar (1), Maria
Alvarez (2), Rachel Arnold (3), Mentewab Ayalew (4), Lyle G. Best (5),
Michael C. Campbell (6), Kamal Chowdhury (7), Katherine E. L. Cox (8),
Christina Daulton (9), Youping Deng (10), Carla Easter (11), Karla Fuller
(12), Shazia Tabassum Hakim (13), Ava M. Hoffman (8), Natalie Kucher (14),
Andrew Lee (15), Joslynn Lee (16), Jeffrey T. Leek (8), Robert Meller (17),
Loyda B. M\'endez (18), Miguel P. M\'endez-Gonz\'alez (19), Stephen Mosher
(14), Michele Nishiguchi (20), Siddharth Pratap (21), Tiffany Rolle (9),
Sourav Roy (22), Rachel Saidi (23), Michael C. Schatz (14 and 24), Shurjo Sen
(9), James Sniezek (25), Edu Suarez Martinez (26), Frederick Tan (27),
Jennifer Vessio (14), Karriem Watson (28), Wendy Westbroek (29), Joseph
Wilcox (30), Xianfa Xie (31) ((1) Clovis Community College, Fresno, CA, USA,
(2) Biology, El Paso Community College, El Paso, TX, USA, (3) US Fish and
Wildlife and Northwest Indian College, Onalaska, WI, USA, (4) Biology
Department, Spelman College, Atlanta, GA, USA, (5) Turtle Mountain Community
College, Belcourt, ND, USA, (6) Department of Biological Sciences, University
of Southern California, Los Angeles CA, USA, (7) Biology Department, Claflin
University, Orangeburg, SC, USA, (8) Department of Biostatistics, Johns
Hopkins Bloomberg School of Public Health, Baltimore, MD, USA, (9) National
Human Genome Research Institute, National Institutes of Health, Bethesda, MD,
USA, (10) Department of Quantitative Health Sciences, University of Hawaii at
Manoa, Honolulu, HI, USA, (11) Smithsonian Institute National Museum of
Natural History, Washington, DC, USA, (12) Guttman Community College, New
York, NY, USA, (13) Department of Microbiology and Biomedical Sciences, Dine
College, Tuba City, AZ, USA, (14) Department of Biology, Johns Hopkins
University, Baltimore, MD, USA, (15) Department of Biology, Northern Virginia
Community College - Alexandria, Alexandria, VA, USA, (16) Department of
Chemistry and Biochemistry, Fort Lewis College, Durango, CO, USA, (17)
Department of Neurobiology, Morehouse School of Medicine, Atlanta, GA, USA,
(18) Science & Technology, Universidad Ana G. M\'endez, Carolina, Carolina,
PR, (19) Natural Sciences Department, University of Puerto Rico at Aguadilla,
Aguadilla, PR, (20) Department of Molecular and Cell Biology, University of
California, Merced, Merced, CA, USA, (21) School of Graduate Studies and
Research, Meharry Medical College, Nashville, TN, USA, (22) Department of
Biological Sciences and Border Biomedical Research Center, University of
Texas at El Paso, El Paso, TX, USA, (23) Department of Math, Statistics, and
Data Science, Montgomery College, Rockville, MD, USA, (24) Departments of
Computer Science, Johns Hopkins University, Baltimore, MD, USA, (25) Chemical
and Biological Sciences, Montgomery College, Germantown, MD, USA, (26)
Department of Biology, University of Puerto Rico, Ponce, Ponce, PR, (27)
Department of Embryology, Carnegie Institution, Baltimore, MD, USA, (28)
National Institutes of Health, Bethesda, MD, USA, (29) Department of Biology,
Flathead Valley Community College, Kalispell, MT, USA, (30) Department of
Biology, Nevada State College, Henderson, NV, USA, (31) Department of
Biology, Virginia State University, Petersburg, VA, USA)
- Abstract summary: We have formed the Genomic Data Science Community Network to identify opportunities and support broadening access to cloud-enabled genomic data science.
Here, we provide a summary of the priorities for faculty members at UIs, as well as administrators, funders, and R1 researchers to consider as we create a more diverse genomic data science community.
- Score: 22.633385577446617
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Over the last 20 years, there has been an explosion of genomic data collected
for disease association, functional analyses, and other large-scale
discoveries. At the same time, there have been revolutions in cloud computing
that enable computational and data science research, while making data
accessible to anyone with a web browser and an internet connection. However,
students at institutions with limited resources have received relatively little
exposure to curricula or professional development opportunities that lead to
careers in genomic data science. To broaden participation in genomics research,
the scientific community needs to support students, faculty, and administrators
at Underserved Institutions (UIs) including Community Colleges, Historically
Black Colleges and Universities, Hispanic-Serving Institutions, and Tribal
Colleges and Universities in taking advantage of these tools in local
educational and research programs. We have formed the Genomic Data Science
Community Network (http://www.gdscn.org/) to identify opportunities and support
broadening access to cloud-enabled genomic data science. Here, we provide a
summary of the priorities for faculty members at UIs, as well as
administrators, funders, and R1 researchers to consider as we create a more
diverse genomic data science community.
Related papers
- Pennsieve: A Collaborative Platform for Translational Neuroscience and Beyond [0.5130659559809153]
Pennsieve is an open-source, cloud-based scientific data management platform.
It supports complex multimodal datasets and provides tools for data visualization and analyses.
Pennsieve stores over 125 TB of scientific data, with 35 TB of data publicly available across more than 350 high-impact datasets.
arXiv Detail & Related papers (2024-09-16T17:55:58Z) - Social Intelligence Data Infrastructure: Structuring the Present and Navigating the Future [59.78608958395464]
We build a Social AI Data Infrastructure, which consists of a comprehensive social AI taxonomy and a data library of 480 NLP datasets.
Our infrastructure allows us to analyze existing dataset efforts, and also evaluate language models' performance in different social intelligence aspects.
We show there is a need for multifaceted datasets, increased diversity in language and culture, more long-tailed social situations, and more interactive data in future social intelligence data efforts.
arXiv Detail & Related papers (2024-02-28T00:22:42Z) - Data Science for Social Good [2.8621556092850065]
We present a framework for "data science for social good" (DSSG) research.
We perform an analysis of the literature to empirically demonstrate the paucity of work on DSSG in information systems.
We hope that this article and the special issue will spur future DSSG research.
arXiv Detail & Related papers (2023-11-02T15:40:20Z) - On the Opportunities of Green Computing: A Survey [80.21955522431168]
Artificial Intelligence (AI) has achieved significant advancements in technology and research with the development over several decades.
The needs for high computing power brings higher carbon emission and undermines research fairness.
To tackle the challenges of computing resources and environmental impact of AI, Green Computing has become a hot research topic.
arXiv Detail & Related papers (2023-11-01T11:16:41Z) - Assessing Scientific Contributions in Data Sharing Spaces [64.16762375635842]
This paper introduces the SCIENCE-index, a blockchain-based metric measuring a researcher's scientific contributions.
To incentivize researchers to share their data, the SCIENCE-index is augmented to include a data-sharing parameter.
Our model is evaluated by comparing the distribution of its output for geographically diverse researchers to that of the h-index.
arXiv Detail & Related papers (2023-03-18T19:17:47Z) - Coordinated Science Laboratory 70th Anniversary Symposium: The Future of
Computing [80.72844751804166]
In 2021, the Coordinated Science Laboratory CSL hosted the Future of Computing Symposium to celebrate its 70th anniversary.
We summarize the major technological points, insights, and directions that speakers brought forward during the symposium.
Participants discussed topics related to new computing paradigms, technologies, algorithms, behaviors, and research challenges to be expected in the future.
arXiv Detail & Related papers (2022-10-04T17:32:27Z) - Retiring Adult: New Datasets for Fair Machine Learning [47.27417042497261]
UCI Adult has served as the basis for the development and comparison of many algorithmic fairness interventions.
We reconstruct a superset of the UCI Adult data from available US Census sources and reveal idiosyncrasies of the UCI Adult dataset that limit its external validity.
Our primary contribution is a suite of new datasets that extend the existing data ecosystem for research on fair machine learning.
arXiv Detail & Related papers (2021-08-10T19:19:41Z) - A field guide to cultivating computational biology [1.040598660564506]
Biomedical research centers can empower basic discovery and novel therapeutic strategies by leveraging their large-scale datasets from experiments and patients.
This data, together with new technologies to create and analyze it, has ushered in an era of data-driven discovery which requires moving beyond the traditional individual, single-discipline investigator research model.
We propose solutions for individual scientists, institutions, journal publishers, funding agencies, and educators.
arXiv Detail & Related papers (2021-04-23T01:24:21Z) - Learnings from Frontier Development Lab and SpaceML -- AI Accelerators
for NASA and ESA [57.06643156253045]
Research with AI and ML technologies lives in a variety of settings with often asynchronous goals and timelines.
We perform a case study of the Frontier Development Lab (FDL), an AI accelerator under a public-private partnership from NASA and ESA.
FDL research follows principled practices that are grounded in responsible development, conduct, and dissemination of AI research.
arXiv Detail & Related papers (2020-11-09T21:23:03Z) - Computational Skills by Stealth in Secondary School Data Science [16.960800464621993]
We discuss a proposal for the stealth development of computational skills in students' first exposure to data science.
The intent of this approach is to support students, regardless of interest and self-efficacy in coding, in becoming data-driven learners.
arXiv Detail & Related papers (2020-10-08T09:11:51Z) - The Data Science Fire Next Time: Innovative strategies for mentoring in
data science [14.213973379473655]
The Broadening Participation in Data Mining (BPDM) workshop was created more than 7 years ago.
BPDM aims to foster mentorship, guidance, and connections for minority and underrepresented groups in the data science and machine learning community.
To date it has impacted the lives of more than 330 underrepresented trainees in data science.
arXiv Detail & Related papers (2020-03-01T03:40:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.