Machine Learning Practices Outside Big Tech: How Resource Constraints
Challenge Responsible Development
- URL: http://arxiv.org/abs/2110.02932v1
- Date: Wed, 6 Oct 2021 17:25:21 GMT
- Title: Machine Learning Practices Outside Big Tech: How Resource Constraints
Challenge Responsible Development
- Authors: Aspen Hopkins, Serena Booth
- Abstract summary: Machine learning practitioners from diverse occupations and backgrounds are increasingly using machine learning (ML) methods.
Past research often excludes the broader, lesser-resourced ML community.
These practitioners share many of the same ML development difficulties and ethical conundrums as their Big Tech counterparts.
- Score: 1.8275108630751844
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Practitioners from diverse occupations and backgrounds are increasingly using
machine learning (ML) methods. Nonetheless, studies on ML Practitioners
typically draw populations from Big Tech and academia, as researchers have
easier access to these communities. Through this selection bias, past research
often excludes the broader, lesser-resourced ML community -- for example,
practitioners working at startups, at non-tech companies, and in the public
sector. These practitioners share many of the same ML development difficulties
and ethical conundrums as their Big Tech counterparts; however, their
experiences are subject to additional under-studied challenges stemming from
deploying ML with limited resources, increased existential risk, and absent
access to in-house research teams. We contribute a qualitative analysis of 17
interviews with stakeholders from organizations which are less represented in
prior studies. We uncover a number of tensions which are introduced or
exacerbated by these organizations' resource constraints -- tensions between
privacy and ubiquity, resource management and performance optimization, and
access and monopolization. Increased academic focus on these practitioners can
facilitate a more holistic understanding of ML limitations, and so is useful
for prescribing a research agenda to facilitate responsible ML development for
all.
Related papers
- Towards Sample-Efficiency and Generalization of Transfer and Inverse Reinforcement Learning: A Comprehensive Literature Review [50.67937325077047]
This paper is devoted to a comprehensive review of realizing the sample efficiency and generalization of RL algorithms through transfer and inverse reinforcement learning (T-IRL)
Our findings denote that a majority of recent research works have dealt with the aforementioned challenges by utilizing human-in-the-loop and sim-to-real strategies.
Under the IRL structure, training schemes that require a low number of experience transitions and extension of such frameworks to multi-agent and multi-intention problems have been the priority of researchers in recent years.
arXiv Detail & Related papers (2024-11-15T15:18:57Z) - A Multivocal Review of MLOps Practices, Challenges and Open Issues [9.227450931458907]
We conduct a Multivocal Literature Review (MLR) of 150 relevant academic studies and 48 gray literature to provide a comprehensive body of knowledge on MLOps.
We identify the emerging MLOps practices, adoption challenges and solutions related to various areas, including development and operation of complex pipelines, managing production at scale, managing artifacts, and ensuring quality, security, governance, and ethical aspects.
arXiv Detail & Related papers (2024-06-14T05:47:13Z) - A Survey on Large Language Model based Autonomous Agents [105.2509166861984]
Large language models (LLMs) have demonstrated remarkable potential in achieving human-level intelligence.
This paper delivers a systematic review of the field of LLM-based autonomous agents from a holistic perspective.
We present a comprehensive overview of the diverse applications of LLM-based autonomous agents in the fields of social science, natural science, and engineering.
arXiv Detail & Related papers (2023-08-22T13:30:37Z) - SciBench: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models [70.5763210869525]
We introduce an expansive benchmark suite SciBench for Large Language Model (LLM)
SciBench contains a dataset featuring a range of collegiate-level scientific problems from mathematics, chemistry, and physics domains.
The results reveal that the current LLMs fall short of delivering satisfactory performance, with the best overall score of merely 43.22%.
arXiv Detail & Related papers (2023-07-20T07:01:57Z) - Towards machine learning guided by best practices [0.0]
Machine learning (ML) is being used in software systems with multiple application fields, from medicine to software engineering (SE)
This thesis aims to answer research questions that help to understand the practices used and discussed by practitioners and researchers in the SE community.
arXiv Detail & Related papers (2023-04-29T10:58:37Z) - REAL ML: Recognizing, Exploring, and Articulating Limitations of Machine
Learning Research [19.71032778307425]
Transparency around limitations can improve the scientific rigor of research, help ensure appropriate interpretation of research findings, and make research claims more credible.
Despite these benefits, the machine learning (ML) research community lacks well-developed norms around disclosing and discussing limitations.
We conduct an iterative design process with 30 ML and ML-adjacent researchers to develop REAL ML, a set of guided activities to help ML researchers recognize, explore, and articulate the limitations of their research.
arXiv Detail & Related papers (2022-05-05T15:32:45Z) - Information Extraction in Low-Resource Scenarios: Survey and Perspective [56.5556523013924]
Information Extraction seeks to derive structured information from unstructured texts.
This paper presents a review of neural approaches to low-resource IE from emphtraditional and emphLLM-based perspectives.
arXiv Detail & Related papers (2022-02-16T13:44:00Z) - Machine Learning Application Development: Practitioners' Insights [18.114724750441724]
We report about a survey that aimed to understand the challenges and best practices of ML application development.
We synthesize the results obtained from 80 practitioners into 17 findings; outlining challenges and best practices for ML application development.
We hope that the reported challenges will inform the research community about topics that need to be investigated to improve the engineering process and the quality of ML-based applications.
arXiv Detail & Related papers (2021-12-31T03:38:37Z) - Understanding the Usability Challenges of Machine Learning In
High-Stakes Decision Making [67.72855777115772]
Machine learning (ML) is being applied to a diverse and ever-growing set of domains.
In many cases, domain experts -- who often have no expertise in ML or data science -- are asked to use ML predictions to make high-stakes decisions.
We investigate the ML usability challenges present in the domain of child welfare screening through a series of collaborations with child welfare screeners.
arXiv Detail & Related papers (2021-03-02T22:50:45Z) - Machine Learning Towards Intelligent Systems: Applications, Challenges,
and Opportunities [8.68311678910946]
Machine learning (ML) provides a mechanism for humans to process large amounts of data.
This review focuses on some of the fields and applications such as education, healthcare, network security, banking and finance, and social media.
arXiv Detail & Related papers (2021-01-11T01:32:15Z) - Learnings from Frontier Development Lab and SpaceML -- AI Accelerators
for NASA and ESA [57.06643156253045]
Research with AI and ML technologies lives in a variety of settings with often asynchronous goals and timelines.
We perform a case study of the Frontier Development Lab (FDL), an AI accelerator under a public-private partnership from NASA and ESA.
FDL research follows principled practices that are grounded in responsible development, conduct, and dissemination of AI research.
arXiv Detail & Related papers (2020-11-09T21:23:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.