A data science axiology: the nature, value, and risks of data science
- URL: http://arxiv.org/abs/2307.10460v2
- Date: Fri, 21 Jul 2023 21:32:12 GMT
- Title: A data science axiology: the nature, value, and risks of data science
- Authors: Michael L. Brodie
- Abstract summary: Data science is a research paradigm with an unfathomed scope, scale, complexity, and power for knowledge discovery.
This paper presents an axiology of data science, its purpose, nature, importance, risks, and value for problem solving.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Data science is not a science. It is a research paradigm with an unfathomed
scope, scale, complexity, and power for knowledge discovery that is not
otherwise possible and can be beyond human reasoning. It is changing our world
practically and profoundly already widely deployed in tens of thousands of
applications in every discipline in an AI Arms Race that, due to its
inscrutability, can lead to unfathomed risks. This paper presents an axiology
of data science, its purpose, nature, importance, risks, and value for problem
solving, by exploring and evaluating its remarkable, definitive features. As
data science is in its infancy, this initial, speculative axiology is intended
to aid in understanding and defining data science to recognize its potential
benefits, risks, and open research challenges. AI based data science is
inherently about uncertainty that may be more realistic than our preference for
the certainty of science. Data science will have impacts far beyond knowledge
discovery and will take us into new ways of understanding the world.
Related papers
- Explain the Black Box for the Sake of Science: the Scientific Method in the Era of Generative Artificial Intelligence [0.9065034043031668]
The scientific method is the cornerstone of human progress across all branches of the natural and applied sciences.
We argue that human complex reasoning for scientific discovery remains of vital importance, at least before the advent of artificial general intelligence.
Knowing what data AI systems deemed important to make decisions can be a point of contact with domain experts and scientists.
arXiv Detail & Related papers (2024-06-15T08:34:42Z) - Control Risk for Potential Misuse of Artificial Intelligence in Science [85.91232985405554]
We aim to raise awareness of the dangers of AI misuse in science.
We highlight real-world examples of misuse in chemical science.
We propose a system called SciGuard to control misuse risks for AI models in science.
arXiv Detail & Related papers (2023-12-11T18:50:57Z) - Artificial Intelligence for Science in Quantum, Atomistic, and Continuum Systems [268.585904751315]
New area of research known as AI for science (AI4Science)
Areas aim at understanding the physical world from subatomic (wavefunctions and electron density), atomic (molecules, proteins, materials, and interactions), to macro (fluids, climate, and subsurface) scales.
Key common challenge is how to capture physics first principles, especially symmetries, in natural systems by deep learning methods.
arXiv Detail & Related papers (2023-07-17T12:14:14Z) - The Future of Fundamental Science Led by Generative Closed-Loop
Artificial Intelligence [67.70415658080121]
Recent advances in machine learning and AI are disrupting technological innovation, product development, and society as a whole.
AI has contributed less to fundamental science in part because large data sets of high-quality data for scientific practice and model discovery are more difficult to access.
Here we explore and investigate aspects of an AI-driven, automated, closed-loop approach to scientific discovery.
arXiv Detail & Related papers (2023-07-09T21:16:56Z) - Defining data science: a new field of inquiry [0.0]
Modern data science is in its infancy. Emerging slowly since 1962 and rapidly since 2000, it is one of the most active, powerful, and rapidly evolving 21st century innovations.
Due to its value, power, and applicability, it is emerging in over 40 disciplines, hundreds of research areas, and thousands of applications.
This research addresses this data science multiple definitions challenge by proposing the development of coherent, unified definition based on a data science reference framework.
arXiv Detail & Related papers (2023-06-28T12:58:42Z) - Modeling Information Change in Science Communication with Semantically
Matched Paraphrases [50.67030449927206]
SPICED is the first paraphrase dataset of scientific findings annotated for degree of information change.
SPICED contains 6,000 scientific finding pairs extracted from news stories, social media discussions, and full texts of original papers.
Models trained on SPICED improve downstream performance on evidence retrieval for fact checking of real-world scientific claims.
arXiv Detail & Related papers (2022-10-24T07:44:38Z) - Learning from learning machines: a new generation of AI technology to
meet the needs of science [59.261050918992325]
We outline emerging opportunities and challenges to enhance the utility of AI for scientific discovery.
The distinct goals of AI for industry versus the goals of AI for science create tension between identifying patterns in data versus discovering patterns in the world from data.
arXiv Detail & Related papers (2021-11-27T00:55:21Z) - Biases in Data Science Lifecycle [0.0]
The aim of this study is to provide a practical guideline to data scientists and increase their awareness.
In this work, we reviewed different sources of biases and grouped them under different stages of the data science lifecycle.
arXiv Detail & Related papers (2020-09-10T13:41:48Z) - Data Science: Challenges and Directions [42.98602883069444]
We review hundreds of pieces of literature which include data science in their titles.
We find that the majority of the discussions essentially concern statistics, data mining, machine learning, big data, or broadly data analytics.
We focus on the research and innovation challenges inspired by the nature of data science problems as complex systems.
arXiv Detail & Related papers (2020-06-28T01:49:00Z) - Ten Research Challenge Areas in Data Science [4.670305538969914]
Data science builds on knowledge from computer science, mathematics, statistics, and other disciplines.
This article starts with meta-questions about data science as a discipline and then elaborates on ten ideas for the basis of a research agenda for data science.
arXiv Detail & Related papers (2020-01-27T21:39:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.