From Data to Knowledge to Action: A Global Enabler for the 21st Century
- URL: http://arxiv.org/abs/2008.00045v1
- Date: Fri, 31 Jul 2020 19:19:42 GMT
- Title: From Data to Knowledge to Action: A Global Enabler for the 21st Century
- Authors: Eric Horvitz and Tom Mitchell
- Abstract summary: A confluence of advances in the computer and mathematical sciences has unleashed unprecedented capabilities for enabling true evidence-based decision making.
These capabilities are making possible the large-scale capture of data and the transformation of that data into insights and recommendations.
The shift of commerce, science, education, art, and entertainment to the web makes available unprecedented quantities of structured and unstructured databases about human activities.
- Score: 26.32590947516587
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: A confluence of advances in the computer and mathematical sciences has
unleashed unprecedented capabilities for enabling true evidence-based decision
making. These capabilities are making possible the large-scale capture of data
and the transformation of that data into insights and recommendations in
support of decisions about challenging problems in science, society, and
government. Key advances include jumps in the availability of rich streams of
data, precipitous drops in the cost of storing and retrieving massive amounts
of data, exponential increases in computing power and memory, and jumps in the
prowess of methods for performing machine learning and reasoning. These
advances have come together to create an inflection point in our ability to
harness large amounts of data for generating insights and guiding decision
making. The shift of commerce, science, education, art, and entertainment to
the web makes available unprecedented quantities of structured and unstructured
databases about human activities - much of it available to anyone who wishes to
mine it for insights. In the sciences, new evidential paradigms and sensing
technologies are making available great quantities of data, via use of
fundamentally new kinds of low-cost sensors (e.g., genomic microarrays) or
through viewers that provide unprecedented scope and resolution. The data pose
a huge opportunity for data-centric analyses. To date, we have only scratched
the surface of the potential for learning from these large-scale data sets.
Opportunities abound for tapping our new capabilities more broadly to provide
insights to decision makers and to enhance the quality of their actions and
policies.
Related papers
- On Responsible Machine Learning Datasets with Fairness, Privacy, and Regulatory Norms [56.119374302685934]
There have been severe concerns over the trustworthiness of AI technologies.
Machine and deep learning algorithms depend heavily on the data used during their development.
We propose a framework to evaluate the datasets through a responsible rubric.
arXiv Detail & Related papers (2023-10-24T14:01:53Z) - Privacy-Preserving Graph Machine Learning from Data to Computation: A
Survey [67.7834898542701]
We focus on reviewing privacy-preserving techniques of graph machine learning.
We first review methods for generating privacy-preserving graph data.
Then we describe methods for transmitting privacy-preserved information.
arXiv Detail & Related papers (2023-07-10T04:30:23Z) - Machine Learning and Consumer Data [0.4873362301533825]
The digital revolution has led to the digitization of human behavior, creating unprecedented opportunities to understand observable actions on an unmatched scale.
Emerging phenomena such as crowdfunding and crowdsourcing have further illuminated consumer behavior while also introducing new behavioral patterns.
Traditional methods used to analyze consumer data fall short in handling the breadth, precision, and scale of emerging data sources.
arXiv Detail & Related papers (2023-06-25T03:58:15Z) - The Dimensions of Data Labor: A Road Map for Researchers, Activists, and
Policymakers to Empower Data Producers [14.392208044851976]
Data producers have little say in what data is captured, how it is used, or who it benefits.
Organizations with the ability to access and process this data, e.g. OpenAI and Google, possess immense power in shaping the technology landscape.
By synthesizing related literature that reconceptualizes the production of data for computing as data labor'', we outline opportunities for researchers, policymakers, and activists to empower data producers.
arXiv Detail & Related papers (2023-05-22T17:11:22Z) - A Vision for Semantically Enriched Data Science [19.604667287258724]
Key areas such as utilizing domain knowledge and data semantics are areas where we have seen little automation.
We envision how leveraging "semantic" understanding and reasoning on data in combination with novel tools for data science automation can help with consistent and explainable data augmentation and transformation.
arXiv Detail & Related papers (2023-03-02T16:03:12Z) - Advanced Data Augmentation Approaches: A Comprehensive Survey and Future
directions [57.30984060215482]
We provide a background of data augmentation, a novel and comprehensive taxonomy of reviewed data augmentation techniques, and the strengths and weaknesses (wherever possible) of each technique.
We also provide comprehensive results of the data augmentation effect on three popular computer vision tasks, such as image classification, object detection and semantic segmentation.
arXiv Detail & Related papers (2023-01-07T11:37:32Z) - "If we didn't solve small data in the past, how can we solve Big Data
today?" [0.0]
We aim to research terms such as'small' and 'big' data, understand their attributes, and look at ways in which they can add value.
Based on the research, it can be inferred that, regardless of how small data might have been used, organizations can still leverage big data with the right technology and business vision.
arXiv Detail & Related papers (2021-11-08T16:31:01Z) - Understanding the World Through Action [91.3755431537592]
I will argue that a general, principled, and powerful framework for utilizing unlabeled data can be derived from reinforcement learning.
I will discuss how such a procedure is more closely aligned with potential downstream tasks.
arXiv Detail & Related papers (2021-10-24T22:33:52Z) - Synthetic Data: Opening the data floodgates to enable faster, more
directed development of machine learning methods [96.92041573661407]
Many ground-breaking advancements in machine learning can be attributed to the availability of a large volume of rich data.
Many large-scale datasets are highly sensitive, such as healthcare data, and are not widely available to the machine learning community.
Generating synthetic data with privacy guarantees provides one such solution.
arXiv Detail & Related papers (2020-12-08T17:26:10Z) - Occams Razor for Big Data? On Detecting Quality in Large Unstructured
Datasets [0.0]
New trend towards analytic complexity represents a severe challenge for the principle of parsimony or Occams Razor in science.
Computational building block approaches for data clustering can help to deal with large unstructured datasets in minimized computation time.
The review concludes on how cultural differences between East and West are likely to affect the course of big data analytics.
arXiv Detail & Related papers (2020-11-12T16:06:01Z) - Urban Sensing based on Mobile Phone Data: Approaches, Applications and
Challenges [67.71975391801257]
Much concern in mobile data analysis is related to human beings and their behaviours.
This work aims to review the methods and techniques that have been implemented to discover knowledge from mobile phone data.
arXiv Detail & Related papers (2020-08-29T15:14:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.