Data science and Machine learning in the Clouds: A Perspective for the
Future
- URL: http://arxiv.org/abs/2109.01661v1
- Date: Thu, 2 Sep 2021 17:36:24 GMT
- Title: Data science and Machine learning in the Clouds: A Perspective for the
Future
- Authors: Hrishav Bakul Barua
- Abstract summary: Data driven science (the so called fourth science paradigm) is going to be the driving force in research and innovation.
Huge amount of data to be processed under this new paradigm will be a major concern in the future.
One will strongly require cloud based services in all the aspects of these computations.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: As we are fast approaching the beginning of a paradigm shift in the field of
science, Data driven science (the so called fourth science paradigm) is going
to be the driving force in research and innovation. From medicine to
biodiversity and astronomy to geology, all these terms are somehow going to be
affected by this paradigm shift. The huge amount of data to be processed under
this new paradigm will be a major concern in the future and one will strongly
require cloud based services in all the aspects of these computations (from
storage to compute and other services). Another aspect will be energy
consumption and performance of prediction jobs and tasks within such a
scientific paradigm which will change the way one sees computation. Data
science has heavily impacted or rather triggered the emergence of Machine
Learning, Signal/Image/Video processing related algorithms, Artificial
intelligence, Robotics, health informatics, geoinformatics, and many more such
areas of interest. Hence, we envisage an era where Data science can deliver its
promises with the help of the existing cloud based platforms and services with
the addition of new services. In this article, we discuss about data driven
science and Machine learning and how they are going to be linked through cloud
based services in the future. It also discusses the rise of paradigms like
approximate computing, quantum computing and many more in recent times and
their applicability in big data processing, data science, analytics, prediction
and machine learning in the cloud environments.
Related papers
- The Future of Data Science Education [0.11566458078238004]
The School of Data Science at the University of Virginia has developed a novel model for the definition of Data Science.
This paper will present the core features of the model and explain how it unifies various concepts going far beyond the analytics component of AI.
arXiv Detail & Related papers (2024-07-16T15:11:54Z) - On Responsible Machine Learning Datasets with Fairness, Privacy, and Regulatory Norms [56.119374302685934]
There have been severe concerns over the trustworthiness of AI technologies.
Machine and deep learning algorithms depend heavily on the data used during their development.
We propose a framework to evaluate the datasets through a responsible rubric.
arXiv Detail & Related papers (2023-10-24T14:01:53Z) - A Vision for Semantically Enriched Data Science [19.604667287258724]
Key areas such as utilizing domain knowledge and data semantics are areas where we have seen little automation.
We envision how leveraging "semantic" understanding and reasoning on data in combination with novel tools for data science automation can help with consistent and explainable data augmentation and transformation.
arXiv Detail & Related papers (2023-03-02T16:03:12Z) - Coordinated Science Laboratory 70th Anniversary Symposium: The Future of
Computing [80.72844751804166]
In 2021, the Coordinated Science Laboratory CSL hosted the Future of Computing Symposium to celebrate its 70th anniversary.
We summarize the major technological points, insights, and directions that speakers brought forward during the symposium.
Participants discussed topics related to new computing paradigms, technologies, algorithms, behaviors, and research challenges to be expected in the future.
arXiv Detail & Related papers (2022-10-04T17:32:27Z) - A Computational Inflection for Scientific Discovery [48.176406062568674]
We stand at the foot of a significant inflection in the trajectory of scientific discovery.
As society continues on its fast-paced digital transformation, so does humankind's collective scientific knowledge.
Computer science is poised to ignite a revolution in the scientific process itself.
arXiv Detail & Related papers (2022-05-04T11:36:54Z) - Edge-Cloud Polarization and Collaboration: A Comprehensive Survey [61.05059817550049]
We conduct a systematic review for both cloud and edge AI.
We are the first to set up the collaborative learning mechanism for cloud and edge modeling.
We discuss potentials and practical experiences of some on-going advanced edge AI topics.
arXiv Detail & Related papers (2021-11-11T05:58:23Z) - A Review into Data Science and Its Approaches in Mechanical Engineering [0.0]
This article briefly introduced data science and reviewed its methods.
In the introduction, different definitions of data science and its background in technology reviewed.
Some researches in the mechanical engineering area that used data science methods in their studies are reviewed.
arXiv Detail & Related papers (2020-12-30T23:05:29Z) - Data-Driven Aerospace Engineering: Reframing the Industry with Machine
Learning [49.367020832638794]
The aerospace industry is poised to capitalize on big data and machine learning.
Recent trends will be explored in context of critical challenges in design, manufacturing, verification and services.
arXiv Detail & Related papers (2020-08-24T22:40:26Z) - From Data to Knowledge to Action: A Global Enabler for the 21st Century [26.32590947516587]
A confluence of advances in the computer and mathematical sciences has unleashed unprecedented capabilities for enabling true evidence-based decision making.
These capabilities are making possible the large-scale capture of data and the transformation of that data into insights and recommendations.
The shift of commerce, science, education, art, and entertainment to the web makes available unprecedented quantities of structured and unstructured databases about human activities.
arXiv Detail & Related papers (2020-07-31T19:19:42Z) - A Data Scientist's Guide to Streamflow Prediction [55.22219308265945]
We focus on the element of hydrologic rainfall--runoff models and their application to forecast floods and predict streamflow.
This guide aims to help interested data scientists gain an understanding of the problem, the hydrologic concepts involved, and the details that come up along the way.
arXiv Detail & Related papers (2020-06-05T08:04:37Z) - Incorporating Physical Knowledge into Machine Learning for Planetary
Space Physics [0.0]
We build off a previous effort applying a semi-supervised physics-based classification of plasma instabilities in Saturn's magnetosphere.
We show that incorporating knowledge of these orbiting spacecraft data characteristics improves the performance and interpretability of machine learning methods.
arXiv Detail & Related papers (2020-06-02T20:31:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.