There Are No Data Like More Data- Datasets for Deep Learning in Earth
Observation
- URL: http://arxiv.org/abs/2310.19231v1
- Date: Mon, 30 Oct 2023 02:19:16 GMT
- Title: There Are No Data Like More Data- Datasets for Deep Learning in Earth
Observation
- Authors: Michael Schmitt and Seyed Ali Ahmadi and Yonghao Xu and Gulsen Taskin
and Ujjwal Verma and Francescopaolo Sica and Ronny Hansch
- Abstract summary: We want to put machine learning datasets dedicated to Earth observation data into the spotlight.
We hope to contribute to an understanding that the nature of our data is what distinguishes the Earth observation community.
- Score: 6.839093061382966
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Carefully curated and annotated datasets are the foundation of machine
learning, with particularly data-hungry deep neural networks forming the core
of what is often called Artificial Intelligence (AI). Due to the massive
success of deep learning applied to Earth Observation (EO) problems, the focus
of the community has been largely on the development of ever-more sophisticated
deep neural network architectures and training strategies largely ignoring the
overall importance of datasets. For that purpose, numerous task-specific
datasets have been created that were largely ignored by previously published
review articles on AI for Earth observation. With this article, we want to
change the perspective and put machine learning datasets dedicated to Earth
observation data and applications into the spotlight. Based on a review of the
historical developments, currently available resources are described and a
perspective for future developments is formed. We hope to contribute to an
understanding that the nature of our data is what distinguishes the Earth
observation community from many other communities that apply deep learning
techniques to image data, and that a detailed understanding of EO data
peculiarities is among the core competencies of our discipline.
Related papers
- Foundation Models for Remote Sensing and Earth Observation: A Survey [101.77425018347557]
This survey systematically reviews the emerging field of Remote Sensing Foundation Models (RSFMs)
It begins with an outline of their motivation and background, followed by an introduction of their foundational concepts.
We benchmark these models against publicly available datasets, discuss existing challenges, and propose future research directions.
arXiv Detail & Related papers (2024-10-22T01:08:21Z) - Deep Learning for Spatiotemporal Big Data: A Vision on Opportunities and
Challenges [4.497634148674422]
Intemporal big data can foster new opportunities to solve problems that have not been possible before.
The distinctive characteristics of big data pose new challenges for deep learning technologies.
arXiv Detail & Related papers (2023-10-30T19:12:51Z) - On Responsible Machine Learning Datasets with Fairness, Privacy, and Regulatory Norms [56.119374302685934]
There have been severe concerns over the trustworthiness of AI technologies.
Machine and deep learning algorithms depend heavily on the data used during their development.
We propose a framework to evaluate the datasets through a responsible rubric.
arXiv Detail & Related papers (2023-10-24T14:01:53Z) - Towards Data-centric Graph Machine Learning: Review and Outlook [120.64417630324378]
We introduce a systematic framework, Data-centric Graph Machine Learning (DC-GML), that encompasses all stages of the graph data lifecycle.
A thorough taxonomy of each stage is presented to answer three critical graph-centric questions.
We pinpoint the future prospects of the DC-GML domain, providing insights to navigate its advancements and applications.
arXiv Detail & Related papers (2023-09-20T00:40:13Z) - Data-centric Artificial Intelligence: A Survey [47.24049907785989]
Recently, the role of data in AI has been significantly magnified, giving rise to the emerging concept of data-centric AI.
In this survey, we discuss the necessity of data-centric AI, followed by a holistic view of three general data-centric goals.
We believe this is the first comprehensive survey that provides a global view of a spectrum of tasks across various stages of the data lifecycle.
arXiv Detail & Related papers (2023-03-17T17:44:56Z) - Data-centric AI: Perspectives and Challenges [51.70828802140165]
Data-centric AI (DCAI) advocates a fundamental shift from model advancements to ensuring data quality and reliability.
We bring together three general missions: training data development, inference data development, and data maintenance.
arXiv Detail & Related papers (2023-01-12T05:28:59Z) - Big Earth Data and Machine Learning for Sustainable and Resilient
Agriculture [0.0]
This thesis recognizes the unprecedented opportunities offered by the high quality and open access Earth observation data of our times.
It introduces novel machine learning and big data methods to properly exploit them towards developing applications for sustainable and resilient agriculture.
arXiv Detail & Related papers (2022-11-22T20:58:54Z) - On The State of Data In Computer Vision: Human Annotations Remain
Indispensable for Developing Deep Learning Models [0.0]
High-quality labeled datasets play a crucial role in fueling the development of machine learning (ML)
Since the emergence of the ImageNet dataset and the AlexNet model in 2012, the size of new open-source labeled vision datasets has remained roughly constant.
Only a minority of publications in the computer vision community tackle supervised learning on datasets that are orders of magnitude larger than Imagenet.
arXiv Detail & Related papers (2021-07-31T00:08:21Z) - REGRAD: A Large-Scale Relational Grasp Dataset for Safe and
Object-Specific Robotic Grasping in Clutter [52.117388513480435]
We present a new dataset named regrad to sustain the modeling of relationships among objects and grasps.
Our dataset is collected in both forms of 2D images and 3D point clouds.
Users are free to import their own object models for the generation of as many data as they want.
arXiv Detail & Related papers (2021-04-29T05:31:21Z) - Machine Learning Information Fusion in Earth Observation: A
Comprehensive Review of Methods, Applications and Data Sources [0.0]
This paper reviews the most important information fusion algorithms based on Machine Learning (ML) techniques for problems in Earth observation.
Data-driven approaches, and ML techniques in particular, are the natural choice to extract significant information from this data deluge.
arXiv Detail & Related papers (2020-12-07T13:35:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.