Spatial-Temporal Data Mining for Ocean Science: Data, Methodologies, and
Opportunities
- URL: http://arxiv.org/abs/2307.10803v2
- Date: Thu, 3 Aug 2023 05:41:37 GMT
- Title: Spatial-Temporal Data Mining for Ocean Science: Data, Methodologies, and
Opportunities
- Authors: Hanchen Yang, Wengen Li, Shuyu Wang, Hui Li, Jihong Guan, Shuigeng
Zhou, Jiannong Cao
- Abstract summary: spatial-temporal data mining (STDM) studies have been conducted to address various oceanic issues.
STDM studies are difficult to design and train because of unique characteristics, e.g., diverse regionality and high sparsity.
This paper provides a comprehensive survey of existing STDM studies for ocean science.
- Score: 33.342420636418794
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: With the rapid amassing of spatial-temporal (ST) ocean data, many
spatial-temporal data mining (STDM) studies have been conducted to address
various oceanic issues, including climate forecasting and disaster warning.
Compared with typical ST data (e.g., traffic data), ST ocean data is more
complicated but with unique characteristics, e.g., diverse regionality and high
sparsity. These characteristics make it difficult to design and train STDM
models on ST ocean data. To the best of our knowledge, a comprehensive survey
of existing studies remains missing in the literature, which hinders not only
computer scientists from identifying the research issues in ocean data mining
but also ocean scientists to apply advanced STDM techniques. In this paper, we
provide a comprehensive survey of existing STDM studies for ocean science.
Concretely, we first review the widely-used ST ocean datasets and highlight
their unique characteristics. Then, typical ST ocean data quality enhancement
techniques are explored. Next, we classify existing STDM studies in ocean
science into four types of tasks, i.e., prediction, event detection, pattern
mining, and anomaly detection, and elaborate on the techniques for these tasks.
Finally, promising research opportunities are discussed. This survey can help
scientists from both computer science and ocean science better understand the
fundamental concepts, key techniques, and open challenges of STDM for ocean
science.
Related papers
- Foundation Models for Remote Sensing and Earth Observation: A Survey [101.77425018347557]
This survey systematically reviews the emerging field of Remote Sensing Foundation Models (RSFMs)
It begins with an outline of their motivation and background, followed by an introduction of their foundational concepts.
We benchmark these models against publicly available datasets, discuss existing challenges, and propose future research directions.
arXiv Detail & Related papers (2024-10-22T01:08:21Z) - Causal Representation Learning in Temporal Data via Single-Parent Decoding [66.34294989334728]
Scientific research often seeks to understand the causal structure underlying high-level variables in a system.
Scientists typically collect low-level measurements, such as geographically distributed temperature readings.
We propose a differentiable method, Causal Discovery with Single-parent Decoding, that simultaneously learns the underlying latents and a causal graph over them.
arXiv Detail & Related papers (2024-10-09T15:57:50Z) - A Survey of Generative Techniques for Spatial-Temporal Data Mining [93.55501980723974]
This paper focuses on the integration of generative techniques into spatial-temporal data mining.
The paper provides a comprehensive analysis of generative technique-based spatial-temporal methods.
It also introduces a standardized framework specifically designed for the spatial-temporal data mining pipeline.
arXiv Detail & Related papers (2024-05-15T12:07:43Z) - Deep Learning for Spatiotemporal Big Data: A Vision on Opportunities and
Challenges [4.497634148674422]
Intemporal big data can foster new opportunities to solve problems that have not been possible before.
The distinctive characteristics of big data pose new challenges for deep learning technologies.
arXiv Detail & Related papers (2023-10-30T19:12:51Z) - OceanGPT: A Large Language Model for Ocean Science Tasks [37.053614694078014]
We introduce OceanGPT, the first-ever large language model in the ocean domain, which is expert in various ocean science tasks.
We also propose OceanGPT, a novel framework to automatically obtain a large volume of ocean domain instruction data.
We construct the first oceanography benchmark, OceanBench, to evaluate the capabilities of LLMs in the ocean domain.
arXiv Detail & Related papers (2023-10-03T13:17:35Z) - Physical Knowledge Enhanced Deep Neural Network for Sea Surface
Temperature Prediction [29.989387641655625]
We introduce a method for Sea Surface Temperature (SST) prediction that transfers physical knowledge from historical observations to numerical models.
Specifically, we use a combination of an encoder and a generative adversarial network (GAN) to capture physical knowledge from the observed data.
The numerical model data is then fed into the pre-trained model to generate physics-enhanced data, which can then be used for SST prediction.
arXiv Detail & Related papers (2023-04-19T02:08:54Z) - Modeling Information Change in Science Communication with Semantically
Matched Paraphrases [50.67030449927206]
SPICED is the first paraphrase dataset of scientific findings annotated for degree of information change.
SPICED contains 6,000 scientific finding pairs extracted from news stories, social media discussions, and full texts of original papers.
Models trained on SPICED improve downstream performance on evidence retrieval for fact checking of real-world scientific claims.
arXiv Detail & Related papers (2022-10-24T07:44:38Z) - Synergy between Observation Systems Oceanic in Turbulent Regions [0.0]
Ocean dynamics constitute a source of incertitude in determining the ocean's role in complex climatic phenomena.
Current observation systems have limitations in achieving sufficiently statistical precision for three-dimensional oceanic data.
We present the data-driven approaches which explore latent class regressions and deep regression neural networks in modeling ocean dynamics in the extensions of Gulf Stream and Kuroshio currents.
arXiv Detail & Related papers (2020-12-28T22:52:57Z) - Deep Learning for Surface Wave Identification in Distributed Acoustic
Sensing Data [1.7237878022600697]
We present a highly scalable and efficient approach to process real, complex DAS data.
Deep supervised learning is used to identify "useful" coherent surface waves generated by anthropogenic activity.
Our method provides interpretable patterns describing the interaction of ground-based human activities with the buried sensors.
arXiv Detail & Related papers (2020-10-15T15:53:03Z) - A Data Scientist's Guide to Streamflow Prediction [55.22219308265945]
We focus on the element of hydrologic rainfall--runoff models and their application to forecast floods and predict streamflow.
This guide aims to help interested data scientists gain an understanding of the problem, the hydrologic concepts involved, and the details that come up along the way.
arXiv Detail & Related papers (2020-06-05T08:04:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.