Knowledge Guided Representation Learning and Causal Structure Learning
in Soil Science
- URL: http://arxiv.org/abs/2306.09302v1
- Date: Thu, 15 Jun 2023 17:31:13 GMT
- Title: Knowledge Guided Representation Learning and Causal Structure Learning
in Soil Science
- Authors: Somya Sharma, Swati Sharma, Licheng Liu, Rishabh Tushir, Andy Neal,
Robert Ness, John Crawford, Emre Kiciman, Ranveer Chandra
- Abstract summary: We propose a framework, knowledge-guided representation learning, and causal structure learning (KGRCL) to accelerate scientific discoveries in soil science.
The framework improves representation learning for simulated soil processes via conditional distribution matching with observed soil processes.
The learned causal graph is more representative of ground truth than other graphs generated from other causal discovery methods.
- Score: 7.242065002172681
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: An improved understanding of soil can enable more sustainable land-use
practices. Nevertheless, soil is called a complex, living medium due to the
complex interaction of different soil processes that limit our understanding of
soil. Process-based models and analyzing observed data provide two avenues for
improving our understanding of soil processes. Collecting observed data is
cost-prohibitive but reflects real-world behavior, while process-based models
can be used to generate ample synthetic data which may not be representative of
reality. We propose a framework, knowledge-guided representation learning, and
causal structure learning (KGRCL), to accelerate scientific discoveries in soil
science. The framework improves representation learning for simulated soil
processes via conditional distribution matching with observed soil processes.
Simultaneously, the framework leverages both observed and simulated data to
learn a causal structure among the soil processes. The learned causal graph is
more representative of ground truth than other graphs generated from other
causal discovery methods. Furthermore, the learned causal graph is leveraged in
a supervised learning setup to predict the impact of fertilizer use and
changing weather on soil carbon. We present the results in five different
locations to show the improvement in the prediction performance in
out-of-sample and few-shots setting.
Related papers
- Learning to refine domain knowledge for biological network inference [2.209921757303168]
Perturbation experiments allow biologists to discover causal relationships between variables of interest.
The sparsity and high dimensionality of these data pose significant challenges for causal structure learning algorithms.
We propose an amortized algorithm for refining domain knowledge, based on data observations.
arXiv Detail & Related papers (2024-10-18T12:53:23Z) - Causal Representation Learning in Temporal Data via Single-Parent Decoding [66.34294989334728]
Scientific research often seeks to understand the causal structure underlying high-level variables in a system.
Scientists typically collect low-level measurements, such as geographically distributed temperature readings.
We propose a differentiable method, Causal Discovery with Single-parent Decoding, that simultaneously learns the underlying latents and a causal graph over them.
arXiv Detail & Related papers (2024-10-09T15:57:50Z) - Smoke and Mirrors in Causal Downstream Tasks [59.90654397037007]
This paper looks at the causal inference task of treatment effect estimation, where the outcome of interest is recorded in high-dimensional observations.
We compare 6 480 models fine-tuned from state-of-the-art visual backbones, and find that the sampling and modeling choices significantly affect the accuracy of the causal estimate.
Our results suggest that future benchmarks should carefully consider real downstream scientific questions, especially causal ones.
arXiv Detail & Related papers (2024-05-27T13:26:34Z) - Domain Adaptation for Sustainable Soil Management using Causal and
Contrastive Constraint Minimization [13.436399861462323]
We propose a multi-modal, scalable framework that can estimate organic matter from remote sensing data.
We leverage inherent structure in the data and train the model to discriminate among domains using contrastive learning.
We shed light on the interpretability of the framework by identifying attributes that are important for improving generalization.
arXiv Detail & Related papers (2024-01-13T23:51:42Z) - SSL-SoilNet: A Hybrid Transformer-based Framework with Self-Supervised Learning for Large-scale Soil Organic Carbon Prediction [2.554658234030785]
This study introduces a novel approach that aims to learn the geographical link between multimodal features via self-supervised contrastive learning.
The proposed approach has undergone rigorous testing on two distinct large-scale datasets.
arXiv Detail & Related papers (2023-08-07T13:44:44Z) - Causal Modeling of Soil Processes for Improved Generalization [6.839859020308749]
Soil organic carbon enriches nutrition in soil, but also has a gamut of co-benefits such as improving water storage and limiting physical erosion.
Current approaches do not generalize well across soil conditions and management practices.
We empirically show that explicit modeling of cause-and-effect relationships among the soil processes improves the out-of-distribution generalizability of prediction models.
arXiv Detail & Related papers (2022-11-10T16:26:13Z) - Taxonomizing local versus global structure in neural network loss
landscapes [60.206524503782006]
We show that the best test accuracy is obtained when the loss landscape is globally well-connected.
We also show that globally poorly-connected landscapes can arise when models are small or when they are trained to lower quality data.
arXiv Detail & Related papers (2021-07-23T13:37:14Z) - Estimating Crop Primary Productivity with Sentinel-2 and Landsat 8 using
Machine Learning Methods Trained with Radiative Transfer Simulations [58.17039841385472]
We take advantage of all parallel developments in mechanistic modeling and satellite data availability for advanced monitoring of crop productivity.
Our model successfully estimates gross primary productivity across a variety of C3 crop types and environmental conditions even though it does not use any local information from the corresponding sites.
This highlights its potential to map crop productivity from new satellite sensors at a global scale with the help of current Earth observation cloud computing platforms.
arXiv Detail & Related papers (2020-12-07T16:23:13Z) - Semi-supervised Soil Moisture Prediction through Graph Neural Networks [12.891517184512551]
We propose to convert the problem of soil moisture prediction as a semi-supervised learning on temporal graphs.
We propose a dynamic graph neural network which can use the dependency of related locations over a region to predict soil moisture.
Our algorithm, referred as DGLR, provides an end-to-end learning which can predict soil moisture over multiple locations in a region over time and also update the graph structure in between.
arXiv Detail & Related papers (2020-12-07T07:56:11Z) - Polyp-artifact relationship analysis using graph inductive learned
representations [52.900974021773024]
The diagnosis process of colorectal cancer mainly focuses on the localization and characterization of abnormal growths in the colon tissue known as polyps.
Despite recent advances in deep object localization, the localization of polyps remains challenging due to the similarities between tissues, and the high level of artifacts.
Recent studies have shown the negative impact of the presence of artifacts in the polyp detection task, and have started to take them into account within the training process.
arXiv Detail & Related papers (2020-09-15T13:56:39Z) - Causal Discovery in Physical Systems from Videos [123.79211190669821]
Causal discovery is at the core of human cognition.
We consider the task of causal discovery from videos in an end-to-end fashion without supervision on the ground-truth graph structure.
arXiv Detail & Related papers (2020-07-01T17:29:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.