Deep Learning for Reference-Free Geolocation for Poplar Trees
- URL: http://arxiv.org/abs/2301.13387v1
- Date: Tue, 31 Jan 2023 03:37:47 GMT
- Title: Deep Learning for Reference-Free Geolocation for Poplar Trees
- Authors: Cai W. John, Owen Queen, Wellington Muchero, and Scott J. Emrich
- Abstract summary: Geolocation is concerned with locating the native region of a given sample based on its genetic makeup.
Here, we investigate genomic geolocation of Populus trichocarpa, or poplar, which has been identified by the US Department of Energy as a fast-rotation biofuel crop.
Our model, MashNet, predicts latitude and longitude for poplar trees from randomly-sampled, unaligned sequence fragments.
- Score: 0.17999333451993943
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: A core task in precision agriculture is the identification of climatic and
ecological conditions that are advantageous for a given crop. The most succinct
approach is geolocation, which is concerned with locating the native region of
a given sample based on its genetic makeup. Here, we investigate genomic
geolocation of Populus trichocarpa, or poplar, which has been identified by the
US Department of Energy as a fast-rotation biofuel crop to be harvested
nationwide. In particular, we approach geolocation from a reference-free
perspective, circumventing the need for compute-intensive processes such as
variant calling and alignment. Our model, MashNet, predicts latitude and
longitude for poplar trees from randomly-sampled, unaligned sequence fragments.
We show that our model performs comparably to Locator, a state-of-the-art
method based on aligned whole-genome sequence data. MashNet achieves an error
of 34.0 km^2 compared to Locator's 22.1 km^2. MashNet allows growers to quickly
and efficiently identify natural varieties that will be most productive in
their growth environment based on genotype. This paper explores geolocation for
precision agriculture while providing a framework and data source for further
development by the machine learning community.
Related papers
- Spatioformer: A Geo-encoded Transformer for Large-Scale Plant Species Richness Prediction [3.017562867737194]
We propose Spatioformer, where a geolocation encoder is coupled with the transformer model to encode geolocation context into remote sensing imagery.
Results demonstrate that geolocational information is advantageous in predicting species richness from satellite observations over large spatial scales.
richness maps produced in this study reveal thetemporal dynamics of plant species richness in Australia, providing supporting evidence to inform effective planning and policy development for plant diversity conservation.
arXiv Detail & Related papers (2024-10-25T02:21:01Z) - Enhancing Worldwide Image Geolocation by Ensembling Satellite-Based Ground-Level Attribute Predictors [4.415977307120618]
We examine the challenge of estimating the location of a single ground-level image in the absence of GPS or other location metadata.
We introduce a novel metric, Recall vs Area, which measures the accuracy of estimated distributions of locations.
We then examine an ensembling approach to global-scale image geolocation, which incorporates information from multiple sources.
arXiv Detail & Related papers (2024-07-18T19:15:52Z) - G3: An Effective and Adaptive Framework for Worldwide Geolocalization Using Large Multi-Modality Models [40.69217368870192]
We propose a novel framework for worldwide geolocalization based on Retrieval-Augmented Generation (RAG)
G3 consists of three steps, i.e., Geo-alignment, Geo-diversification, and Geo-verification.
Experiments on two well-established datasets verify the superiority of G3 compared to other state-of-the-art methods.
arXiv Detail & Related papers (2024-05-23T15:37:06Z) - Predicting the Geothermal Gradient in Colombia: a Machine Learning Approach [0.0]
geothermal gradient determination is critical for assessing the geothermal energy potential of a given region.
We present an approach that leverages recent advances in supervised machine learning to predict the geothermal gradient.
We show that predictions of our model are within 12% accuracy and that independent measurements performed by other authors agree well with our model.
arXiv Detail & Related papers (2024-04-08T04:18:54Z) - GeoLLM: Extracting Geospatial Knowledge from Large Language Models [49.20315582673223]
We present GeoLLM, a novel method that can effectively extract geospatial knowledge from large language models.
We demonstrate the utility of our approach across multiple tasks of central interest to the international community, including the measurement of population density and economic livelihoods.
Our experiments reveal that LLMs are remarkably sample-efficient, rich in geospatial information, and robust across the globe.
arXiv Detail & Related papers (2023-10-10T00:03:23Z) - Geo-Encoder: A Chunk-Argument Bi-Encoder Framework for Chinese
Geographic Re-Ranking [61.60169764507917]
Chinese geographic re-ranking task aims to find the most relevant addresses among retrieved candidates.
We propose an innovative framework, namely Geo-Encoder, to more effectively integrate Chinese geographical semantics into re-ranking pipelines.
arXiv Detail & Related papers (2023-09-04T13:44:50Z) - Spatial Implicit Neural Representations for Global-Scale Species Mapping [72.92028508757281]
Given a set of locations where a species has been observed, the goal is to build a model to predict whether the species is present or absent at any location.
Traditional methods struggle to take advantage of emerging large-scale crowdsourced datasets.
We use Spatial Implicit Neural Representations (SINRs) to jointly estimate the geographical range of 47k species simultaneously.
arXiv Detail & Related papers (2023-06-05T03:36:01Z) - GeoGLUE: A GeoGraphic Language Understanding Evaluation Benchmark [56.08664336835741]
We propose a GeoGraphic Language Understanding Evaluation benchmark, named GeoGLUE.
We collect data from open-released geographic resources and introduce six natural language understanding tasks.
We pro vide evaluation experiments and analysis of general baselines, indicating the effectiveness and significance of the GeoGLUE benchmark.
arXiv Detail & Related papers (2023-05-11T03:21:56Z) - GeoNet: Benchmarking Unsupervised Adaptation across Geographies [71.23141626803287]
We study the problem of geographic robustness and make three main contributions.
First, we introduce a large-scale dataset GeoNet for geographic adaptation.
Second, we hypothesize that the major source of domain shifts arise from significant variations in scene context.
Third, we conduct an extensive evaluation of several state-of-the-art unsupervised domain adaptation algorithms and architectures.
arXiv Detail & Related papers (2023-03-27T17:59:34Z) - Estimating Crop Primary Productivity with Sentinel-2 and Landsat 8 using
Machine Learning Methods Trained with Radiative Transfer Simulations [58.17039841385472]
We take advantage of all parallel developments in mechanistic modeling and satellite data availability for advanced monitoring of crop productivity.
Our model successfully estimates gross primary productivity across a variety of C3 crop types and environmental conditions even though it does not use any local information from the corresponding sites.
This highlights its potential to map crop productivity from new satellite sensors at a global scale with the help of current Earth observation cloud computing platforms.
arXiv Detail & Related papers (2020-12-07T16:23:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.