A Gradient Boosted Mixed-Model Machine Learning Framework for Vessel Speed in the U.S. Arctic
- URL: http://arxiv.org/abs/2602.12292v1
- Date: Sat, 31 Jan 2026 14:53:28 GMT
- Title: A Gradient Boosted Mixed-Model Machine Learning Framework for Vessel Speed in the U.S. Arctic
- Authors: Mauli Pant, Linda Fernandez, Indranil Sahoo,
- Abstract summary: We analyzed Automatic Identification System (AIS) data from 2010-2019 to examine vessel speed over ground (SOG)<n>We applied a two-stage machine learning framework, first modeling the probability of SOG greater than zero and then modeling SOG conditional on being positive.<n>Results: Distance to coast and bathymetric depth were dominant determinants of both the likelihood and magnitude of vessel speed, while changes in course, vessel group, and navigational status introduced secondary variation.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Understanding how environmental and operational conditions influence vessel speed is crucial for characterizing navigational conditions in the Arctic. We analyzed Automatic Identification System (AIS) data from 2010-2019 to examine vessel speed over ground (SOG). Over half of the AIS records showed zero SOG, and treating zero and positive SOG as a single continuous process can obscure important patterns. We therefore applied a two-stage machine learning framework, first modeling the probability of SOG greater than zero and then modeling SOG conditional on being positive. AIS observations were integrated with sea ice concentration, course over ground, wind, bathymetric depth, distance to coast, vessel group, and navigational status. Gradient boosted decision trees with random effects captured nonlinear environmental responses while accounting for repeated observations. The positive SOG classifier achieved strong discrimination (AUC = 0.85), while the conditional speed model explained approximately 77 percent of out-of-fold variance. SHAP values quantified covariate effects by decomposing model predictions into additive contributions from individual variables. Distance to coast and bathymetric depth were dominant determinants of both the likelihood and magnitude of vessel speed, while changes in course, vessel group, and navigational status introduced secondary variation. Wind and sea ice effects were modest. Together, these results empirically characterize Arctic vessel operating regimes relevant to speed management and corridor-level assessment.
Related papers
- IceBench-S2S: A Benchmark of Deep Learning for Challenging Subseasonal-to-Seasonal Daily Arctic Sea Ice Forecasting in Deep Latent Space [39.539136137431335]
Recent advances in artificial intelligence have facilitated the development of data-driven pan-Arctic sea ice forecasting systems.<n>We introduce IceBench-S2S, the first comprehensive benchmark for evaluating DL approaches in mitigating the challenge of forecasting Arctic sea ice concentration in successive 180-day periods.<n>IceBench-S2S provides a unified training and evaluation pipeline for different backbones, along with practical guidance for model selection in polar environmental monitoring tasks.
arXiv Detail & Related papers (2026-01-31T01:40:16Z) - Time-Varying Causal Treatment for Quantifying the Causal Effect of Short-Term Variations on Arctic Sea Ice Dynamics [0.16206783799607727]
Quantifying the causal relationship between ice melt and freshwater distribution is critical, as these complex interactions manifest as regional fluctuations in sea surface height.<n>We propose the Knowledge-Guided Variational Autoencoder (KGCM-VAE) to quantify causal mechanisms between sea ice thickness and SSH.<n> Experimental results on both synthetic and real-world Arctic datasets demonstrate that KGCM-VAE superior PEHE compared to state-of-the-art benchmarks.
arXiv Detail & Related papers (2026-01-25T01:44:55Z) - OceanSAR-2: A Universal Feature Extractor for SAR Ocean Observation [55.978228064498865]
We present OceanSAR-2, the second generation of our foundation model for SAR-based ocean observation.<n>Building on our earlier release, which pioneered self-supervised learning on Sentinel-1 Wave Mode data, OceanSAR-2 relies on improved SSL training and dynamic data curation strategies.
arXiv Detail & Related papers (2026-01-12T10:20:43Z) - A Rapid GeoSAM-Based Workflow for Multi-Temporal Glacier Delineation: Case Study from Svalbard [51.56484100374058]
We present a GeoSAM-based, semi-automatic workflow for rapid glacier delineation from Sentinel-2 imagery.<n>Results show that the approach produces spatially coherent and temporally consistent outlines for major glacier bodies.<n>The reliance on derived RGB imagery makes the method flexible and transferable to other optical datasets.
arXiv Detail & Related papers (2025-12-28T09:42:01Z) - Interpretable Machine Learning for Reservoir Water Temperatures in the U.S. Red River Basin of the South [0.0]
We integrate explainable machine learning (ML) with symbolic modeling to uncover the drivers of Reservoir Water Temperature (RWT) dynamics.<n>We employ ensemble and neural models, including Random Forest (RF), Extreme Gradient Boosting (XGBoost), and Multilayer Perceptron (MLP)<n>To translate these data-driven insights into compact analytical expressions, we developed Kolmogorov Arnold Networks (KANs) to symbolically approximate RWT.
arXiv Detail & Related papers (2025-11-03T18:45:27Z) - Predicting Barge Tow Size on Inland Waterways Using Vessel Trajectory Derived Features: Proof of Concept [0.0]
This study introduces a novel method to use Automatic Identification System (AIS) vessel tracking data to predict the number of barges in tow using Machine Learning (ML)<n>To train and test the model, barge instances were annotated from satellite scenes across the Lower Mississippi River.<n>The proposed approach provides a scalable, readily implementable method for enhancing Maritime Domain Awareness (MDA) with strong potential applications in lock scheduling, port management, and freight planning.
arXiv Detail & Related papers (2025-10-28T01:51:23Z) - Causal Time Series Modeling of Supraglacial Lake Evolution in Greenland under Distribution Shift [2.5551933647600693]
Causal modeling offers a principled foundation for uncovering stable, invariant relationships in time-series data.<n>We propose RIC-TSC, a regionally-informed causal time-series classification framework that embeds lag-aware causal discovery directly into sequence modeling.
arXiv Detail & Related papers (2025-10-17T03:06:08Z) - Multi-Modal Learning-based Reconstruction of High-Resolution Spatial
Wind Speed Fields [46.72819846541652]
We propose a framework based on Vari Data Assimilation and Deep Learning concepts.
This framework is applied to recover rich-in-time, high-resolution information on sea surface wind speed.
arXiv Detail & Related papers (2023-12-14T13:40:39Z) - Learning-based estimation of in-situ wind speed from underwater
acoustics [58.293528982012255]
We introduce a deep learning approach for the retrieval of wind speed time series from underwater acoustics.
Our approach bridges data assimilation and learning-based frameworks to benefit both from prior physical knowledge and computational efficiency.
arXiv Detail & Related papers (2022-08-18T15:27:40Z) - SALT: Sea lice Adaptive Lattice Tracking -- An Unsupervised Approach to
Generate an Improved Ocean Model [72.3183990520267]
We propose SALT: Sea lice Adaptive Lattice Tracking approach for efficient estimation of sea lice dispersion and distribution.
Specifically, an adaptive spatial mesh is generated by merging nodes in the lattice graph of the Ocean Model based on local ocean properties.
The proposed SALT technique shows promise for enhancing proactive aquaculture management through predictive modelling of sea lice infestation pressure maps in a changing climate.
arXiv Detail & Related papers (2021-06-24T17:29:42Z) - Constraining subglacial processes from surface velocity observations
using surrogate-based Bayesian inference [0.0]
Basal motion is the primary mechanism for ice flux outside Antarctica, yet a widely applicable model for predicting it remains elusive.
This is due to the difficulty in both observing small-scale bed properties and predicting a time-varying water pressure on which basal motion putatively depends.
We take a Bayesian approach to these problems by coupling models of ice dynamics and subglacial hydrology and conditioning on observations of surface velocity in southwestern Greenland.
arXiv Detail & Related papers (2020-06-22T16:47:33Z) - Sub-Seasonal Climate Forecasting via Machine Learning: Challenges,
Analysis, and Advances [44.28969320556008]
Sub-seasonal climate forecasting (SSF) focuses on predicting key climate variables such as temperature and precipitation in the 2-week to 2-month time scales.
In this paper, we study a variety of machine learning (ML) approaches for SSF over the US mainland.
arXiv Detail & Related papers (2020-06-14T18:39:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.