Active machine learning for spatio-temporal predictions using feature
embedding
- URL: http://arxiv.org/abs/2012.04407v1
- Date: Tue, 8 Dec 2020 12:55:29 GMT
- Title: Active machine learning for spatio-temporal predictions using feature
embedding
- Authors: Arsam Aryandoust, Stefan Pfenninger
- Abstract summary: Active learning could contribute to solving environmental problems through improved critical-temporal predictions.
Here, we propose a novel batch AL method that fills this gap.
We encode and cluster features of candidate data points, and query the best data based on the distance of embedded features to their cluster centers.
- Score: 0.537133760455631
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Active learning (AL) could contribute to solving critical environmental
problems through improved spatio-temporal predictions. Yet such predictions
involve high-dimensional feature spaces with mixed data types and missing data,
which existing methods have difficulties dealing with. Here, we propose a novel
batch AL method that fills this gap. We encode and cluster features of
candidate data points, and query the best data based on the distance of
embedded features to their cluster centers. We introduce a new metric of
informativeness that we call embedding entropy and a general class of neural
networks that we call embedding networks for using it. Empirical tests on
forecasting electricity demand show a simultaneous reduction in prediction
error by up to 63-88% and data usage by up to 50-69% compared to passive
learning (PL) benchmarks.
Related papers
- Scalable Spatiotemporal Prediction with Bayesian Neural Fields [3.3299088915999295]
BayesNF is a novel deep neural network architecture for high-capacity function estimation.
We evaluate BayesNF against statistical machine-learning prediction problems from climate and public health datasets.
arXiv Detail & Related papers (2024-03-12T13:47:50Z) - Minimally Supervised Learning using Topological Projections in
Self-Organizing Maps [55.31182147885694]
We introduce a semi-supervised learning approach based on topological projections in self-organizing maps (SOMs)
Our proposed method first trains SOMs on unlabeled data and then a minimal number of available labeled data points are assigned to key best matching units (BMU)
Our results indicate that the proposed minimally supervised model significantly outperforms traditional regression techniques.
arXiv Detail & Related papers (2024-01-12T22:51:48Z) - LMD: Light-weight Prediction Quality Estimation for Object Detection in
Lidar Point Clouds [3.927702899922668]
Object detection on Lidar point cloud data is a promising technology for autonomous driving and robotics.
Uncertainty estimation is a crucial component for down-stream tasks and deep neural networks remain error-prone even for predictions with high confidence.
We propose LidarMetaDetect, a light-weight post-processing scheme for prediction quality estimation.
Our experiments show a significant increase of statistical reliability in separating true from false predictions.
arXiv Detail & Related papers (2023-06-13T15:13:29Z) - ASPEST: Bridging the Gap Between Active Learning and Selective
Prediction [56.001808843574395]
Selective prediction aims to learn a reliable model that abstains from making predictions when uncertain.
Active learning aims to lower the overall labeling effort, and hence human dependence, by querying the most informative examples.
In this work, we introduce a new learning paradigm, active selective prediction, which aims to query more informative samples from the shifted target domain.
arXiv Detail & Related papers (2023-04-07T23:51:07Z) - Efficient and Robust Classification for Sparse Attacks [34.48667992227529]
We consider perturbations bounded by the $ell$--norm, which have been shown as effective attacks in the domains of image-recognition, natural language processing, and malware-detection.
We propose a novel defense method that consists of "truncation" and "adrial training"
Motivated by the insights we obtain, we extend these components to neural network classifiers.
arXiv Detail & Related papers (2022-01-23T21:18:17Z) - Convolutional generative adversarial imputation networks for
spatio-temporal missing data in storm surge simulations [86.5302150777089]
Generative Adversarial Imputation Nets (GANs) and GAN-based techniques have attracted attention as unsupervised machine learning methods.
We name our proposed method as Con Conval Generative Adversarial Imputation Nets (Conv-GAIN)
arXiv Detail & Related papers (2021-11-03T03:50:48Z) - Efficient Nearest Neighbor Language Models [114.40866461741795]
Non-parametric neural language models (NLMs) learn predictive distributions of text utilizing an external datastore.
We show how to achieve up to a 6x speed-up in inference speed while retaining comparable performance.
arXiv Detail & Related papers (2021-09-09T12:32:28Z) - ALT-MAS: A Data-Efficient Framework for Active Testing of Machine
Learning Algorithms [58.684954492439424]
We propose a novel framework to efficiently test a machine learning model using only a small amount of labeled test data.
The idea is to estimate the metrics of interest for a model-under-test using Bayesian neural network (BNN)
arXiv Detail & Related papers (2021-04-11T12:14:04Z) - Network Classifiers Based on Social Learning [71.86764107527812]
We propose a new way of combining independently trained classifiers over space and time.
The proposed architecture is able to improve prediction performance over time with unlabeled data.
We show that this strategy results in consistent learning with high probability, and it yields a robust structure against poorly trained classifiers.
arXiv Detail & Related papers (2020-10-23T11:18:20Z) - Early Forecasting of Text Classification Accuracy and F-Measure with
Active Learning [0.7614628596146599]
We investigate the difference in forecasting difficulty when using accuracy and F-measure as the text classification system performance metrics.
We find that forecasting is easiest for decision tree learning, moderate for Support Vector Machines, and most difficult for neural networks.
arXiv Detail & Related papers (2020-01-20T06:27:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.