FSDR: A Novel Deep Learning-based Feature Selection Algorithm for Pseudo
Time-Series Data using Discrete Relaxation
- URL: http://arxiv.org/abs/2403.08403v1
- Date: Wed, 13 Mar 2024 10:37:52 GMT
- Title: FSDR: A Novel Deep Learning-based Feature Selection Algorithm for Pseudo
Time-Series Data using Discrete Relaxation
- Authors: Mohammad Rahman, Manzur Murshed, Shyh Wei Teng, Manoranjan Paul
- Abstract summary: We introduce a Deep Learning-based feature selection algorithm: Feature Selection through Discrete Relaxation (FSDR)
FSDR is capable of accommodating a high number of feature dimensions, a capability beyond the reach of existing DL-based or traditional methods.
- Score: 9.769546018094665
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Conventional feature selection algorithms applied to Pseudo Time-Series (PTS)
data, which consists of observations arranged in sequential order without
adhering to a conventional temporal dimension, often exhibit impractical
computational complexities with high dimensional data. To address this
challenge, we introduce a Deep Learning (DL)-based feature selection algorithm:
Feature Selection through Discrete Relaxation (FSDR), tailored for PTS data.
Unlike the existing feature selection algorithms, FSDR learns the important
features as model parameters using discrete relaxation, which refers to the
process of approximating a discrete optimisation problem with a continuous one.
FSDR is capable of accommodating a high number of feature dimensions, a
capability beyond the reach of existing DL-based or traditional methods.
Through testing on a hyperspectral dataset (i.e., a type of PTS data), our
experimental results demonstrate that FSDR outperforms three commonly used
feature selection algorithms, taking into account a balance among execution
time, $R^2$, and $RMSE$.
Related papers
- Utility-Diversity Aware Online Batch Selection for LLM Supervised Fine-tuning [49.04912820721943]
Supervised fine-tuning (SFT) is computationally expensive and sometimes suffers from overfitting or bias amplification.<n>This work studies the online batch selection family that dynamically scores and filters samples during the training process.<n>We develop textbfUDS (Utility-Diversity Sampling), a framework for efficient online batch selection in SFT.
arXiv Detail & Related papers (2025-10-19T15:32:01Z) - DeepLINK-T: deep learning inference for time series data using knockoffs and LSTM [7.831716013866564]
This study introduces a novel method, Deep Learning Inference using Knockoffs for Time series data (DeepLink-T)
It focuses on the selection of significant time series variables in regression while controlling the false discovery rate (FDR) at a predetermined level.
Three key ingredients for DeepLink-T are 1) a Long Short-Term Memory (LSTM) autoencoder for generating time series knockoff variables, 2) an LSTM prediction network using both original and knockoff variables, and 3) the application of the knockoffs framework for variable selection with FDR control.
arXiv Detail & Related papers (2024-04-05T17:47:50Z) - Embedded feature selection in LSTM networks with multi-objective
evolutionary ensemble learning for time series forecasting [49.1574468325115]
We present a novel feature selection method embedded in Long Short-Term Memory networks.
Our approach optimize the weights and biases of the LSTM in a partitioned manner.
Experimental evaluations on air quality time series data from Italy and southeast Spain demonstrate that our method substantially improves the ability generalization of conventional LSTMs.
arXiv Detail & Related papers (2023-12-29T08:42:10Z) - B-LSTM-MIONet: Bayesian LSTM-based Neural Operators for Learning the
Response of Complex Dynamical Systems to Length-Variant Multiple Input
Functions [6.75867828529733]
Multiple-input deep neural operators (MIONet) extended DeepONet to allow multiple input functions in different Banach spaces.
MIONet offers flexibility in training dataset grid spacing, without constraints on output location.
This work redesigns MIONet, integrating Long Short Term Memory (LSTM) to learn neural operators from time-dependent data.
arXiv Detail & Related papers (2023-11-28T04:58:17Z) - State Sequences Prediction via Fourier Transform for Representation
Learning [111.82376793413746]
We propose State Sequences Prediction via Fourier Transform (SPF), a novel method for learning expressive representations efficiently.
We theoretically analyze the existence of structural information in state sequences, which is closely related to policy performance and signal regularity.
Experiments demonstrate that the proposed method outperforms several state-of-the-art algorithms in terms of both sample efficiency and performance.
arXiv Detail & Related papers (2023-10-24T14:47:02Z) - Online Feature Selection for Efficient Learning in Networked Systems [3.13468877208035]
Current AI/ML methods for data-driven engineering use models that are mostly trained offline.
We present an online algorithm called Online Stable Feature Set Algorithm (OSFS), which selects a small feature set from a large number of available data sources.
OSFS achieves a massive reduction in the size of the feature set by 1-3 orders of magnitude on all investigated datasets.
arXiv Detail & Related papers (2021-12-15T16:31:59Z) - A Feature Selection Method for Multi-Dimension Time-Series Data [2.055949720959582]
Time-series data in application areas such as motion capture and activity recognition is often multi-dimension.
There is a lot of redundancy in these data streams and good classification accuracy will often be achievable with a small number of features.
We present a method for feature subset selection on multidimensional time-series data based on mutual information.
arXiv Detail & Related papers (2021-04-22T14:49:00Z) - Deep Cellular Recurrent Network for Efficient Analysis of Time-Series
Data with Spatial Information [52.635997570873194]
This work proposes a novel deep cellular recurrent neural network (DCRNN) architecture to process complex multi-dimensional time series data with spatial information.
The proposed architecture achieves state-of-the-art performance while utilizing substantially less trainable parameters when compared to comparable methods in the literature.
arXiv Detail & Related papers (2021-01-12T20:08:18Z) - Learning summary features of time series for likelihood free inference [93.08098361687722]
We present a data-driven strategy for automatically learning summary features from time series data.
Our results indicate that learning summary features from data can compete and even outperform LFI methods based on hand-crafted values.
arXiv Detail & Related papers (2020-12-04T19:21:37Z) - Learned Factor Graphs for Inference from Stationary Time Sequences [107.63351413549992]
We propose a framework that combines model-based algorithms and data-driven ML tools for stationary time sequences.
neural networks are developed to separately learn specific components of a factor graph describing the distribution of the time sequence.
We present an inference algorithm based on learned stationary factor graphs, which learns to implement the sum-product scheme from labeled data.
arXiv Detail & Related papers (2020-06-05T07:06:19Z) - Unsupervised Online Anomaly Detection On Irregularly Sampled Or Missing
Valued Time-Series Data Using LSTM Networks [0.0]
We study anomaly detection and introduce an algorithm that processes variable length, irregularly sampled sequences or sequences with missing values.
Our algorithm is fully unsupervised, however, can be readily extended to supervised or semisupervised cases.
arXiv Detail & Related papers (2020-05-25T09:41:04Z) - Supervised Feature Subset Selection and Feature Ranking for Multivariate
Time Series without Feature Extraction [78.84356269545157]
We introduce supervised feature ranking and feature subset selection algorithms for MTS classification.
Unlike most existing supervised/unsupervised feature selection algorithms for MTS our techniques do not require a feature extraction step to generate a one-dimensional feature vector from the time series.
arXiv Detail & Related papers (2020-05-01T07:46:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.