Unsupervised Feature Selection via Multi-step Markov Transition
Probability
- URL: http://arxiv.org/abs/2005.14359v1
- Date: Fri, 29 May 2020 01:15:16 GMT
- Title: Unsupervised Feature Selection via Multi-step Markov Transition
Probability
- Authors: Yan Min, Mao Ye, Liang Tian, Yulin Jian, Ce Zhu, Shangming Yang
- Abstract summary: We propose a novel and very simple approach for unsupervised feature selection, named MMFS.
The idea is using multi-step Markov transition probability to describe the relation between any data pair.
The performance of our approach is compared with the state-of-the-art methods on eight real-world data sets.
- Score: 29.717104109481536
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Feature selection is a widely used dimension reduction technique to select
feature subsets because of its interpretability. Many methods have been
proposed and achieved good results, in which the relationships between adjacent
data points are mainly concerned. But the possible associations between data
pairs that are may not adjacent are always neglected. Different from previous
methods, we propose a novel and very simple approach for unsupervised feature
selection, named MMFS (Multi-step Markov transition probability for Feature
Selection). The idea is using multi-step Markov transition probability to
describe the relation between any data pair. Two ways from the positive and
negative viewpoints are employed respectively to keep the data structure after
feature selection. From the positive viewpoint, the maximum transition
probability that can be reached in a certain number of steps is used to
describe the relation between two points. Then, the features which can keep the
compact data structure are selected. From the viewpoint of negative, the
minimum transition probability that can be reached in a certain number of steps
is used to describe the relation between two points. On the contrary, the
features that least maintain the loose data structure are selected. And the two
ways can also be combined. Thus three algorithms are proposed. Our main
contributions are a novel feature section approach which uses multi-step
transition probability to characterize the data structure, and three algorithms
proposed from the positive and negative aspects for keeping data structure. The
performance of our approach is compared with the state-of-the-art methods on
eight real-world data sets, and the experimental results show that the proposed
MMFS is effective in unsupervised feature selection.
Related papers
- Parallel feature selection based on the trace ratio criterion [4.30274561163157]
This work presents a novel parallel feature selection approach for classification, namely Parallel Feature Selection using Trace criterion (PFST)
Our method uses trace criterion, a measure of class separability used in Fisher's Discriminant Analysis, to evaluate feature usefulness.
The experiments show that our method can produce a small set of features in a fraction of the amount of time by the other methods under comparison.
arXiv Detail & Related papers (2022-03-03T10:50:33Z) - Multiway Non-rigid Point Cloud Registration via Learned Functional Map
Synchronization [105.14877281665011]
We present SyNoRiM, a novel way to register multiple non-rigid shapes by synchronizing the maps relating learned functions defined on the point clouds.
We demonstrate via extensive experiments that our method achieves a state-of-the-art performance in registration accuracy.
arXiv Detail & Related papers (2021-11-25T02:37:59Z) - An Evolutionary Correlation-aware Feature Selection Method for
Classification Problems [3.2550305883611244]
In this paper, an estimation of distribution algorithm is proposed to meet three goals.
Firstly, as an extension of EDA, the proposed method generates only two individuals in each iteration that compete based on a fitness function.
Secondly, we provide a guiding technique for determining the number of features for individuals in each iteration.
As the main contribution of the paper, in addition to considering the importance of each feature alone, the proposed method can consider the interaction between features.
arXiv Detail & Related papers (2021-10-16T20:20:43Z) - Auto-weighted Multi-view Feature Selection with Graph Optimization [90.26124046530319]
We propose a novel unsupervised multi-view feature selection model based on graph learning.
The contributions are threefold: (1) during the feature selection procedure, the consensus similarity graph shared by different views is learned.
Experiments on various datasets demonstrate the superiority of the proposed method compared with the state-of-the-art methods.
arXiv Detail & Related papers (2021-04-11T03:25:25Z) - PointFlow: Flowing Semantics Through Points for Aerial Image
Segmentation [96.76882806139251]
We propose a point-wise affinity propagation module based on the Feature Pyramid Network (FPN) framework, named PointFlow.
Rather than dense affinity learning, a sparse affinity map is generated upon selected points between the adjacent features.
Experimental results on three different aerial segmentation datasets suggest that the proposed method is more effective and efficient than state-of-the-art general semantic segmentation methods.
arXiv Detail & Related papers (2021-03-11T09:42:32Z) - SPL-MLL: Selecting Predictable Landmarks for Multi-Label Learning [87.27700889147144]
We propose to select a small subset of labels as landmarks which are easy to predict according to input (predictable) and can well recover the other possible labels (representative)
We employ the Alternating Direction Method (ADM) to solve our problem. Empirical studies on real-world datasets show that our method achieves superior classification performance over other state-of-the-art methods.
arXiv Detail & Related papers (2020-08-16T11:07:44Z) - A Novel Community Detection Based Genetic Algorithm for Feature
Selection [3.8848561367220276]
Authors propose a genetic algorithm based on community detection, which functions in three steps.
Nine benchmark classification problems were analyzed in terms of the performance of the presented approach.
arXiv Detail & Related papers (2020-08-08T15:39:30Z) - Segment as Points for Efficient Online Multi-Object Tracking and
Segmentation [66.03023110058464]
We propose a highly effective method for learning instance embeddings based on segments by converting the compact image representation to un-ordered 2D point cloud representation.
Our method generates a new tracking-by-points paradigm where discriminative instance embeddings are learned from randomly selected points rather than images.
The resulting online MOTS framework, named PointTrack, surpasses all the state-of-the-art methods by large margins.
arXiv Detail & Related papers (2020-07-03T08:29:35Z) - Infinite Feature Selection: A Graph-based Feature Filtering Approach [78.63188057505012]
We propose a filtering feature selection framework that considers subsets of features as paths in a graph.
Going to infinite allows to constrain the computational complexity of the selection process.
We show that Inf-FS behaves better in almost any situation, that is, when the number of features to keep are fixed a priori.
arXiv Detail & Related papers (2020-06-15T07:20:40Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.