Comparative Study Between Distance Measures On Supervised Optimum-Path
Forest Classification
- URL: http://arxiv.org/abs/2202.03854v1
- Date: Tue, 8 Feb 2022 13:34:09 GMT
- Title: Comparative Study Between Distance Measures On Supervised Optimum-Path
Forest Classification
- Authors: Gustavo Henrique de Rosa, Mateus Roder, Jo\~ao Paulo Papa
- Abstract summary: Optimum-Path Forest (OPF) uses a graph-based methodology and a distance measure to create arcs between nodes and hence sets of trees.
This work proposes a comparative study over a wide range of distance measures applied to the supervised Optimum-Path Forest classification.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Machine Learning has attracted considerable attention throughout the past
decade due to its potential to solve far-reaching tasks, such as image
classification, object recognition, anomaly detection, and data forecasting. A
standard approach to tackle such applications is based on supervised learning,
which is assisted by large sets of labeled data and is conducted by the
so-called classifiers, such as Logistic Regression, Decision Trees, Random
Forests, and Support Vector Machines, among others. An alternative to
traditional classifiers is the parameterless Optimum-Path Forest (OPF), which
uses a graph-based methodology and a distance measure to create arcs between
nodes and hence sets of trees, responsible for conquering the nodes, defining
their labels, and shaping the forests. Nevertheless, its performance is
strongly associated with an appropriate distance measure, which may vary
according to the dataset's nature. Therefore, this work proposes a comparative
study over a wide range of distance measures applied to the supervised
Optimum-Path Forest classification. The experimental results are conducted
using well-known literature datasets and compared across benchmarking
classifiers, illustrating OPF's ability to adapt to distinct domains.
Related papers
- Learning Deep Tree-based Retriever for Efficient Recommendation: Theory and Method [76.31185707649227]
We propose a Deep Tree-based Retriever (DTR) for efficient recommendation.
DTR frames the training task as a softmax-based multi-class classification over tree nodes at the same level.
To mitigate the suboptimality induced by the labeling of non-leaf nodes, we propose a rectification method for the loss function.
arXiv Detail & Related papers (2024-08-21T05:09:53Z) - Downstream-Pretext Domain Knowledge Traceback for Active Learning [138.02530777915362]
We propose a downstream-pretext domain knowledge traceback (DOKT) method that traces the data interactions of downstream knowledge and pre-training guidance.
DOKT consists of a traceback diversity indicator and a domain-based uncertainty estimator.
Experiments conducted on ten datasets show that our model outperforms other state-of-the-art methods.
arXiv Detail & Related papers (2024-07-20T01:34:13Z) - A Satellite Band Selection Framework for Amazon Forest Deforestation Detection Task [0.5825410941577593]
Deforestation and degradation impact millions of hectares annually, necessitating government or private initiatives for effective forest monitoring.
This study introduces a novel framework that employs the Univariate Marginal Distribution Algorithm (UMDA) to select spectral bands from Landsat-8 satellite.
This selection guides a semantic segmentation architecture, DeepLabv3+, enhancing its performance.
arXiv Detail & Related papers (2024-04-03T11:47:20Z) - Adaptive Betweenness Clustering for Semi-Supervised Domain Adaptation [108.40945109477886]
We propose a novel SSDA approach named Graph-based Adaptive Betweenness Clustering (G-ABC) for achieving categorical domain alignment.
Our method outperforms previous state-of-the-art SSDA approaches, demonstrating the superiority of the proposed G-ABC algorithm.
arXiv Detail & Related papers (2024-01-21T09:57:56Z) - A Mathematical Programming Approach to Optimal Classification Forests [1.0705399532413618]
We propose a novel mathematical optimization-based methodology in which a given number of trees are simultaneously constructed.
The classification rule is derived by assigning to each observation its most frequently predicted class among the trees in the forest.
We show that our proposed method has equal or superior performance compared with state-of-the-art tree-based classification methods.
arXiv Detail & Related papers (2022-11-18T20:33:08Z) - A Novel Approach for Optimum-Path Forest Classification Using Fuzzy
Logic [13.313728527879306]
Fuzzy Optimum-Path Forest is an improved version of the standard OPF classifier.
It learns the samples' membership in an unsupervised fashion, which are further incorporated during supervised training.
Experiments conducted over twelve public datasets highlight the robustness of the proposed approach.
arXiv Detail & Related papers (2022-04-13T20:55:30Z) - Cross-Cluster Weighted Forests [4.9873153106566575]
This article considers the effect of ensembling Random Forest learners trained on clusters within a single dataset with heterogeneity in the distribution of the features.
We find that constructing ensembles of forests trained on clusters determined by algorithms such as k-means results in significant improvements in accuracy and generalizability over the traditional Random Forest algorithm.
arXiv Detail & Related papers (2021-05-17T04:58:29Z) - Instance Level Affinity-Based Transfer for Unsupervised Domain
Adaptation [74.71931918541748]
We propose an instance affinity based criterion for source to target transfer during adaptation, called ILA-DA.
We first propose a reliable and efficient method to extract similar and dissimilar samples across source and target, and utilize a multi-sample contrastive loss to drive the domain alignment process.
We verify the effectiveness of ILA-DA by observing consistent improvements in accuracy over popular domain adaptation approaches on a variety of benchmark datasets.
arXiv Detail & Related papers (2021-04-03T01:33:14Z) - TraND: Transferable Neighborhood Discovery for Unsupervised Cross-domain
Gait Recognition [77.77786072373942]
This paper proposes a Transferable Neighborhood Discovery (TraND) framework to bridge the domain gap for unsupervised cross-domain gait recognition.
We design an end-to-end trainable approach to automatically discover the confident neighborhoods of unlabeled samples in the latent space.
Our method achieves state-of-the-art results on two public datasets, i.e., CASIA-B and OU-LP.
arXiv Detail & Related papers (2021-02-09T03:07:07Z) - Robust Similarity and Distance Learning via Decision Forests [8.587164648430251]
We propose a novel decision forest algorithm for the task of distance learning, which we call Similarity and Metric Random Forests (SMERF)
Its ability to approximate arbitrary distances and identify important features is empirically demonstrated on simulated data sets.
arXiv Detail & Related papers (2020-07-27T20:17:42Z) - Towards Fair Cross-Domain Adaptation via Generative Learning [50.76694500782927]
Domain Adaptation (DA) targets at adapting a model trained over the well-labeled source domain to the unlabeled target domain lying in different distributions.
We develop a novel Generative Few-shot Cross-domain Adaptation (GFCA) algorithm for fair cross-domain classification.
arXiv Detail & Related papers (2020-03-04T23:25:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.