Related papers: Automated Classification of Dry Bean Varieties Using XGBoost and SVM Models

Soil Compaction Parameters Prediction Based on Automated Machine Learning Approach [0.0]
This study proposes an automated machine learning (AutoML) approach to predict optimum moisture content (OMC) and maximum dry density (MDD). The study found that the Extreme Gradient Boosting (XGBoost) algorithm provided the best performance, achieving R-squared values of 80.4% for MDD and 89.1% for OMC on a separate dataset.
arXiv Detail & Related papers (2025-12-09T08:13:04Z)
Enhancing Sentiment Classification with Machine Learning and Combinatorial Fusion [41.99844472131922]
This paper presents a novel approach to sentiment classification using the application of Combinatorial Fusion Analysis (CFA). CFA leverages the concept of cognitive diversity, which utilizes rank-score characteristic functions to quantify the dissimilarity between models and strategically combine their predictions. Experimental results also indicate that CFA outperforms traditional ensemble methods by effectively computing and employing model diversity.
arXiv Detail & Related papers (2025-10-30T21:30:30Z)
Intrinsic Explainability of Multimodal Learning for Crop Yield Prediction [36.766406330345525]
We leverage the intrinsic explainability of Transformer-based models to explain multimodal learning networks. This study focuses on the task of crop yield prediction at the subfield level.
arXiv Detail & Related papers (2025-08-09T11:09:10Z)
Honey Classification using Hyperspectral Imaging and Machine Learning [0.0]
We use a class transformation method in the dataset preparation phase to maximize the separability across classes. The feature extraction phase employs the Linear Discriminant Analysis (LDA) technique for extracting relevant features. In the classification phase, we use Support Vector Machines (SVM) and K-Nearest Neighbors (KNN) models to classify the extracted features into their botanical origins.
arXiv Detail & Related papers (2025-08-01T06:45:42Z)
Agentomics-ML: Autonomous Machine Learning Experimentation Agent for Genomic and Transcriptomic Data [33.7054351451505]
We introduce Agentomics-ML, a fully autonomous agent-based system designed to produce a classification model. We show that Agentomics-ML outperforms existing state-of-the-art agent-based methods in both generalization and success rates.
arXiv Detail & Related papers (2025-06-05T19:44:38Z)
Ustnlp16 at SemEval-2025 Task 9: Improving Model Performance through Imbalance Handling and Focal Loss [38.70308073598037]
Classification tasks often suffer from severe class imbalances, short and unstructured text, and overlapping semantic categories. We present our system for SemEval-2025 Task 9: Food Hazard Detection, which addresses these issues by applying data augmentation techniques to improve classification performance.
arXiv Detail & Related papers (2025-04-24T16:35:44Z)
Gradient-Optimized Fuzzy Classifier: A Benchmark Study Against State-of-the-Art Models [0.0]
This paper presents a performance benchmarking study of a Gradient-Optimized Fuzzy Inference System (GF) against several state-of-the-art machine learning models. Results demonstrate that the GF model achieved competitive, and in several cases superior, classification accuracy while maintaining high precision and exceptionally low training times. These findings support the potential of gradient-optimized fuzzy systems as interpretable, efficient, and adaptable alternatives to more complex deep learning models in supervised learning tasks.
arXiv Detail & Related papers (2025-04-22T20:47:06Z)
GBFRS: Robust Fuzzy Rough Sets via Granular-ball Computing [48.33779268699777]
Fuzzy rough set theory is effective for processing datasets with complex attributes. Most existing models operate at the finest granularity, rendering them inefficient and sensitive to noise. This paper proposes integrating multi-granularity granular-ball computing into fuzzy rough set theory, using granular-balls to replace sample points.
arXiv Detail & Related papers (2025-01-30T15:09:26Z)
A Robust Support Vector Machine Approach for Raman COVID-19 Data Classification [0.7864304771129751]
In this paper, we investigate the performance of a novel robust formulation for Support Vector Machine (SVM) in classifying COVID-19 samples obtained from Raman spectroscopy. We derive robust counterpart models of deterministic formulations using bounded-by-norm uncertainty sets around each observation. The effectiveness of our approach is validated on real-world COVID-19 datasets provided by Italian hospitals.
arXiv Detail & Related papers (2025-01-29T14:02:45Z)
Artificial Liver Classifier: A New Alternative to Conventional Machine Learning Models [4.395397502990339]
This paper introduces the Artificial Liver Classifier (ALC), a novel supervised learning classifier inspired by the human liver's detoxification function. The ALC is characterized by its simplicity, speed, freedom from hyperparameters, ability to reduce overfitting, and effectiveness in addressing multi-classification problems. It was evaluated on five benchmark machine learning datasets: Iris Flower, Breast Cancer Wisconsin, Wine, Voice Gender, and MNIST.
arXiv Detail & Related papers (2025-01-14T12:42:01Z)
Predictive Maintenance Study for High-Pressure Industrial Compressors: Hybrid Clustering Models [39.58317527488534]
Clustering algorithms were evaluated using quality metrics like Normalized Mutual Information (NMI) and Adjusted Rand Index (ARI). These features enriched regression models, improving failure detection accuracy by 4.87 percent on average. Cross-validation and key performance metrics confirmed the benefits of clustering-based features in predictive maintenance models.
arXiv Detail & Related papers (2024-11-21T08:14:26Z)
Machine Learning Approaches on Crop Pattern Recognition a Comparative Analysis [0.0]
Time series remote sensing data were used to generate the cropping pattern. Classification algorithms are used to classify crop patterns and map agricultural land use. In this paper, we propose Deep Neural Network (DNN) based classification to improve the performance of crop pattern recognition.
arXiv Detail & Related papers (2024-11-19T17:19:20Z)
AgEval: A Benchmark for Zero-Shot and Few-Shot Plant Stress Phenotyping with Multimodal LLMs [19.7240633020344]
AgEval is a benchmark comprising 12 diverse plant stress phenotyping tasks. Our study assesses zero-shot and few-shot in-context learning performance of state-of-the-art models.
arXiv Detail & Related papers (2024-07-29T00:39:51Z)
Predictive Analytics of Varieties of Potatoes [2.336821989135698]
We explore the application of machine learning algorithms specifically to enhance the selection process of Russet potato clones in breeding trials. This study addresses the challenge of efficiently identifying high-yield, disease-resistant, and climate-resilient potato varieties.
arXiv Detail & Related papers (2024-04-04T00:49:05Z)
Extension of Transformational Machine Learning: Classification Problems [0.0]
This study explores the application and performance of Transformational Machine Learning (TML) in drug discovery. TML, a meta learning algorithm, excels in exploiting common attributes across various domains. The drug discovery process, which is complex and time-consuming, can benefit greatly from the enhanced prediction accuracy.
arXiv Detail & Related papers (2023-08-07T07:34:18Z)
Benchmarking the Effectiveness of Classification Algorithms and SVM Kernels for Dry Beans [0.6263481844384227]
This study analyses different Support Vector Machine (SVM) classification algorithms, namely linear, and radial basis function (RBF). The analysis is performed on the Dry Bean dataset, with PCA (Principal Component Analysis) conducted as a preprocessing step for dimensionality reduction. The RBF SVM kernel algorithm achieves the highest Accuracy of 93.34%, Precision of 92.61%, Recall of 92.35% and F1 Score of 91.40%.
arXiv Detail & Related papers (2023-07-15T18:13:29Z)
PruMUX: Augmenting Data Multiplexing with Model Compression [42.89593283051397]
In this paper, we combine two such methods -- structured pruning and data multiplexing -- to compound the speedup gains obtained by either method. Our approach, PruMUX, obtains up to 7.5-29.5X throughput improvement over BERT-base model with accuracy threshold from 80% to 74%. We propose Auto-PruMUX, a meta-level model that can predict the high-performance parameters for pruning and multiplexing given a desired accuracy loss budget.
arXiv Detail & Related papers (2023-05-24T04:22:38Z)
Boosting Out-of-Distribution Detection with Multiple Pre-trained Models [41.66566916581451]
Post hoc detection utilizing pre-trained models has shown promising performance and can be scaled to large-scale problems. We propose a detection enhancement method by ensembling multiple detection decisions derived from a zoo of pre-trained models. Our method substantially improves the relative performance by 65.40% and 26.96% on the CIFAR10 and ImageNet benchmarks.
arXiv Detail & Related papers (2022-12-24T12:11:38Z)
Towards Automated Imbalanced Learning with Deep Hierarchical Reinforcement Learning [57.163525407022966]
Imbalanced learning is a fundamental challenge in data mining, where there is a disproportionate ratio of training samples in each class. Over-sampling is an effective technique to tackle imbalanced learning through generating synthetic samples for the minority class. We propose AutoSMOTE, an automated over-sampling algorithm that can jointly optimize different levels of decisions.
arXiv Detail & Related papers (2022-08-26T04:28:01Z)
SelectAugment: Hierarchical Deterministic Sample Selection for Data Augmentation [72.58308581812149]
We propose an effective approach, dubbed SelectAugment, to select samples to be augmented in a deterministic and online manner. Specifically, in each batch, we first determine the augmentation ratio, and then decide whether to augment each training sample under this ratio. In this way, the negative effects of the randomness in selecting samples to augment can be effectively alleviated and the effectiveness of DA is improved.
arXiv Detail & Related papers (2021-12-06T08:38:38Z)
Guiding Generative Language Models for Data Augmentation in Few-Shot Text Classification [59.698811329287174]
We leverage GPT-2 for generating artificial training instances in order to improve classification performance. Our results show that fine-tuning GPT-2 in a handful of label instances leads to consistent classification improvements.
arXiv Detail & Related papers (2021-11-17T12:10:03Z)
Active Hybrid Classification [79.02441914023811]
This paper shows how crowd and machines can support each other in tackling classification problems. We propose an architecture that orchestrates active learning and crowd classification and combines them in a virtuous cycle.
arXiv Detail & Related papers (2021-01-21T21:09:07Z)
GIM: Gaussian Isolation Machines [40.7916016364212]
In many cases, neural network classifiers are exposed to input data that is outside of their training distribution data. We present a novel hybrid (generative-discriminative) classifier aimed at solving the problem arising when OOD data is encountered. The proposed GIM's novelty lies in its discriminative performance and generative capabilities, a combination of characteristics not usually seen in a single classifier.
arXiv Detail & Related papers (2020-02-06T09:51:47Z)

This list is automatically generated from the titles and abstracts of the papers on this site.