A Video-Based Activity Classification of Human Pickers in Agriculture
- URL: http://arxiv.org/abs/2311.10885v1
- Date: Fri, 17 Nov 2023 22:02:42 GMT
- Title: A Video-Based Activity Classification of Human Pickers in Agriculture
- Authors: Abhishesh Pal, Antonio C. Leite, Jon G. O. Gjevestad, P{\aa}l J. From
- Abstract summary: A benchmark model is created for video-based human pickers detection.
Our solution uses the combination of a Mask Region-based Convolutional Neural Network (Mask R-CNN) for object detection and optical flow for motion estimation.
A classification criterion is defined based on the Kernel Density Estimation (KDE) analysis and K-means clustering algorithm.
- Score: 0.4915744683251149
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: In farming systems, harvesting operations are tedious, time- and
resource-consuming tasks. Based on this, deploying a fleet of autonomous robots
to work alongside farmworkers may provide vast productivity and logistics
benefits. Then, an intelligent robotic system should monitor human behavior,
identify the ongoing activities and anticipate the worker's needs. In this
work, the main contribution consists of creating a benchmark model for
video-based human pickers detection, classifying their activities to serve in
harvesting operations for different agricultural scenarios. Our solution uses
the combination of a Mask Region-based Convolutional Neural Network (Mask
R-CNN) for object detection and optical flow for motion estimation with newly
added statistical attributes of flow motion descriptors, named as Correlation
Sensitivity (CS). A classification criterion is defined based on the Kernel
Density Estimation (KDE) analysis and K-means clustering algorithm, which are
implemented upon in-house collected dataset from different crop fields like
strawberry polytunnels and apple tree orchards. The proposed framework is
quantitatively analyzed using sensitivity, specificity, and accuracy measures
and shows satisfactory results amidst various dataset challenges such as
lighting variation, blur, and occlusions.
Related papers
- Neural Clustering based Visual Representation Learning [61.72646814537163]
Clustering is one of the most classic approaches in machine learning and data analysis.
We propose feature extraction with clustering (FEC), which views feature extraction as a process of selecting representatives from data.
FEC alternates between grouping pixels into individual clusters to abstract representatives and updating the deep features of pixels with current representatives.
arXiv Detail & Related papers (2024-03-26T06:04:50Z) - Self-Supervised Neuron Segmentation with Multi-Agent Reinforcement
Learning [53.00683059396803]
Mask image model (MIM) has been widely used due to its simplicity and effectiveness in recovering original information from masked images.
We propose a decision-based MIM that utilizes reinforcement learning (RL) to automatically search for optimal image masking ratio and masking strategy.
Our approach has a significant advantage over alternative self-supervised methods on the task of neuron segmentation.
arXiv Detail & Related papers (2023-10-06T10:40:46Z) - Machine Learning to detect cyber-attacks and discriminating the types of
power system disturbances [0.0]
This research proposes a machine learning-based attack detection model for power systems, specifically targeting smart grids.
By utilizing data and logs collected from Phasor Measuring Devices (PMUs), the model aims to learn system behaviors and effectively identify potential security boundaries.
arXiv Detail & Related papers (2023-07-06T22:32:06Z) - A Semi-supervised Approach for Activity Recognition from Indoor
Trajectory Data [0.822021749810331]
We consider the task of classifying the activities of moving objects from their noisy indoor trajectory data in a collaborative manufacturing environment.
We present a semi-supervised machine learning approach that first applies an information theoretic criterion to partition a long trajectory into a set of segments.
The segments are then labelled automatically based on a constrained hierarchical clustering method.
arXiv Detail & Related papers (2023-01-09T01:20:50Z) - Scalable Intervention Target Estimation in Linear Models [52.60799340056917]
Current approaches to causal structure learning either work with known intervention targets or use hypothesis testing to discover the unknown intervention targets.
This paper proposes a scalable and efficient algorithm that consistently identifies all intervention targets.
The proposed algorithm can be used to also update a given observational Markov equivalence class into the interventional Markov equivalence class.
arXiv Detail & Related papers (2021-11-15T03:16:56Z) - TRiPOD: Human Trajectory and Pose Dynamics Forecasting in the Wild [77.59069361196404]
TRiPOD is a novel method for predicting body dynamics based on graph attentional networks.
To incorporate a real-world challenge, we learn an indicator representing whether an estimated body joint is visible/invisible at each frame.
Our evaluation shows that TRiPOD outperforms all prior work and state-of-the-art specifically designed for each of the trajectory and pose forecasting tasks.
arXiv Detail & Related papers (2021-04-08T20:01:00Z) - Federated Learning with Heterogeneous Labels and Models for Mobile
Activity Monitoring [0.7106986689736827]
On-device Federated Learning proves to be an effective approach for distributed and collaborative machine learning.
We propose a framework for federated label-based aggregation, which leverages overlapping information gain across activities.
Empirical evaluation with the Heterogeneity Human Activity Recognition (HHAR) dataset on Raspberry Pi 2 indicates an average deterministic accuracy increase of at least 11.01%.
arXiv Detail & Related papers (2020-12-04T11:44:17Z) - AIRSENSE-TO-ACT: A Concept Paper for COVID-19 Countermeasures based on
Artificial Intelligence algorithms and multi-sources Data Processing [0.0]
This paper describes a new tool to support institutions in the implementation of targeted countermeasures, based on quantitative and multi-scale elements, for the fight and prevention of emergencies, such as the current COVID-19 pandemic.
The tool is a centralized system (web application), single multi-user platform, which relies on Artificial Intelligence (AI) algorithms for the processing of heterogeneous data, and which can produce an output level of risk.
The model includes a specific neural network which will be first trained to learn the correlation between selected inputs, related to the case of interest: environmental variables (chemical-physical, such as meteorological), human activity
arXiv Detail & Related papers (2020-11-07T17:50:14Z) - Estimating Structural Target Functions using Machine Learning and
Influence Functions [103.47897241856603]
We propose a new framework for statistical machine learning of target functions arising as identifiable functionals from statistical models.
This framework is problem- and model-agnostic and can be used to estimate a broad variety of target parameters of interest in applied statistics.
We put particular focus on so-called coarsening at random/doubly robust problems with partially unobserved information.
arXiv Detail & Related papers (2020-08-14T16:48:29Z) - A Trainable Optimal Transport Embedding for Feature Aggregation and its
Relationship to Attention [96.77554122595578]
We introduce a parametrized representation of fixed size, which embeds and then aggregates elements from a given input set according to the optimal transport plan between the set and a trainable reference.
Our approach scales to large datasets and allows end-to-end training of the reference, while also providing a simple unsupervised learning mechanism with small computational cost.
arXiv Detail & Related papers (2020-06-22T08:35:58Z) - Clustering with Fast, Automated and Reproducible assessment applied to
longitudinal neural tracking [3.817161834189992]
C-FAR is a novel method for Fast, Automated and Reproducible assessment of hierarchical clustering algorithms simultaneously.
Our algorithm takes any number of hierarchical clustering trees as input, then strategically queries pairs for human feedback, and outputs an optimal clustering among those nominated by these trees.
Our flagship application is the cluster aggregation step in spike-sorting, the task of assigning waveforms (spikes) in recordings to neurons.
arXiv Detail & Related papers (2020-03-19T01:33:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.