The Role of Cross-Silo Federated Learning in Facilitating Data Sharing
in the Agri-Food Sector
- URL: http://arxiv.org/abs/2104.07468v2
- Date: Thu, 4 May 2023 14:41:08 GMT
- Title: The Role of Cross-Silo Federated Learning in Facilitating Data Sharing
in the Agri-Food Sector
- Authors: Aiden Durrant, Milan Markovic, David Matthews, David May, Jessica
Enright and Georgios Leontidis
- Abstract summary: Data sharing remains a major hindering factor when it comes to adopting emerging AI technologies in the agri-food sector.
We propose a technical solution based on federated learning that uses decentralized data.
Our results demonstrate that our approach performs better than each of the models trained on an individual data source.
- Score: 5.219568203653523
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Data sharing remains a major hindering factor when it comes to adopting
emerging AI technologies in general, but particularly in the agri-food sector.
Protectiveness of data is natural in this setting; data is a precious commodity
for data owners, which if used properly can provide them with useful insights
on operations and processes leading to a competitive advantage. Unfortunately,
novel AI technologies often require large amounts of training data in order to
perform well, something that in many scenarios is unrealistic. However, recent
machine learning advances, e.g. federated learning and privacy-preserving
technologies, can offer a solution to this issue via providing the
infrastructure and underpinning technologies needed to use data from various
sources to train models without ever sharing the raw data themselves. In this
paper, we propose a technical solution based on federated learning that uses
decentralized data, (i.e. data that are not exchanged or shared but remain with
the owners) to develop a cross-silo machine learning model that facilitates
data sharing across supply chains. We focus our data sharing proposition on
improving production optimization through soybean yield prediction, and provide
potential use-cases that such methods can assist in other problem settings. Our
results demonstrate that our approach not only performs better than each of the
models trained on an individual data source, but also that data sharing in the
agri-food sector can be enabled via alternatives to data exchange, whilst also
helping to adopt emerging machine learning technologies to boost productivity.
Related papers
- Generative AI like ChatGPT in Blockchain Federated Learning: use cases, opportunities and future [4.497001527881303]
This research explores potential integrations of generative AI in federated learning.
generative adversarial networks (GANs) and variational autoencoders (VAEs)
Generating synthetic data helps federated learning address challenges related to limited data availability.
arXiv Detail & Related papers (2024-07-25T19:43:49Z) - The Frontier of Data Erasure: Machine Unlearning for Large Language Models [56.26002631481726]
Large Language Models (LLMs) are foundational to AI advancements.
LLMs pose risks by potentially memorizing and disseminating sensitive, biased, or copyrighted information.
Machine unlearning emerges as a cutting-edge solution to mitigate these concerns.
arXiv Detail & Related papers (2024-03-23T09:26:15Z) - FRAMU: Attention-based Machine Unlearning using Federated Reinforcement
Learning [16.86560475992975]
We introduce Attention-based Machine Unlearning using Federated Reinforcement Learning (FRAMU)
FRAMU incorporates adaptive learning mechanisms, privacy preservation techniques, and optimization strategies.
Our experiments, conducted on both single-modality and multi-modality datasets, revealed that FRAMU significantly outperformed baseline models.
arXiv Detail & Related papers (2023-09-19T03:13:17Z) - The Dimensions of Data Labor: A Road Map for Researchers, Activists, and
Policymakers to Empower Data Producers [14.392208044851976]
Data producers have little say in what data is captured, how it is used, or who it benefits.
Organizations with the ability to access and process this data, e.g. OpenAI and Google, possess immense power in shaping the technology landscape.
By synthesizing related literature that reconceptualizes the production of data for computing as data labor'', we outline opportunities for researchers, policymakers, and activists to empower data producers.
arXiv Detail & Related papers (2023-05-22T17:11:22Z) - Towards Generalizable Data Protection With Transferable Unlearnable
Examples [50.628011208660645]
We present a novel, generalizable data protection method by generating transferable unlearnable examples.
To the best of our knowledge, this is the first solution that examines data privacy from the perspective of data distribution.
arXiv Detail & Related papers (2023-05-18T04:17:01Z) - Decentralized Learning with Multi-Headed Distillation [12.90857834791378]
Decentralized learning with private data is a central problem in machine learning.
We propose a novel distillation-based decentralized learning technique that allows multiple agents with private non-iid data to learn from each other.
arXiv Detail & Related papers (2022-11-28T21:01:43Z) - Privacy-Preserving Machine Learning for Collaborative Data Sharing via
Auto-encoder Latent Space Embeddings [57.45332961252628]
Privacy-preserving machine learning in data-sharing processes is an ever-critical task.
This paper presents an innovative framework that uses Representation Learning via autoencoders to generate privacy-preserving embedded data.
arXiv Detail & Related papers (2022-11-10T17:36:58Z) - Asynchronous Collaborative Learning Across Data Silos [9.094748832034746]
We propose a framework to enable asynchronous collaborative training of machine learning models across data silos.
This allows data science teams to collaboratively train a machine learning model, without sharing data with one another.
arXiv Detail & Related papers (2022-03-23T18:00:19Z) - Deep Reinforcement Learning Assisted Federated Learning Algorithm for
Data Management of IIoT [82.33080550378068]
The continuous expanded scale of the industrial Internet of Things (IIoT) leads to IIoT equipments generating massive amounts of user data every moment.
How to manage these time series data in an efficient and safe way in the field of IIoT is still an open issue.
This paper studies the FL technology applications to manage IIoT equipment data in wireless network environments.
arXiv Detail & Related papers (2022-02-03T07:12:36Z) - From Distributed Machine Learning to Federated Learning: A Survey [49.7569746460225]
Federated learning emerges as an efficient approach to exploit distributed data and computing resources.
We propose a functional architecture of federated learning systems and a taxonomy of related techniques.
We present the distributed training, data communication, and security of FL systems.
arXiv Detail & Related papers (2021-04-29T14:15:11Z) - DeGAN : Data-Enriching GAN for Retrieving Representative Samples from a
Trained Classifier [58.979104709647295]
We bridge the gap between the abundance of available data and lack of relevant data, for the future learning tasks of a trained network.
We use the available data, that may be an imbalanced subset of the original training dataset, or a related domain dataset, to retrieve representative samples.
We demonstrate that data from a related domain can be leveraged to achieve state-of-the-art performance.
arXiv Detail & Related papers (2019-12-27T02:05:45Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.