Interpretable collaborative data analysis on distributed data
- URL: http://arxiv.org/abs/2011.04437v1
- Date: Mon, 9 Nov 2020 13:59:32 GMT
- Title: Interpretable collaborative data analysis on distributed data
- Authors: Akira Imakura, Hiroaki Inaba, Yukihiko Okada, Tetsuya Sakurai
- Abstract summary: This paper proposes an interpretable non-model sharing collaborative data analysis method as one of the federated learning systems.
By centralizing intermediate representations, which are individually constructed in each party, the proposed method obtains an interpretable model.
Numerical experiments indicate that the proposed method achieves better recognition performance for artificial and real-world problems than individual analysis.
- Score: 9.434133337939498
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper proposes an interpretable non-model sharing collaborative data
analysis method as one of the federated learning systems, which is an emerging
technology to analyze distributed data. Analyzing distributed data is essential
in many applications such as medical, financial, and manufacturing data
analyses due to privacy, and confidentiality concerns. In addition,
interpretability of the obtained model has an important role for practical
applications of the federated learning systems. By centralizing intermediate
representations, which are individually constructed in each party, the proposed
method obtains an interpretable model, achieving a collaborative analysis
without revealing the individual data and learning model distributed over local
parties. Numerical experiments indicate that the proposed method achieves
better recognition performance for artificial and real-world problems than
individual analysis.
Related papers
- Data Analysis in the Era of Generative AI [56.44807642944589]
This paper explores the potential of AI-powered tools to reshape data analysis, focusing on design considerations and challenges.
We explore how the emergence of large language and multimodal models offers new opportunities to enhance various stages of data analysis workflow.
We then examine human-centered design principles that facilitate intuitive interactions, build user trust, and streamline the AI-assisted analysis workflow across multiple apps.
arXiv Detail & Related papers (2024-09-27T06:31:03Z) - Information Fusion for Assistance Systems in Production Assessment [49.40442046458756]
We provide a framework for the fusion of n number of information sources using the evidence theory.
We provide a methodology for the information fusion of two primary sources: an ensemble classifier based on machine data and an expert-centered model.
We address the problem of data drift by proposing a methodology to update the data-based models using an evidence theory approach.
arXiv Detail & Related papers (2023-08-31T22:08:01Z) - Tackling Computational Heterogeneity in FL: A Few Theoretical Insights [68.8204255655161]
We introduce and analyse a novel aggregation framework that allows for formalizing and tackling computational heterogeneous data.
Proposed aggregation algorithms are extensively analyzed from a theoretical, and an experimental prospective.
arXiv Detail & Related papers (2023-07-12T16:28:21Z) - Distributed sequential federated learning [0.0]
We develop a data-driven method for efficiently and effectively aggregating valued information by analyzing local data.
We use numerical studies of simulated data and an application to COVID-19 data collected from 32 hospitals in Mexico.
arXiv Detail & Related papers (2023-01-31T21:20:45Z) - Non-readily identifiable data collaboration analysis for multiple
datasets including personal information [7.315551060433141]
Data confidentiality and cross-institutional communication are critical for medical datasets.
In this study, the identifiability of the data collaboration analysis is investigated.
The proposed method exhibits a non-readily identifiability while maintaining a high recognition performance.
arXiv Detail & Related papers (2022-08-31T03:19:17Z) - Another Use of SMOTE for Interpretable Data Collaboration Analysis [8.143750358586072]
Data collaboration (DC) analysis has been developed for privacy-preserving integrated analysis across multiple institutions.
This study proposes an anchor data construction technique to improve the recognition performance without increasing the risk of data leakage.
arXiv Detail & Related papers (2022-08-26T06:39:13Z) - DRFLM: Distributionally Robust Federated Learning with Inter-client
Noise via Local Mixup [58.894901088797376]
federated learning has emerged as a promising approach for training a global model using data from multiple organizations without leaking their raw data.
We propose a general framework to solve the above two challenges simultaneously.
We provide comprehensive theoretical analysis including robustness analysis, convergence analysis, and generalization ability.
arXiv Detail & Related papers (2022-04-16T08:08:29Z) - A Field Guide to Federated Optimization [161.3779046812383]
Federated learning and analytics are a distributed approach for collaboratively learning models (or statistics) from decentralized data.
This paper provides recommendations and guidelines on formulating, designing, evaluating and analyzing federated optimization algorithms.
arXiv Detail & Related papers (2021-07-14T18:09:08Z) - A Tree-based Federated Learning Approach for Personalized Treatment
Effect Estimation from Heterogeneous Data Sources [5.049057348282933]
Federated learning is an appealing framework for analyzing sensitive data from distributed health data networks.
We develop an efficient and interpretable tree-based ensemble of personalized treatment effect estimators to join results across hospital sites.
arXiv Detail & Related papers (2021-03-10T18:51:30Z) - FedH2L: Federated Learning with Model and Statistical Heterogeneity [75.61234545520611]
Federated learning (FL) enables distributed participants to collectively learn a strong global model without sacrificing their individual data privacy.
We introduce FedH2L, which is agnostic to both the model architecture and robust to different data distributions across participants.
In contrast to approaches sharing parameters or gradients, FedH2L relies on mutual distillation, exchanging only posteriors on a shared seed set between participants in a decentralized manner.
arXiv Detail & Related papers (2021-01-27T10:10:18Z) - Accuracy and Privacy Evaluations of Collaborative Data Analysis [4.987315310656657]
A collaborative data analysis through sharing dimensionality reduced representations of data has been proposed as a non-model sharing-type federated learning.
This paper analyzes the accuracy and privacy evaluations of this novel framework.
arXiv Detail & Related papers (2021-01-27T00:38:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.