A partition-based similarity for classification distributions
        - URL: http://arxiv.org/abs/2011.06557v1
- Date: Thu, 12 Nov 2020 18:21:11 GMT
- Title: A partition-based similarity for classification distributions
- Authors: Hayden S. Helm, Ronak D. Mehta, Brandon Duderstadt, Weiwei Yang,
  Christoper M. White, Ali Geisa, Joshua T. Vogelstein, Carey E. Priebe
- Abstract summary: We define a measure of similarity between classification distributions that is both principled from the perspective of statistical pattern recognition and useful from the perspective of machine learning practitioners.
We propose a novel similarity on classification distributions, dubbed task similarity, that quantifies how an optimally-transformed optimal representation for a source distribution performs when applied to inference related to a target distribution.
- Score: 11.877906044513272
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract:   Herein we define a measure of similarity between classification distributions
that is both principled from the perspective of statistical pattern recognition
and useful from the perspective of machine learning practitioners. In
particular, we propose a novel similarity on classification distributions,
dubbed task similarity, that quantifies how an optimally-transformed optimal
representation for a source distribution performs when applied to inference
related to a target distribution. The definition of task similarity allows for
natural definitions of adversarial and orthogonal distributions. We highlight
limiting properties of representations induced by (universally) consistent
decision rules and demonstrate in simulation that an empirical estimate of task
similarity is a function of the decision rule deployed for inference. We
demonstrate that for a given target distribution, both transfer efficiency and
semantic similarity of candidate source distributions correlate with empirical
task similarity.
 
      
        Related papers
        - When Does Closeness in Distribution Imply Representational Similarity?   An Identifiability Perspective [9.871955852117912]
 We prove that a small Kullback-Leibler divergence between the model distributions does not guarantee that the corresponding representations are similar.<n>We then define a distributional distance for which closeness implies representational similarity.<n>In synthetic experiments, we find that wider networks learn distributions which are closer with respect to our distance and have more similar representations.
 arXiv  Detail & Related papers  (2025-06-04T09:44:22Z)
- A Connection Between Learning to Reject and Bhattacharyya Divergences [57.942664964198286]
 We consider learning a joint ideal distribution over both inputs and labels.<n>We develop a link between rejection and thresholding different statistical divergences.<n>In general, we find that rejecting via a Bhattacharyya divergence is less aggressive than Chow's Rule.
 arXiv  Detail & Related papers  (2025-05-08T14:18:42Z)
- Counterfactual Realizability [52.85109506684737]
 We introduce a formal definition of realizability, the ability to draw samples from a distribution, and then develop a complete algorithm to determine whether an arbitrary counterfactual distribution is realizable.
We illustrate the implications of this new framework for counterfactual data collection using motivating examples from causal fairness and causal reinforcement learning.
 arXiv  Detail & Related papers  (2025-03-14T20:54:27Z)
- Enhancing Model Fairness and Accuracy with Similarity Networks: A   Methodological Approach [0.20718016474717196]
 We use different techniques to map instances into a similarity feature space.
Our method's ability to adjust the resolution of pairwise similarity provides clear insights into the relationship between the dataset classification complexity and model fairness.
 arXiv  Detail & Related papers  (2024-11-08T15:43:01Z)
- Cluster-Aware Similarity Diffusion for Instance Retrieval [64.40171728912702]
 Diffusion-based re-ranking is a common method used for retrieving instances by performing similarity propagation in a nearest neighbor graph.
We propose a novel Cluster-Aware Similarity (CAS) diffusion for instance retrieval.
 arXiv  Detail & Related papers  (2024-06-04T14:19:50Z)
- Distributed Markov Chain Monte Carlo Sampling based on the Alternating
  Direction Method of Multipliers [143.6249073384419]
 In this paper, we propose a distributed sampling scheme based on the alternating direction method of multipliers.
We provide both theoretical guarantees of our algorithm's convergence and experimental evidence of its superiority to the state-of-the-art.
In simulation, we deploy our algorithm on linear and logistic regression tasks and illustrate its fast convergence compared to existing gradient-based methods.
 arXiv  Detail & Related papers  (2024-01-29T02:08:40Z)
- Transductive conformal inference with adaptive scores [3.591224588041813]
 We consider the transductive setting, where decisions are made on a test sample of $m$ new points.
We show that their joint distribution follows a P'olya urn model, and establish a concentration inequality for their empirical distribution function.
We demonstrate the usefulness of these theoretical results through uniform, in-probability guarantees for two machine learning tasks.
 arXiv  Detail & Related papers  (2023-10-27T12:48:30Z)
- Counting Like Human: Anthropoid Crowd Counting on Modeling the
  Similarity of Objects [92.80955339180119]
 mainstream crowd counting methods regress density map and integrate it to obtain counting results.
Inspired by this, we propose a rational and anthropoid crowd counting framework.
 arXiv  Detail & Related papers  (2022-12-02T07:00:53Z)
- Evaluation of taxonomic and neural embedding methods for calculating
  semantic similarity [0.0]
 We study the mechanisms between taxonomic and distributional similarity measures.
We find that taxonomic similarity measures can depend on the shortest path length as a prime factor to predict semantic similarity.
The synergy of retrofitting neural embeddings with concept relations in similarity prediction may indicate a new trend to leverage knowledge bases on transfer learning.
 arXiv  Detail & Related papers  (2022-09-30T02:54:21Z)
- Fairness Transferability Subject to Bounded Distribution Shift [5.62716254065607]
 Given an algorithmic predictor that is "fair" on some source distribution, will it still be fair on an unknown target distribution that differs from the source within some bound?
We study the transferability of statistical group fairness for machine learning predictors subject to bounded distribution shifts.
 arXiv  Detail & Related papers  (2022-05-31T22:16:44Z)
- Concurrent Discrimination and Alignment for Self-Supervised Feature
  Learning [52.213140525321165]
 Existing self-supervised learning methods learn by means of pretext tasks which are either (1) discriminating that explicitly specify which features should be separated or (2) aligning that precisely indicate which features should be closed together.
In this work, we combine the positive aspects of the discriminating and aligning methods, and design a hybrid method that addresses the above issue.
Our method explicitly specifies the repulsion and attraction mechanism respectively by discriminative predictive task and concurrently maximizing mutual information between paired views.
Our experiments on nine established benchmarks show that the proposed model consistently outperforms the existing state-of-the-art results of self-supervised and transfer
 arXiv  Detail & Related papers  (2021-08-19T09:07:41Z)
- Personalized Trajectory Prediction via Distribution Discrimination [78.69458579657189]
 Trarimiy prediction is confronted with the dilemma to capture the multi-modal nature of future dynamics.
We present a distribution discrimination (DisDis) method to predict personalized motion patterns.
Our method can be integrated with existing multi-modal predictive models as a plug-and-play module.
 arXiv  Detail & Related papers  (2021-07-29T17:42:12Z)
- Robust Bayesian Classification Using an Optimistic Score Ratio [18.047694351309204]
 We use an optimistic score ratio for robust binary classification when there is limited information on the class-conditional, or contextual, distribution.
The optimistic score searches for the distribution that is most plausible to explain the observed outcomes in the testing sample.
 arXiv  Detail & Related papers  (2020-07-08T22:25:29Z)
- A Distributional Analysis of Sampling-Based Reinforcement Learning
  Algorithms [67.67377846416106]
 We present a distributional approach to theoretical analyses of reinforcement learning algorithms for constant step-sizes.
We show that value-based methods such as TD($lambda$) and $Q$-Learning have update rules which are contractive in the space of distributions of functions.
 arXiv  Detail & Related papers  (2020-03-27T05:13:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.