Label-Free Model Evaluation with Semi-Structured Dataset Representations
- URL: http://arxiv.org/abs/2112.00694v1
- Date: Wed, 1 Dec 2021 18:15:58 GMT
- Title: Label-Free Model Evaluation with Semi-Structured Dataset Representations
- Authors: Xiaoxiao Sun, Yunzhong Hou, Hongdong Li, Liang Zheng
- Abstract summary: Label-free model evaluation, or AutoEval, estimates model accuracy on unlabeled test sets.
In the absence of image labels, based on dataset representations, we estimate model performance for AutoEval with regression.
We propose a new semi-structured dataset representation that is manageable for regression learning while containing rich information for AutoEval.
- Score: 78.54590197704088
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Label-free model evaluation, or AutoEval, estimates model accuracy on
unlabeled test sets, and is critical for understanding model behaviors in
various unseen environments. In the absence of image labels, based on dataset
representations, we estimate model performance for AutoEval with regression. On
the one hand, image feature is a straightforward choice for such
representations, but it hampers regression learning due to being unstructured
(\ie no specific meanings for component at certain location) and of
large-scale. On the other hand, previous methods adopt simple structured
representations (like average confidence or average feature), but insufficient
to capture the data characteristics given their limited dimensions. In this
work, we take the best of both worlds and propose a new semi-structured dataset
representation that is manageable for regression learning while containing rich
information for AutoEval. Based on image features, we integrate distribution
shapes, clusters, and representative samples for a semi-structured dataset
representation. Besides the structured overall description with distribution
shapes, the unstructured description with clusters and representative samples
include additional fine-grained information facilitating the AutoEval task. On
three existing datasets and 25 newly introduced ones, we experimentally show
that the proposed representation achieves competitive results. Code and dataset
are available at
https://github.com/sxzrt/Semi-Structured-Dataset-Representations.
Related papers
- Diffusion Models as Data Mining Tools [87.77999285241219]
This paper demonstrates how to use generative models trained for image synthesis as tools for visual data mining.
We show that after finetuning conditional diffusion models to synthesize images from a specific dataset, we can use these models to define a typicality measure.
This measure assesses how typical visual elements are for different data labels, such as geographic location, time stamps, semantic labels, or even the presence of a disease.
arXiv Detail & Related papers (2024-07-20T17:14:31Z) - TRIAGE: Characterizing and auditing training data for improved
regression [80.11415390605215]
We introduce TRIAGE, a novel data characterization framework tailored to regression tasks and compatible with a broad class of regressors.
TRIAGE utilizes conformal predictive distributions to provide a model-agnostic scoring method, the TRIAGE score.
We show that TRIAGE's characterization is consistent and highlight its utility to improve performance via data sculpting/filtering, in multiple regression settings.
arXiv Detail & Related papers (2023-10-29T10:31:59Z) - Zero-shot Composed Text-Image Retrieval [72.43790281036584]
We consider the problem of composed image retrieval (CIR)
It aims to train a model that can fuse multi-modal information, e.g., text and images, to accurately retrieve images that match the query, extending the user's expression ability.
arXiv Detail & Related papers (2023-06-12T17:56:01Z) - infoVerse: A Universal Framework for Dataset Characterization with
Multidimensional Meta-information [68.76707843019886]
infoVerse is a universal framework for dataset characterization.
infoVerse captures multidimensional characteristics of datasets by incorporating various model-driven meta-information.
In three real-world applications (data pruning, active learning, and data annotation), the samples chosen on infoVerse space consistently outperform strong baselines.
arXiv Detail & Related papers (2023-05-30T18:12:48Z) - A Bag-of-Prototypes Representation for Dataset-Level Applications [24.629132557336312]
This work investigates dataset vectorization for two dataset-level tasks: assessing training set suitability and test set difficulty.
We propose a bag-of-prototypes (BoP) dataset representation that extends the image-level bag consisting of patch descriptors to dataset-level bag consisting of semantic prototypes.
BoP consistently shows its advantage over existing representations on a series of benchmarks for two dataset-level tasks.
arXiv Detail & Related papers (2023-03-23T13:33:58Z) - MSeg: A Composite Dataset for Multi-domain Semantic Segmentation [100.17755160696939]
We present MSeg, a composite dataset that unifies semantic segmentation datasets from different domains.
We reconcile the generalization and bring the pixel-level annotations into alignment by relabeling more than 220,000 object masks in more than 80,000 images.
A model trained on MSeg ranks first on the WildDash-v1 leaderboard for robust semantic segmentation, with no exposure to WildDash data during training.
arXiv Detail & Related papers (2021-12-27T16:16:35Z) - Understanding Dynamics of Nonlinear Representation Learning and Its
Application [12.697842097171119]
We study the dynamics of implicit nonlinear representation learning.
We show that the data-architecture alignment condition is sufficient for the global convergence.
We derive a new training framework, which satisfies the data-architecture alignment condition without assuming it.
arXiv Detail & Related papers (2021-06-28T16:31:30Z) - Are Labels Always Necessary for Classifier Accuracy Evaluation? [28.110519483540482]
We aim to estimate the classification accuracy on unlabeled test datasets.
We construct a meta-dataset comprised of datasets generated from the original images.
As the classification accuracy of the model on each sample (dataset) is known from the original dataset labels, our task can be solved via regression.
arXiv Detail & Related papers (2020-07-06T17:45:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.