Related papers: Attribution Score Alignment in Explainable Data Management

Related papers

Empirical Evaluation of No Free Lunch Violations in Permutation-Based Optimization [0.0]
We study an iterative-search setting with sampling without replacement, where algorithms differ only in evaluation order.<n>Results show how objective reformulation and benchmark design can generate structured local departures from NFL intuition.<n>This message applies to evolutionary computation as well as to statistical procedures based on relabeling, resampling, and permutation tests.
arXiv Detail & Related papers (2026-03-04T00:55:25Z)
Stress-Testing Causal Claims via Cardinality Repairs [11.043119484281531]
How robust is a causal claim to small, targeted modifications in the data?<n>We introduce SubCure, a framework for auditing via cardinality repairs.<n>We develop efficient algorithms that incorporate machine unlearning techniques to update causal estimates without retraining from scratch.
arXiv Detail & Related papers (2025-12-02T07:31:03Z)
Reference-Free Rating of LLM Responses via Latent Information [53.463883683503106]
We study the common practice of asking a judge model to assign Likert-scale scores to free-text responses.<n>We then propose and evaluate Latent Judges, which derive scalar ratings from internal model signals.<n>Across a broad suite of pairwise and single-rating benchmarks, latent methods match or surpass standard prompting.
arXiv Detail & Related papers (2025-09-29T12:15:52Z)
Testing for LLM response differences: the case of a composite null consisting of semantically irrelevant query perturbations [10.216191904121178]
Given two input queries, it is natural to ask if their response distributions are the same.<n>A traditional test of equality might indicate that two semantically equivalent queries induce statistically different response distributions.<n>In this paper, we address this misalignment by incorporating into the testing procedure consideration of a collection of semantically similar queries.
arXiv Detail & Related papers (2025-09-13T19:44:42Z)
Unbiased Learning to Rank with Query-Level Click Propensity Estimation: Beyond Pointwise Observation and Relevance [74.43264459255121]
In real-world scenarios, users often click only one or two results after examining multiple relevant options. We propose a query-level click propensity model to capture the probability that users will click on different result lists. Our method introduces a Dual Inverse Propensity Weighting mechanism to address both relevance saturation and position bias.
arXiv Detail & Related papers (2025-02-17T03:55:51Z)
Attribution-Scores in Data Management and Explainable Machine Learning [0.0]
We describe recent research on the use of actual causality in the definition of responsibility scores in databases. In the case of databases, useful connections with database repairs are illustrated and exploited. For classification models, the responsibility score is properly extended and illustrated.
arXiv Detail & Related papers (2023-07-31T22:41:17Z)
From Database Repairs to Causality in Databases and Beyond [0.0]
We describe some recent approaches to score-based explanations for query answers in databases. Special emphasis is placed on the use of counterfactual reasoning for score specification and computation.
arXiv Detail & Related papers (2023-06-15T04:08:23Z)
Learnable Pillar-based Re-ranking for Image-Text Retrieval [119.9979224297237]
Image-text retrieval aims to bridge the modality gap and retrieve cross-modal content based on semantic similarities. Re-ranking, a popular post-processing practice, has revealed the superiority of capturing neighbor relations in single-modality retrieval tasks. We propose a novel learnable pillar-based re-ranking paradigm for image-text retrieval.
arXiv Detail & Related papers (2023-04-25T04:33:27Z)
Learning List-Level Domain-Invariant Representations for Ranking [59.3544317373004]
We propose list-level alignment -- learning domain-invariant representations at the higher level of lists. The benefits are twofold: it leads to the first domain adaptation generalization bound for ranking, in turn providing theoretical support for the proposed method.
arXiv Detail & Related papers (2022-12-21T04:49:55Z)
Integrating Rankings into Quantized Scores in Peer Review [61.27794774537103]
In peer review, reviewers are usually asked to provide scores for the papers. To mitigate this issue, conferences have started to ask reviewers to additionally provide a ranking of the papers they have reviewed. There are no standard procedure for using this ranking information and Area Chairs may use it in different ways. We take a principled approach to integrate the ranking information into the scores.
arXiv Detail & Related papers (2022-04-05T19:39:13Z)
Knowledge Base Question Answering by Case-based Reasoning over Subgraphs [81.22050011503933]
We show that our model answers queries requiring complex reasoning patterns more effectively than existing KG completion algorithms. The proposed model outperforms or performs competitively with state-of-the-art models on several KBQA benchmarks.
arXiv Detail & Related papers (2022-02-22T01:34:35Z)
Leveraging semantically similar queries for ranking via combining representations [20.79800117378761]
In data-scarce settings, the amount of labeled data available for a particular query can lead to a highly variable and ineffective ranking function. One way to mitigate the effect of the small amount of data is to leverage information from semantically similar queries. We describe and explore this phenomenon in the context of the bias-variance trade off and apply it to the data-scarce settings of a Bing navigational graph and the Drosophila larva connectome.
arXiv Detail & Related papers (2021-06-23T18:36:20Z)
Applying Transfer Learning for Improving Domain-Specific Search Experience Using Query to Question Similarity [0.0]
We discuss a framework for calculating similarities between a given input query and a set of predefined questions to retrieve the question which matches to it the most. We have used it for the financial domain, but the framework is generalized for any domain-specific search engine and can be used in other domains as well.
arXiv Detail & Related papers (2021-01-07T03:27:32Z)
Surprise: Result List Truncation via Extreme Value Theory [92.5817701697342]
We propose a statistical method that produces interpretable and calibrated relevance scores at query time using nothing more than the ranked scores. We demonstrate its effectiveness on the result list truncation task across image, text, and IR datasets.
arXiv Detail & Related papers (2020-10-19T19:15:50Z)
Robust Question Answering Through Sub-part Alignment [53.94003466761305]
We model question answering as an alignment problem. We train our model on SQuAD v1.1 and test it on several adversarial and out-of-domain datasets.
arXiv Detail & Related papers (2020-04-30T09:10:57Z)
Query Focused Multi-Document Summarization with Distant Supervision [88.39032981994535]
Existing work relies heavily on retrieval-style methods for estimating the relevance between queries and text segments. We propose a coarse-to-fine modeling framework which introduces separate modules for estimating whether segments are relevant to the query. We demonstrate that our framework outperforms strong comparison systems on standard QFS benchmarks.
arXiv Detail & Related papers (2020-04-06T22:35:19Z)

This list is automatically generated from the titles and abstracts of the papers in this site.