Fugu-MT 論文翻訳(概要): AI-Generated Images Introduce Invisible Relevance Bias to Text-Image Retrieval

論文の概要: AI-Generated Images Introduce Invisible Relevance Bias to Text-Image Retrieval

arxiv url: http://arxiv.org/abs/2311.14084v3
Date: Mon, 15 Jan 2024 02:31:04 GMT
ステータス: 翻訳完了
システム内更新日: 2024-01-18 00:36:30.146519
Title: AI-Generated Images Introduce Invisible Relevance Bias to Text-Image Retrieval
Title（参考訳）: テキスト画像検索に可視的関連バイアスをもたらすai生成画像
Authors: Shicheng Xu, Danyang Hou, Liang Pang, Jingcheng Deng, Jun Xu, Huawei Shen, Xueqi Cheng
Abstract要約: 我々は,AI生成画像がテキスト画像検索モデルに目に見えない関連性バイアスをもたらすことを示す。検索モデルのトレーニングデータにAI生成画像を含めると、目に見えない関連性バイアスが増す。本研究では,目に見えない関連バイアスを軽減するための効果的なトレーニング手法を提案する。
参考スコア（独自算出の注目度）: 70.54890125718387
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: With the advancement of generation models, AI-generated content (AIGC) is becoming more realistic, flooding the Internet. A recent study suggests that this phenomenon causes source bias in text retrieval for web search. Specifically, neural retrieval models tend to rank generated texts higher than human-written texts. In this paper, we extend the study of this bias to cross-modal retrieval. Firstly, we successfully construct a suitable benchmark to explore the existence of the bias. Subsequent extensive experiments on this benchmark reveal that AI-generated images introduce an invisible relevance bias to text-image retrieval models. Specifically, our experiments show that text-image retrieval models tend to rank the AI-generated images higher than the real images, even though the AI-generated images do not exhibit more visually relevant features to the query than real images. This invisible relevance bias is prevalent across retrieval models with varying training data and architectures. Furthermore, our subsequent exploration reveals that the inclusion of AI-generated images in the training data of the retrieval models exacerbates the invisible relevance bias. The above phenomenon triggers a vicious cycle, which makes the invisible relevance bias become more and more serious. To elucidate the potential causes of invisible relevance and address the aforementioned issues, we introduce an effective training method aimed at alleviating the invisible relevance bias. Subsequently, we apply our proposed debiasing method to retroactively identify the causes of invisible relevance, revealing that the AI-generated images induce the image encoder to embed additional information into their representation. This information exhibits a certain consistency across generated images with different semantics and can make the retriever estimate a higher relevance score.
Abstract（参考訳）: 世代モデルの発展に伴い、AIGC(AI- generated content)がより現実的になり、インターネットが溢れている。最近の研究では、この現象がWeb検索のためのテキスト検索の源泉バイアスを引き起こすことが示唆されている。具体的には、ニューラル検索モデルは、人間が書いたテキストよりも高いテキストをランク付けする傾向にある。本稿では,このバイアスの研究をクロスモーダル検索に拡張する。まず,バイアスの存在を調べるための適切なベンチマークの構築に成功しました。このベンチマークのさらなる実験により、AI生成画像はテキスト画像検索モデルに目に見えない関連性バイアスをもたらすことが明らかになった。具体的には,テキスト画像検索モデルが,実際の画像よりも視覚的に関連した特徴を提示していないにもかかわらず,実際の画像よりもai生成画像を上位にランク付けする傾向があることを示す。この目に見えない関連性バイアスは、トレーニングデータやアーキテクチャの異なる検索モデルに共通している。さらに, 検索モデルの学習データにai生成画像が組み込まれることにより, 可視性バイアスが悪化することが明らかとなった。上記の現象は悪循環を引き起こし、目に見えない関連性バイアスがますます深刻になる。見えない関連性の潜在的原因を解明し、上記の問題に対処するために、目に見えない関連性バイアスを緩和するための効果的なトレーニング手法を提案する。次に,提案手法を適用して,視覚的関連性の原因を遡及的に同定し,AI生成画像が画像エンコーダを誘導し,その表現に付加情報を埋め込むことを示した。この情報は、異なる意味を持つ生成された画像間で一定の一貫性を示し、レトリバーが高い関連性スコアを推定することができる。

論文の概要: AI-Generated Images Introduce Invisible Relevance Bias to Text-Image Retrieval

関連論文リスト