Fugu-MT 論文翻訳(概要): ToxicTextCLIP: Text-Based Poisoning and Backdoor Attacks on CLIP Pre-training

論文の概要: ToxicTextCLIP: Text-Based Poisoning and Backdoor Attacks on CLIP Pre-training

arxiv url: http://arxiv.org/abs/2511.00446v1
Date: Sat, 01 Nov 2025 08:25:49 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-05 16:37:26.780546
Title: ToxicTextCLIP: Text-Based Poisoning and Backdoor Attacks on CLIP Pre-training
Title（参考訳）: ToxicTextCLIP:CLIP事前トレーニングにおけるテキストベースのポジショニングとバックドアアタック
Authors: Xin Yao, Haiyang Zhao, Yimin Chen, Jiawei Guo, Kecheng Huang, Ming Zhao,
Abstract要約: ToxicTextCLIPは,事前学習期間中に高品質なテキストを生成するためのフレームワークである。このフレームワークは、2つの主要な課題に対処する。背景の不整合による意味的不整合と、背景一貫性テキストの不足である。最大95.83%の毒殺、98.68%のバックドアHit@1、RoCLIP、CleanCLIP、SafeCLIPをバイパスする。
参考スコア（独自算出の注目度）: 12.65848279918585
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The Contrastive Language-Image Pretraining (CLIP) model has significantly advanced vision-language modeling by aligning image-text pairs from large-scale web data through self-supervised contrastive learning. Yet, its reliance on uncurated Internet-sourced data exposes it to data poisoning and backdoor risks. While existing studies primarily investigate image-based attacks, the text modality, which is equally central to CLIP's training, remains underexplored. In this work, we introduce ToxicTextCLIP, a framework for generating high-quality adversarial texts that target CLIP during the pre-training phase. The framework addresses two key challenges: semantic misalignment caused by background inconsistency with the target class, and the scarcity of background-consistent texts. To this end, ToxicTextCLIP iteratively applies: 1) a background-aware selector that prioritizes texts with background content aligned to the target class, and 2) a background-driven augmenter that generates semantically coherent and diverse poisoned samples. Extensive experiments on classification and retrieval tasks show that ToxicTextCLIP achieves up to 95.83% poisoning success and 98.68% backdoor Hit@1, while bypassing RoCLIP, CleanCLIP and SafeCLIP defenses. The source code can be accessed via https://github.com/xinyaocse/ToxicTextCLIP/.
Abstract（参考訳）: Contrastive Language-Image Pretraining (CLIP) モデルは、自己教師付きコントラスト学習を通じて、大規模なWebデータから画像テキストペアをアライメントすることで、視覚言語モデリングを著しく進歩させる。しかし、未処理のインターネットソースデータへの依存は、データ中毒やバックドアのリスクに晒される。既存の研究では画像ベースの攻撃を主に研究しているが、CLIPの訓練に等しく中心的なテキストモダリティは未調査のままである。本稿では,CLIPを対象とした高品質なテキストを生成するフレームワークであるToxicTextCLIPを紹介する。このフレームワークは、2つの主要な課題に対処する。背景の不整合による意味的不整合と、背景一貫性テキストの不足である。この目的のために、ToxicTextCLIPは次のように繰り返し適用される。 1【対象クラスに整列した背景コンテンツによるテキストを優先する背景対応セレクタ】 2) セマンティック・コヒーレントで多彩な有毒な試料を生成する背景駆動型増強剤。分類と検索タスクに関する大規模な実験によると、ToxicTextCLIPは最大95.83%の毒殺成功と98.68%のバックドアHit@1を達成し、RoCLIP、CleanCLIP、SafeCLIPの防衛をバイパスしている。ソースコードはhttps://github.com/xinyaocse/ToxicTextCLIP/を通じてアクセスすることができる。

論文の概要: ToxicTextCLIP: Text-Based Poisoning and Backdoor Attacks on CLIP Pre-training

関連論文リスト