Fugu-MT 論文翻訳(概要): CLIPure: Purification in Latent Space via CLIP for Adversarially Robust Zero-Shot Classification

論文の概要: CLIPure: Purification in Latent Space via CLIP for Adversarially Robust Zero-Shot Classification

arxiv url: http://arxiv.org/abs/2502.18176v1
Date: Tue, 25 Feb 2025 13:09:34 GMT
ステータス: 翻訳完了
システム内更新日: 2025-02-26 17:42:46.065006
Title: CLIPure: Purification in Latent Space via CLIP for Adversarially Robust Zero-Shot Classification
Title（参考訳）: CLIPure: 逆ロバストゼロショット分類のためのCLIPによる潜時空間の浄化
Authors: Mingkun Zhang, Keping Bi, Wei Chen, Jiafeng Guo, Xueqi Cheng,
Abstract要約: 画像とテキストプロンプトをマッチングすることでゼロショット分類を行うことができる、視覚言語で事前訓練されたエンコーダモデルであるCLIPについて検討する。次に, 共分散精製プロセス間のKL分散として精製リスクを定式化する。画像の潜伏ベクトルの確率をモデル化するCLI-Diffと、画像の埋め込みとaの写真とのコサイン類似度をモデル化するCLI-Cosの2つのバリエーションを提案する。
参考スコア（独自算出の注目度）: 65.46685389276443
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: In this paper, we aim to build an adversarially robust zero-shot image classifier. We ground our work on CLIP, a vision-language pre-trained encoder model that can perform zero-shot classification by matching an image with text prompts ``a photo of a <class-name>.''. Purification is the path we choose since it does not require adversarial training on specific attack types and thus can cope with any foreseen attacks. We then formulate purification risk as the KL divergence between the joint distributions of the purification process of denoising the adversarial samples and the attack process of adding perturbations to benign samples, through bidirectional Stochastic Differential Equations (SDEs). The final derived results inspire us to explore purification in the multi-modal latent space of CLIP. We propose two variants for our CLIPure approach: CLIPure-Diff which models the likelihood of images' latent vectors with the DiffusionPrior module in DaLLE-2 (modeling the generation process of CLIP's latent vectors), and CLIPure-Cos which models the likelihood with the cosine similarity between the embeddings of an image and ``a photo of a.''. As far as we know, CLIPure is the first purification method in multi-modal latent space and CLIPure-Cos is the first purification method that is not based on generative models, which substantially improves defense efficiency. We conducted extensive experiments on CIFAR-10, ImageNet, and 13 datasets that previous CLIP-based defense methods used for evaluating zero-shot classification robustness. Results show that CLIPure boosts the SOTA robustness by a large margin, e.g., from 71.7% to 91.1% on CIFAR10, from 59.6% to 72.6% on ImageNet, and 108% relative improvements of average robustness on the 13 datasets over previous SOTA. The code is available at https://github.com/TMLResearchGroup-CAS/CLIPure.
Abstract（参考訳）: 本稿では,逆向きに頑健なゼロショット画像分類器を構築することを目的とする。画像に<class-name>の写真を表示することで、ゼロショット分類を行うことができる視覚言語事前学習エンコーダモデルであるCLIPについて検討する。と。特定の攻撃タイプに対する敵の訓練を必要とせず、従っていかなる前向きな攻撃にも対処できるため、浄化は私たちが選択する道です。次に, 両方向確率微分方程式(SDE)を用いて, 対向検体を識別する浄化過程のKL分布と, 良性検体に摂動を加える攻撃過程との相違点として, 浄化リスクを定式化する。最終結果はCLIPの多モード潜伏空間における清浄の探求を刺激した。 CLIPure-DiffはDiffusionPriorモジュールをDaLLE-2(CLIPの潜伏ベクトルの生成過程をモデル化する)で画像の潜伏ベクトルの確率をモデル化し、CLIPure-Cosは画像の埋め込みと「aの写真」のコサイン類似度で確率をモデル化する。と。私たちの知る限り、CLIPureはマルチモーダル潜在空間における最初の浄化法であり、CLIPure-Cosは生成モデルに基づいていない最初の浄化法であり、防衛効率を大幅に向上させる。 CIFAR-10, ImageNet, および13のデータセットに対して, ゼロショット分類ロバスト性を評価するために, 従来のCLIPベースの防御手法を用いた広範な実験を行った。 CIFAR10では71.7%から91.1%、ImageNetでは59.6%から72.6%、以前のSOTAよりも平均ロバストネスが108%向上した。コードはhttps://github.com/TMLResearchGroup-CAS/CLIPureで公開されている。

論文の概要: CLIPure: Purification in Latent Space via CLIP for Adversarially Robust Zero-Shot Classification

関連論文リスト