Fugu-MT 論文翻訳(概要): BanglaLorica: Design and Evaluation of a Robust Watermarking Algorithm for Large Language Models in Bangla Text Generation

論文の概要: BanglaLorica: Design and Evaluation of a Robust Watermarking Algorithm for Large Language Models in Bangla Text Generation

arxiv url: http://arxiv.org/abs/2601.04534v1
Date: Thu, 08 Jan 2026 03:01:59 GMT
ステータス: 翻訳完了
システム内更新日: 2026-01-09 17:01:53.005321
Title: BanglaLorica: Design and Evaluation of a Robust Watermarking Algorithm for Large Language Models in Bangla Text Generation
Title（参考訳）: BanglaLorica:Banglaテキスト生成における大規模言語モデルのためのロバストな透かしアルゴリズムの設計と評価
Authors: Amit Bin Tariqul, A N M Zahid Hossain Milkan, Sahab-Al-Chowdhury, Syed Rifat Raiyan, Hasan Mahmud, Md Kamrul Hasan,
Abstract要約: 本研究は,最新のテキスト透かし手法の体系的評価を初めて行ったものである。組込み時間とポストジェネレーション透かしを組み合わせた層状透かし方式を提案する。本研究は,多言語透かしにおけるロバスト性品質のトレードオフを定量化し,実践的,トレーニング不要なソリューションとして層状透かしを確立した。
参考スコア（独自算出の注目度）: 2.464003792743989
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: As large language models (LLMs) are increasingly deployed for text generation, watermarking has become essential for authorship attribution, intellectual property protection, and misuse detection. While existing watermarking methods perform well in high-resource languages, their robustness in low-resource languages remains underexplored. This work presents the first systematic evaluation of state-of-the-art text watermarking methods: KGW, Exponential Sampling (EXP), and Waterfall, for Bangla LLM text generation under cross-lingual round-trip translation (RTT) attacks. Under benign conditions, KGW and EXP achieve high detection accuracy (>88%) with negligible perplexity and ROUGE degradation. However, RTT causes detection accuracy to collapse below RTT causes detection accuracy to collapse to 9-13%, indicating a fundamental failure of token-level watermarking. To address this, we propose a layered watermarking strategy that combines embedding-time and post-generation watermarks. Experimental results show that layered watermarking improves post-RTT detection accuracy by 25-35%, achieving 40-50% accuracy, representing a 3$\times$ to 4$\times$ relative improvement over single-layer methods, at the cost of controlled semantic degradation. Our findings quantify the robustness-quality trade-off in multilingual watermarking and establish layered watermarking as a practical, training-free solution for low-resource languages such as Bangla. Our code and data will be made public.
Abstract（参考訳）: 大規模言語モデル (LLM) がテキスト生成のためにますますデプロイされるにつれて、著者の帰属、知的財産保護、誤用検出にウォーターマーキングが不可欠になっている。既存のウォーターマーキング手法は、高リソース言語ではよく機能するが、低リソース言語のロバスト性はいまだに未調査である。 KGW, Exponential Sampling (EXP), Waterfall, for Bangla LLM text generation under cross-lingual round-trip translation (RTT) attack。良性条件下では、KGWとEXPは、無視できるパープレキシティとROUGE劣化を伴う高い検出精度(>88%)を達成する。しかし、RTTは検出精度をRTT以下に低下させ、検出精度を9-13%に低下させ、トークンレベルの透かしの根本的な失敗を示す。そこで本研究では,組込み時間とポストジェネレーション透かしを組み合わせた階層型透かし手法を提案する。実験結果から,層状透かしによりRTT後検出精度が25～35%向上し,40～50%の精度が得られた。本研究は,多言語透かしにおけるロバスト性品質のトレードオフを定量化し,Banglaなどの低リソース言語に対する実践的,トレーニング不要なソリューションとして階層化透かしを確立した。私たちのコードとデータは公開されます。

論文の概要: BanglaLorica: Design and Evaluation of a Robust Watermarking Algorithm for Large Language Models in Bangla Text Generation

関連論文リスト