Fugu-MT 論文翻訳(概要): TrojanGYM: A Detector-in-the-Loop LLM for Adaptive RTL Hardware Trojan Insertion

論文の概要: TrojanGYM: A Detector-in-the-Loop LLM for Adaptive RTL Hardware Trojan Insertion

arxiv url: http://arxiv.org/abs/2601.17178v1
Date: Fri, 23 Jan 2026 21:11:44 GMT
ステータス: 翻訳完了
システム内更新日: 2026-01-27 15:23:07.321605
Title: TrojanGYM: A Detector-in-the-Loop LLM for Adaptive RTL Hardware Trojan Insertion
Title（参考訳）: TrojanGYM: 適応RTLハードウェアトロイの木馬挿入用LLM検出器
Authors: Saideep Sreekumar, Zeng Wang, Akashdeep Saha, Weihua Xiao, Minghao Shao, Muhammad Shafique, Ozgur Sinanoglu, Ramesh Karri, Johann Knechtel,
Abstract要約: ハードウェアトロイの木馬(HT)は、学習検出器がパターンやスタイリングされた小さなベンチマークをトリガー/ロードするのに過度に適合するため、依然として重要な脅威である。本稿では,HT挿入を自動的にキュレートして検出盲点を露呈するフレームワークであるTrojanGYMを紹介する。また,グラフ抽出,トレーニング,予測信頼性を改善したGNN4TJの新たな実装であるRobust-GNN4TJを提案する。
参考スコア（独自算出の注目度）: 14.250356355764389
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Hardware Trojans (HTs) remain a critical threat because learning-based detectors often overfit to narrow trigger/payload patterns and small, stylized benchmarks. We introduce TrojanGYM, an agentic, LLM-driven framework that automatically curates HT insertions to expose detector blind spots while preserving design correctness. Given high-level HT specifications, a suite of cooperating LLM agents (instantiated with GPT-4, LLaMA-3.3-70B, and Gemini-2.5Pro) proposes and refines RTL modifications that realize diverse triggers and payloads without impacting normal functionality. TrojanGYM implements a feedback-driven benchmark generation loop co-designed with HT detectors, in which constraint-aware syntactic checking and GNN-based HT detectors provide feedback that iteratively refines HT specifications and insertion strategies to better surface detector blind spots. We further propose Robust-GNN4TJ, a new implementation of the GNN4TJ with improved graph extraction, training robustness, and prediction reliability, especially on LLM-generated HT designs. On the most challenging TrojanGYM-generated benchmarks, Robust-GNN4TJ raises HT detection rates from 0% to 60% relative to a prior GNN-based detector. We instantiate TrojanGYM on SRAM, AES-128, and UART designs at RTL level, and show that it systematically produces diverse, functionally correct HTs that reach up to 83.33% evasion rates against modern GNN-based detectors, revealing robustness gaps that are not apparent when these detectors are evaluated solely on existing TrustHub-style benchmarks. Post peer-review, we will release all codes and artifacts.
Abstract（参考訳）: ハードウェアトロイの木馬(HT)は、学習ベースの検出器が狭いトリガー/ペイロードパターンと小さなスタイリングされたベンチマークに過度に適合するため、依然として重大な脅威である。我々は,HT挿入を自動的にキュレートして,設計の正しさを保ちながら検出盲点を露呈する,エージェント型LLM駆動のフレームワークであるTrojanGYMを紹介する。高レベルのHT仕様を与えられたLLMエージェント群(GPT-4、LLaMA-3.3-70B、Gemini-2.5Pro)は、通常の機能に影響を与えることなく多様なトリガやペイロードを実現するRTL修正を提案し、改良する。 TrojanGYMは、HT検出器と共同設計されたフィードバック駆動ベンチマーク生成ループを実装しており、制約を意識した構文チェックとGNNベースのHT検出器は、HT仕様を反復的に洗練し、インサート戦略によりディザスタブラインドスポットをよりよく表面化するフィードバックを提供する。さらに, GNN4TJ の新たな実装である Robust-GNN4TJ を提案する。最も困難なTrojanGYM生成ベンチマークでは、Robost-GNN4TJは、以前のGNNベースの検出器と比較して、HT検出率を0%から60%に引き上げている。我々は、SRAM、AES-128、UARTの設計上のTrojanGYMをRTLレベルでインスタンス化し、現在のGNNベースの検出器に対して最大83.33%の回避率に達する多様な機能的なHTを体系的に生成し、これらの検出器が既存のTrustHubスタイルのベンチマークでのみ評価された場合に、堅牢性ギャップが明らかになることを示した。ピアレビューの後、すべてのコードとアーティファクトをリリースします。

論文の概要: TrojanGYM: A Detector-in-the-Loop LLM for Adaptive RTL Hardware Trojan Insertion

関連論文リスト