Fugu-MT 論文翻訳(概要): HSGM: Hierarchical Segment-Graph Memory for Scalable Long-Text Semantics

論文の概要: HSGM: Hierarchical Segment-Graph Memory for Scalable Long-Text Semantics

arxiv url: http://arxiv.org/abs/2509.18168v1
Date: Wed, 17 Sep 2025 10:11:02 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-24 20:41:27.444759
Title: HSGM: Hierarchical Segment-Graph Memory for Scalable Long-Text Semantics
Title（参考訳）: HSGM:スケーラブルな長文セマンティックスのための階層的なセグメントグラフメモリ
Authors: Dong Liu, Yanxuan Yu,
Abstract要約: textbfHierarchical Segment-Graph Memory (HSGM) - 長さ$N$の入力を意味のあるセグメントに分解する新しいフレームワーク。 HSGMはインセンメンタルアップデートをサポートする。ローカルグラフの構築とサマリノードの統合を行うのは,新たに到着したセグメントのみである。提案手法は,超長文に対するスケーラブルで高精度なセマンティックモデリングを開放し,リアルタイムかつリソース制約のあるNLPアプリケーションを実現する。
参考スコア（独自算出の注目度）: 5.216774377033164
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Semantic parsing of long documents remains challenging due to quadratic growth in pairwise composition and memory requirements. We introduce \textbf{Hierarchical Segment-Graph Memory (HSGM)}, a novel framework that decomposes an input of length $N$ into $M$ meaningful segments, constructs \emph{Local Semantic Graphs} on each segment, and extracts compact \emph{summary nodes} to form a \emph{Global Graph Memory}. HSGM supports \emph{incremental updates} -- only newly arrived segments incur local graph construction and summary-node integration -- while \emph{Hierarchical Query Processing} locates relevant segments via top-$K$ retrieval over summary nodes and then performs fine-grained reasoning within their local graphs. Theoretically, HSGM reduces worst-case complexity from $O(N^2)$ to $O\!\left(N\,k + (N/k)^2\right)$, with segment size $k \ll N$, and we derive Frobenius-norm bounds on the approximation error introduced by node summarization and sparsification thresholds. Empirically, on three benchmarks -- long-document AMR parsing, segment-level semantic role labeling (OntoNotes), and legal event extraction -- HSGM achieves \emph{2--4$\times$ inference speedup}, \emph{$>60\%$ reduction} in peak memory, and \emph{$\ge 95\%$} of baseline accuracy. Our approach unlocks scalable, accurate semantic modeling for ultra-long texts, enabling real-time and resource-constrained NLP applications.
Abstract（参考訳）: 長い文書のセマンティック解析は、ペア構成とメモリ要求の二次的な成長のために依然として困難である。長さ$N$の入力を$M$意味セグメントに分解し、各セグメントに \emph{Local Semantic Graphs} を構築し、コンパクトな \emph{summary node} を抽出して \emph{Global Graph Memory} を形成する新しいフレームワークである \textbf{Hierarchical Segment-Graph Memory (HSGM) を紹介する。 HSGM は \emph{incremental updates} -- 新たに到着したセグメントのみがローカルグラフの構成とサマリノードの統合を発生させる -- をサポートし、一方 \emph{Hierarchical Query Processing} はサマリノード上のトップ$K$検索を通じて関連するセグメントを特定し、その後、ローカルグラフ内できめ細かい推論を行う。理論的には、HSGMは最悪のケースの複雑さを$O(N^2)$から$O\! セグメントサイズが $k \ll N$ である \left(N\,k + (N/k)^2\right)$ と、ノード要約とスペーサー化しきい値によって導入された近似誤差にフロベニウス-ノルム境界を導出する。実証的には、長いドキュメントAMR解析、セグメントレベルのセマンティックロールラベリング(OntoNotes)、法的イベント抽出の3つのベンチマークで、HSGMは、ピークメモリにおける \emph{2--4$\times$ inference speedup}、 \emph{$>60\%$ reduction}、ベースライン精度の \emph{$\ge 95\%$}を達成している。提案手法は,超長文に対するスケーラブルで高精度なセマンティックモデリングを開放し,リアルタイムかつリソース制約のあるNLPアプリケーションを実現する。

論文の概要: HSGM: Hierarchical Segment-Graph Memory for Scalable Long-Text Semantics

関連論文リスト