Fugu-MT Paper Translation (Abstract): FeLoG: Scalable and Efficient Distributed Graph Embedding with Feedback Loop Mechanism

Paper overview: FeLoG: Scalable and Efficient Distributed Graph Embedding with Feedback Loop Mechanism

arxiv url: http://arxiv.org/abs/2606.22180v1
Date: Sat, 20 Jun 2026 18:21:15 GMT
Status: Translation complete
System update date: 2026-06-25 22:16:03.55415
Title: FeLoG: Scalable and Efficient Distributed Graph Embedding with Feedback Loop Mechanism
Title (reference translation): FeLoG: Scalable and Efficient Distributed Graph Embedding with a Feedback Loop Mechanism
Authors: Peng Fang, Arijit Khan, Ziqiang Wu, Zhenli Li, Yibo Zhou, Fang Wang, Dan Feng
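Abstract summary: The paper presents FeLoG, a feedback loop-driven system for scalable distributed graph embedding. It introduces feedback-coupled sampling and training, dynamically prioritizing undertrained nodes according to real-time embedding-quality feedback. FeLoG achieves an average speedup of 27.9x, reduces communication cost by more than 53.1%, and sustains over 80% CPU-GPU utilization.
Reference score (independently computed attention score): 10.118550615899972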
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Graph embedding maps graph nodes into low-dimensional vectors to support applications such as recommendation, fraud detection, and graph-based retrieval-augmented generation (GraphRAG). As graphs scale to billions of edges, scalable and efficient graph embedding has become increasingly important. Existing frameworks commonly adopt a sampling-training paradigm, in which mini-batches are constructed by sampling nodes and their neighbors. However, sampling is typically decoupled from evolving embedding quality, causing redundant exploration of well-trained regions while under-sampling undertrained nodes. At the system level, such decoupling further leads to excessive communication, serialized execution, and low resource utilization in distributed environments. We present FeLoG, a feedback loop-driven system for scalable distributed graph embedding. (1) FeLoG introduces feedback-coupled sampling and training, dynamically prioritizing undertrained nodes according to real-time embedding-quality feedback, thereby reducing redundant computation and accelerating convergence. (2) It employs activity-aware communication that compresses frequently occurring node sequences to reduce intra-machine PCIe traffic and selectively synchronizes frequently updated embeddings to reduce inter-machine communication. (3) It adopts a round-interleaved pipeline that overlaps next-round sampling with current-round training to improve CPU-GPU utilization. Experiments against six state-of-the-art baselines on large-scale graphs show that FeLoG achieves an average speedup of 27.9x, reduces communication cost by more than 53.1%, and sustains over 80% CPU-GPU utilization.
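Abstract (reference translation): Graph embedding maps graph nodes to low-dimensional vectors and supports applications such as recommendation, fraud detection, and graph-based retrieval-augmented generation (GraphRAG). As graphs scale to billions of edges, scalable and efficient graph embedding is becoming increasingly important. Existing frameworks generally follow a sampling-training paradigm in which mini-batches are built by sampling nodes and their neighbors. However, sampling is usually decoupled from the evolving embedding quality, so well-trained regions are explored redundantly while undertrained nodes are under-sampled. At the system level, this decoupling also leads to excessive communication, serialized execution, and low resource utilization in distributed environments. We propose FeLoG, a feedback loop-driven system for scalable distributed graph embedding. (1) FeLoG introduces feedback-coupled sampling and training and dynamically prioritizes undertrained nodes based on real-time embedding-quality feedback, which reduces redundant computation and accelerates convergence. (2) It compresses frequently occurring node sequences to reduce intra-machine PCIe traffic and selectively synchronizes frequently updated embeddings to reduce inter-machine communication. (3) To improve CPU-GPU utilization, it adopts a round-interleaved pipeline that overlaps next-round sampling with current-round training. Experiments against six state-of-the-art baselines on large-scale graphs show that FeLoG achieves an average speedup of 27.9x, reduces communication cost by more than 53.1%, and sustains CPU-GPU utilization above 80%.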
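
Illustrative sketch (not from the paper): the abstract describes feedback-coupled sampling only at a high level, so the following Python toy shows one plausible reading of the idea, in which each node's most recent training loss serves as the sampling weight for the next round. The function names `sample_batch` and `feedback_round`, the `floor` parameter, and the use of per-node loss as the "embedding-quality feedback" signal are all assumptions made for illustration; FeLoG's actual quality metric and sampler may differ.

```python
import random


def sample_batch(node_loss, batch_size, rng, floor=1e-3):
    """Draw distinct nodes, weighting each by its most recent loss.

    Assumes losses are non-negative. `floor` (hypothetical) keeps
    well-trained nodes reachable and avoids an all-zero weight vector.
    """
    nodes = list(node_loss)
    weights = [node_loss[n] + floor for n in nodes]
    batch = []
    for _ in range(min(batch_size, len(nodes))):
        i = rng.choices(range(len(nodes)), weights=weights, k=1)[0]
        batch.append(nodes.pop(i))  # remove so the node is not drawn twice
        weights.pop(i)
    return batch


def feedback_round(node_loss, batch_size, rng, train_step):
    """One sampling-training round: sample by loss, train, write losses back."""
    batch = sample_batch(node_loss, batch_size, rng)
    for n in batch:
        # Feedback: the freshly observed loss steers the next round's sampling.
        node_loss[n] = train_step(n)
    return batch


# Toy usage: node 0 starts far less trained than the others.
rng = random.Random(0)
losses = {0: 8.0, 1: 0.1, 2: 0.1, 3: 0.1}
halve = lambda n: losses[n] / 2  # stand-in for a real training step
for _ in range(5):
    feedback_round(losses, batch_size=2, rng=rng, train_step=halve)
```

In this toy, node 0 is drawn in most early rounds because its loss dominates the weights, and it is drawn less often as its loss shrinks. A sampler that ignores the losses would keep revisiting nodes 1 to 3 at the same rate throughout, which is the redundancy the abstract attributes to decoupled sampling.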
