Fugu-MT Paper Translation (Abstract): Higress-RAG: A Holistic Optimization Framework for Enterprise Retrieval-Augmented Generation via Dual Hybrid Retrieval, Adaptive Routing, and CRAG

Paper overview: Higress-RAG: A Holistic Optimization Framework for Enterprise Retrieval-Augmented Generation via Dual Hybrid Retrieval, Adaptive Routing, and CRAG

arxiv url: http://arxiv.org/abs/2602.23374v1
Date: Tue, 30 Dec 2025 05:28:05 GMT
Status: Translation complete
Last updated in system: 2026-03-09 01:20:07.938569
Title: Higress-RAG: A Holistic Optimization Framework for Enterprise Retrieval-Augmented Generation via Dual Hybrid Retrieval, Adaptive Routing, and CRAG
Title (reference translation): Higress-RAG: A Holistic Optimization Framework for Enterprise Retrieval-Augmented Generation via Dual Hybrid Retrieval, Adaptive Routing, and CRAG
Authors: Weixi Lin
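Abstract summary: The Higress RAG MCP Server is a novel, enterprise-centric architecture for AI deployment. The system orchestrates Adaptive Routing, Semantic Caching, Hybrid Retrieval, and Corrective RAG. It offers a scalable, hallucination-resistant solution for enterprise AI deployment.
Reference score (independently computed attention score): 0.0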
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: The integration of Large Language Models (LLMs) into enterprise knowledge management systems has been catalyzed by the Retrieval-Augmented Generation (RAG) paradigm, which augments parametric memory with non-parametric external data. However, the transition from proof-of-concept to production-grade RAG systems is hindered by three persistent challenges: low retrieval precision for complex queries, high rates of hallucination in the generation phase, and unacceptable latency for real-time applications. This paper presents a comprehensive analysis of the Higress RAG MCP Server, a novel, enterprise-centric architecture designed to resolve these bottlenecks through a "Full-Link Optimization" strategy. Built upon the Model Context Protocol (MCP), the system introduces a layered architecture that orchestrates a sophisticated pipeline of Adaptive Routing, Semantic Caching, Hybrid Retrieval, and Corrective RAG (CRAG). We detail the technical implementation of key innovations, including the Higress-Native Splitter for structure-aware data ingestion, the application of Reciprocal Rank Fusion (RRF) for merging dense and sparse retrieval signals, and a 50ms-latency Semantic Caching mechanism with dynamic thresholding. Experimental evaluations on domain-specific Higress technical documentation and blogs verify the system's architectural robustness. The results demonstrate that by optimizing the entire retrieval lifecycle - from pre-retrieval query rewriting to post-retrieval corrective evaluation - the Higress RAG system offers a scalable, hallucination-resistant solution for enterprise AI deployment.
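Abstract (reference translation): The integration of Large Language Models (LLMs) into enterprise knowledge management systems has been enabled by the Retrieval-Augmented Generation (RAG) paradigm, which extends parametric memory with non-parametric external data. However, the transition from proof of concept to production-grade RAG is hindered by three persistent challenges: low retrieval precision for complex queries, a high hallucination rate in the generation phase, and latency that is unacceptable for real-time applications. This paper provides a comprehensive analysis of the Higress RAG MCP Server, a novel enterprise-centric architecture designed to resolve these bottlenecks through a "Full-Link Optimization" strategy. Built on the Model Context Protocol (MCP), the system introduces a layered architecture that orchestrates a sophisticated pipeline of Adaptive Routing, Semantic Caching, Hybrid Retrieval, and Corrective RAG (CRAG). The paper describes the technical implementation of key innovations, including the Higress-Native Splitter for structure-aware data ingestion, the application of Reciprocal Rank Fusion (RRF) to merge dense and sparse retrieval signals, and a 50ms-latency Semantic Caching mechanism with dynamic thresholding. Experimental evaluations on domain-specific Higress technical documentation and blogs verify the robustness of the system's architecture. The results show that, by optimizing the entire retrieval lifecycle, from pre-retrieval query rewriting to post-retrieval corrective evaluation, the Higress RAG system offers a scalable, hallucination-resistant solution for enterprise AI deployment.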
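
Illustrative note: the abstract names Reciprocal Rank Fusion (RRF) as the method for merging dense and sparse retrieval signals but gives no implementation details. The following is a minimal sketch of the standard RRF formula, not the paper's code. The function name, the document IDs, and the constant k=60 (a commonly used default) are assumptions made for illustration only.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document IDs with RRF.

    Each document scores sum(1 / (k + rank)) over the lists in which it
    appears, with rank starting at 1. The value k=60 is an assumed default,
    not a parameter taken from the paper.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by document ID for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical result lists from a dense (vector) and a sparse (keyword) retriever.
dense = ["doc_a", "doc_b", "doc_c"]
sparse = ["doc_b", "doc_d", "doc_a"]
fused = reciprocal_rank_fusion([dense, sparse])
print(fused)
```

Because RRF uses only rank positions, it needs no score normalization between the two retrievers, which is the usual reason for choosing it when similarity scores and keyword scores are on different scales.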

Related papers