Fugu-MT 論文翻訳(概要): Are we still able to recognize pearls? Machine-driven peer review and the risk to creativity: An explainable RAG-XAI detection framework with markers extraction

論文の概要: Are we still able to recognize pearls? Machine-driven peer review and the risk to creativity: An explainable RAG-XAI detection framework with markers extraction

arxiv url: http://arxiv.org/abs/2604.07964v1
Date: Thu, 09 Apr 2026 08:25:49 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-10 18:34:05.801175
Title: Are we still able to recognize pearls? Machine-driven peer review and the risk to creativity: An explainable RAG-XAI detection framework with markers extraction
Title（参考訳）: 真珠はまだ認識できるのか? 機械駆動のピアレビューと創造性へのリスク:マーカー抽出による説明可能なRAG-XAI検出フレームワーク
Authors: Alin-Gabriel Văduva, Simona-Vasilica Oprea, Adela Bâra,
Abstract要約: 本稿では、レビュー品質を評価し、自動パターンを検出するための説明可能なフレームワーク(RAG-XAI)を提案する。 XGBoost、Random Forest、LightGBMは99.61%、AUC-ROCは0.999以上、F1スコアは0.9925である。
参考スコア（独自算出の注目度）: 7.723181091241251
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The integration of large language models (LLMs) into peer review raises a concern beyond authorship and detection: the potential cascading automation of the entire editorial process. As reviews become partially or fully machine-generated, it becomes plausible that editorial decisions may also be delegated to algorithmic systems, leading to a fully automated evaluation pipeline. They risk reshaping the criteria by which scientific work is assessed. This paper argues that machine-driven assessment may systematically favor standardized, pattern-conforming research while penalizing unconventional and paradigm-shifting ideas that require contextual human judgment. We consider that this shift could lead to epistemic homogenization, where researchers are implicitly incentivized to optimize their work for algorithmic approval rather than genuine discovery. To address this risk, we introduce an explainable framework (RAG-XAI) for assessing review quality and detecting automated patterns using markers LLM extractor, aiming to preserve transparency, accountability and creativity in science. The proposed framework achieves near-perfect detection performance, with XGBoost, Random Forest and LightGBM reaching 99.61% accuracy, AUC-ROC above 0.999 and F1-scores of 0.9925 on the test set, while maintaining extremely low false positive rates (<0.23%) and false negative rates (~0.8%). In contrast, the logistic regression baseline performs substantially worse (89.97% accuracy, F1-score 0.8314). Feature importance and SHAP analyses identify absence of personal signals and repetition patterns as the dominant predictors. Additionally, the RAG component achieves 90.5% top-1 retrieval accuracy, with strong same-class clustering in the embedding space, further supporting the reliability of the framework's outputs.
Abstract（参考訳）: 大規模言語モデル(LLM)をピアレビューに統合することで、著者や検出以上の懸念が生まれ、編集プロセス全体をカスケードする可能性がある。レビューが部分的にあるいは完全に機械生成されるにつれて、編集決定がアルゴリズムシステムに委譲される可能性が高くなり、完全に自動化された評価パイプラインがもたらされる。彼らは科学的な研究が評価される基準を改定するリスクがある。本稿では, 機械駆動型評価は, 文脈的人的判断を必要とする非伝統的な, パラダイムシフトの考え方を罰する一方で, 標準化されたパターンコンフォーミング研究を体系的に好む可能性があることを論じる。この変化は、真の発見ではなく、アルゴリズムの承認のために研究を最適化するために暗黙のインセンティブを与えられている、てんかんの均質化に繋がる可能性があると考えている。このリスクに対処するために,学術における透明性,説明責任,創造性を維持することを目的として,レビュー品質を評価し,マーカーLPM抽出器を用いて自動パターンを検出するための説明可能なフレームワーク(RAG-XAI)を導入する。 XGBoost、Random Forest、LightGBMは99.61%、AUC-ROCは0.999以上、F1スコアは0.9925であり、非常に低い偽陽性率 (0.23%) と偽陰性率 (~0.8%) を維持している。対照的に、ロジスティック回帰ベースラインは著しく悪化する(89.97%の精度、F1スコア0.8314)。特徴的重要性とSHAP分析は、個人信号と反復パターンの欠如を支配的な予測因子として同定する。さらに、RAGコンポーネントは90.5%のTop-1検索精度を達成し、埋め込み空間に強い同クラスのクラスタリングを行い、フレームワークの出力の信頼性をさらに向上する。

論文の概要: Are we still able to recognize pearls? Machine-driven peer review and the risk to creativity: An explainable RAG-XAI detection framework with markers extraction

関連論文リスト