Fugu-MT 論文翻訳(概要): From Complaint Narratives to Monetary Relief: A Hybrid Machine Learning Framework for CFPB Consumer Complaints

論文の概要: From Complaint Narratives to Monetary Relief: A Hybrid Machine Learning Framework for CFPB Consumer Complaints

arxiv url: http://arxiv.org/abs/2606.22664v1
Date: Sun, 21 Jun 2026 20:41:03 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-25 07:44:07.999064
Title: From Complaint Narratives to Monetary Relief: A Hybrid Machine Learning Framework for CFPB Consumer Complaints
Title（参考訳）: 苦情から金銭的救済へ:CFPB消費者不満のためのハイブリッド機械学習フレームワーク
Authors: Zhuoer Wang, Sizhen Zhu, Xiongyu Chen,
Abstract要約: 本稿では、消費者金融保護局の苦情データを用いて、金融救済効果を予測するためのハイブリッド機械学習フレームワークを提案する。本研究では,不均衡な二項分類問題としてその課題を定式化し,金銭的救済でクローズドな苦情を補償可能な結果として扱う。 XGBoost分類器は、時間的列車分割を用いて訓練され、モデル開発に使用された初期の苦情と、より最近のサンプル外評価のために予約された苦情がある。
参考スコア（独自算出の注目度）: 1.3465168113172192
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Consumer financial complaints provide a valuable source of information for identifying service failures, dispute frictions, and operational deficiencies in consumer-facing financial institutions. This paper proposes a hybrid machine learning framework for predicting monetary relief outcomes using Consumer Financial Protection Bureau complaint data. We formulate the task as an imbalanced binary classification problem, where complaints closed with monetary relief are treated as compensable outcomes. The proposed framework integrates multiple sources of predictive information, including complaint narrative text, LDA-based topic representations, interpretable text-engineered features, and structured categorical attributes such as company and state. An XGBoost classifier is trained using a temporal train-test split, with earlier complaints used for model development and more recent complaints reserved for out-of-sample evaluation. Compared with a TF-IDF baseline, the proposed framework substantially improves predictive performance, increasing AUC-ROC from 0.69 to 0.78 and improving PR-AUC under class imbalance. Feature importance analysis shows that textual signals, latent complaint topics, and company identity all contribute meaningful predictive information. In particular, company-level effects reveal systematic variation in complaint resolution patterns across financial institutions. These findings suggest that consumer complaint narratives can serve as alternative data for monitoring consumer harm, identifying firm-level operational weaknesses, and supporting early-stage risk surveillance in consumer finance.
Abstract（参考訳）: 消費者金融苦情は、消費者が直面する金融機関のサービス障害、紛争摩擦、運用上の欠陥を特定するための貴重な情報源を提供する。本稿では、消費者金融保護局の苦情データを用いて、金融救済効果を予測するためのハイブリッド機械学習フレームワークを提案する。本研究では,不均衡な二項分類問題としてその課題を定式化し,金銭的救済でクローズドな苦情を補償可能な結果として扱う。提案フレームワークは,苦情文,LDAに基づく話題表現,解釈可能なテキスト工学的特徴,企業や状態などの構造的分類属性など,複数の予測情報ソースを統合している。 XGBoost分類器は、時間的列車分割を用いて訓練され、モデル開発に使用された初期の苦情と、より最近のサンプル外評価のために予約された苦情がある。 TF-IDFベースラインと比較して、提案フレームワークは予測性能を大幅に向上し、AUC-ROCを0.69から0.78に増加させ、PR-AUCをクラス不均衡下で改善する。特徴重要度分析は, 有意な予測情報に, テキスト信号, 潜在的苦情, 企業アイデンティティが寄与していることを示している。特に、企業レベルの影響は、金融機関間での苦情解決パターンの体系的な変化を示している。これらの結果は、消費者の苦情は、消費者の被害を監視し、企業レベルの運用上の弱点を特定し、消費者金融における早期のリスク監視を支援するための代替データとして役立つことを示唆している。

論文の概要: From Complaint Narratives to Monetary Relief: A Hybrid Machine Learning Framework for CFPB Consumer Complaints

関連論文リスト