Fugu-MT Paper Translation (Abstract): How Well Can We Decode Vowels from Auditory EEG -- A Rigorous Cross-Subject Benchmark with Honest Assessment

arxiv url: http://arxiv.org/abs/2605.00865v1
Date: Wed, 22 Apr 2026 05:50:59 GMT
Status: translation complete
Last updated in system: 2026-05-11 06:56:26.476781
Title: How Well Can We Decode Vowels from Auditory EEG -- A Rigorous Cross-Subject Benchmark with Honest Assessment
Authors: Xiaoyang Li
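Abstract summary: We benchmark five-class vowel decoding (a, e, i, o, u) from auditory EEG using OpenNeuro ds006104. The best full-feature model (XGBoost) reaches 24.5 percent accuracy (chance: 20 percent), while differential entropy features with LightGBM reach 25.5 percent.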
Reference score (independently computed attention score): 3.942402228954563
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: EEG-based phoneme decoding is promising for brain-computer interfaces, but many prior studies rely on within-subject evaluation, small cohorts, or weak leakage control. We present a reproducible cross-subject benchmark for five-class vowel decoding (a, e, i, o, u) from auditory EEG using OpenNeuro ds006104 (16 subjects, 61 channels, 256 Hz). Under strict leave-one-subject-out evaluation with training-only normalization and explicit anti-leakage checks, we compare 14 pipelines from classical machine learning, deep learning, and Riemannian methods. The best full-feature model (XGBoost) reaches 24.5 percent accuracy (chance: 20 percent), while differential entropy features with LightGBM reach 25.5 percent in feature-specific analysis. After multiple-comparison correction, strong pairwise model advantages are limited. Classical methods are competitive with deep models in this low-signal regime. Additional analyses (ablation, pairwise vowels, within-subject CV, ERP, temporal generalization, and electrode importance) indicate that vowel information is real but weak and mainly carried by early transient auditory responses. We release code and evaluation scripts for full reproducibility.
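As an illustration of the evaluation protocol the abstract describes, the sketch below shows one way leave-one-subject-out evaluation with training-only normalization and an explicit leakage check could be arranged. It is not the authors' released code: the function names, the toy random data, and the nearest-centroid classifier are all assumptions made for this example, and the classifier is only a stand-in, not one of the 14 pipelines in the paper.

```python
import math
import random
from collections import defaultdict


def loso_splits(subjects):
    """Yield (held_out, train_idx, test_idx), one fold per subject."""
    for held_out in sorted(set(subjects)):
        train_idx = [i for i, s in enumerate(subjects) if s != held_out]
        test_idx = [i for i, s in enumerate(subjects) if s == held_out]
        yield held_out, train_idx, test_idx


def fit_zscore(rows):
    """Estimate per-feature mean and std from the given rows only."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    stds = []
    for j in range(d):
        var = sum((r[j] - means[j]) ** 2 for r in rows) / n
        stds.append(math.sqrt(var) or 1.0)  # fall back to 1.0 on zero variance
    return means, stds


def apply_zscore(rows, means, stds):
    """Normalize rows with previously fitted statistics."""
    return [[(x - m) / s for x, m, s in zip(r, means, stds)] for r in rows]


def nearest_centroid_predict(train_x, train_y, test_x):
    """Hypothetical stand-in classifier: assign the label of the closest class mean."""
    groups = defaultdict(list)
    for x, y in zip(train_x, train_y):
        groups[y].append(x)
    centroids = {y: [sum(col) / len(xs) for col in zip(*xs)]
                 for y, xs in groups.items()}

    def sq_dist(a, b):
        return sum((p - q) ** 2 for p, q in zip(a, b))

    return [min(centroids, key=lambda y: sq_dist(x, centroids[y])) for x in test_x]


def loso_accuracy(features, labels, subjects):
    """Return per-subject accuracy under leave-one-subject-out evaluation."""
    per_subject = {}
    for held_out, train_idx, test_idx in loso_splits(subjects):
        # Explicit anti-leakage check: no training trial from the held-out subject.
        assert all(subjects[i] != held_out for i in train_idx)
        train_x = [features[i] for i in train_idx]
        test_x = [features[i] for i in test_idx]
        # Normalization statistics come from the training subjects only.
        means, stds = fit_zscore(train_x)
        preds = nearest_centroid_predict(
            apply_zscore(train_x, means, stds),
            [labels[i] for i in train_idx],
            apply_zscore(test_x, means, stds),
        )
        correct = sum(p == labels[i] for p, i in zip(preds, test_idx))
        per_subject[held_out] = correct / len(test_idx)
    return per_subject


def make_toy_data(n_subjects=16, trials_per_subject=10, n_features=4, seed=0):
    """Random features with no real vowel signal (toy data, not ds006104)."""
    rng = random.Random(seed)
    vowels = ["a", "e", "i", "o", "u"]
    features, labels, subjects = [], [], []
    for s in range(n_subjects):
        for t in range(trials_per_subject):
            features.append([rng.gauss(0.0, 1.0) for _ in range(n_features)])
            labels.append(vowels[t % len(vowels)])
            subjects.append(s)
    return features, labels, subjects


if __name__ == "__main__":
    X, y, groups = make_toy_data()
    scores = loso_accuracy(X, y, groups)
    print(sum(scores.values()) / len(scores))  # mean accuracy across held-out subjects
```

Because the toy features carry no class information, the mean accuracy should hover around the 20 percent chance level for five classes. The key design point is that `fit_zscore` only ever sees training subjects, so the held-out subject's statistics cannot leak into the model.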