Fugu-MT 論文翻訳(概要): BenchHAR: Benchmarking Self-Supervised Learning for Generalizable Sensor-based Activity Recognition

論文の概要: BenchHAR: Benchmarking Self-Supervised Learning for Generalizable Sensor-based Activity Recognition

arxiv url: http://arxiv.org/abs/2605.08296v1
Date: Fri, 08 May 2026 10:54:12 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-12 23:28:49.543312
Title: BenchHAR: Benchmarking Self-Supervised Learning for Generalizable Sensor-based Activity Recognition
Title（参考訳）: BenchHAR: 一般化可能なセンサによる行動認識のための自己監督学習のベンチマーク
Authors: Yize Cai, Rui Feng, Anlan Yu, Baoshen Guo, Zhiqing Hong,
Abstract要約: ウェアラブルセンサーからのヒューマンアクティビティ認識(HAR)は、幅広い医療や行動科学の応用をサポートする。視覚領域と言語領域における自己教師あり学習(SSL)の最近の進歩は、ラベルなしデータから一般化可能な表現を学習する強力な能力を示している。センサベースHARにおけるSSL手法の一般化能力を評価するための統合フレームワークであるBenchHARについて述べる。
参考スコア（独自算出の注目度）: 20.30670002928755
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Human Activity Recognition (HAR) from wearable sensors supports broad healthcare and behavior science applications. However, data heterogeneity and the scarcity of labeled data limit its real-world generalization. Recent advances in self-supervised learning (SSL) in vision and language domains have shown strong capability for learning generalizable representations from unlabeled data. Yet, few studies have systematically compared the generalization performance of SSL methods or explored how to adapt them for generalizable HAR. To address these gaps, we present BenchHAR, a unified framework for evaluating the generalization capability of SSL methods for sensor-based HAR on unseen target distributions. BenchHAR curates a large-scale dataset (~258K samples) and evaluates eight representative SSL methods across 12 encoder-classifier architectures. Our results reveal that existing SSL methods struggle to achieve satisfactory generalization performance. We find that: (1) For HAR models, the hybrid paradigm (combining reconstruction and contrastive pretraining) achieves the best overall performance. The CNN encoder exhibits the strongest ability to learn generalizable representations, while more expressive classifier architectures further improve generalization. (2) For data scale, increasing the amount of pretraining data from downstream activity classes consistently improves generalization, while adding more labeled data yields limited gains. Interestingly, incorporating unlabeled data from non-downstream activity classes does not improve generalization. (3) Sensor data collected from custom-grade devices generalizes better than that from research-grade devices, and data from limb transfers more effectively to trunk positions. BenchHAR provides a unified benchmark and actionable insights for generalizable sensor-based HAR systems. Our code is available at https://github.com/saiketa/HAR-Bench.
Abstract（参考訳）: ウェアラブルセンサーからのヒューマンアクティビティ認識(HAR)は、幅広い医療や行動科学の応用をサポートする。しかし、データの不均一性とラベル付きデータの不足は、実世界の一般化を制限する。視覚領域と言語領域における自己教師あり学習(SSL)の最近の進歩は、ラベルなしデータから一般化可能な表現を学習する強力な能力を示している。しかし、SSLメソッドの一般化性能を体系的に比較したり、一般化可能なHARに適応する方法を探ったりした研究はほとんどない。このようなギャップに対処するため,センサベースHARにおけるSSL手法の一般化能力を評価する統合フレームワークであるBenchHARを提案する。 BenchHARは大規模なデータセット(約258Kサンプル)をキュレートし、12のエンコーダ分類アーキテクチャにまたがる8つの代表的なSSLメソッドを評価する。以上の結果から,既存のSSLメソッドは,良好な一般化性能を達成するのに苦慮していることが明らかとなった。 1)HARモデルでは、ハイブリッドパラダイム(再構成とコントラスト事前学習を組み合わせた)が全体的な性能を最高のものにする。 CNNエンコーダは、一般化可能な表現を学習する最も強力な能力を示し、より表現力のある分類器アーキテクチャは一般化をさらに改善する。 2)データスケールでは,下流のアクティビティクラスからの事前学習データの増加が常に一般化を向上する一方で,ラベル付きデータの追加は限られたゲインを得る。興味深いことに、ダウンストリームでないアクティビティクラスからラベルなしのデータを取り込むことで、一般化は改善されない。 3) カスタムグレードデバイスから収集したセンサデータは,研究グレードデバイスより優れており,手足からのデータをトランク位置へ効率よく転送する。 BenchHARは、一般化可能なセンサーベースのHARシステムに対して、統一されたベンチマークと実行可能な洞察を提供する。私たちのコードはhttps://github.com/saiketa/HAR-Bench.comで公開されています。

論文の概要: BenchHAR: Benchmarking Self-Supervised Learning for Generalizable Sensor-based Activity Recognition

関連論文リスト