Fugu-MT 論文翻訳(概要): STRADAViT: Towards a Foundational Model for Radio Astronomy through Self-Supervised Transfer

論文の概要: STRADAViT: Towards a Foundational Model for Radio Astronomy through Self-Supervised Transfer

arxiv url: http://arxiv.org/abs/2603.29660v3
Date: Wed, 08 Apr 2026 09:44:15 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-09 14:06:04.895727
Title: STRADAViT: Towards a Foundational Model for Radio Astronomy through Self-Supervised Transfer
Title（参考訳）: STRADAViT:自己監督移動による電波天文学の基礎モデルを目指して
Authors: Andrea DeMarco, Ian Fenech Conti, Hayley Camilleri, Ardiana Bushi, Simone Riggi,
Abstract要約: STRADAViTは、無線天文学の画像から転送可能なエンコーダを学習するための、自己監督型ビジョントランスフォーマーの継続的な事前訓練フレームワークである。復元のみ、コントラストのみ、および2段階のブランチをサポートする。線形探索と微調整により,2次および複数クラス設定にまたがる3つの形態素ベンチマークの転送を評価する。
参考スコア（独自算出の注目度）: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Next-generation radio astronomy surveys are delivering millions of resolved sources, but robust and scalable morphology analysis remains difficult across heterogeneous telescopes and imaging pipelines. We present STRADAViT, a self-supervised Vision Transformer continued-pretraining framework for learning transferable encoders from radio astronomy imagery. The framework combines mixed-survey data curation, radio astronomy-aware training-view generation, and a ViT-MAE-initialized encoder family with optional register tokens. It supports reconstruction-only, contrastive-only, and two-stage branches. Our pretraining dataset comprises radio astronomy cutouts drawn from four complementary sources. We evaluate transfer with linear probing and fine-tuning on three morphology benchmarks spanning binary and multi-class settings. Relative to the ViT-MAE initialization used for continued pretraining, the best two-stage models improve Macro-F1 in all reported linear-probe settings and in two of three fine-tuning settings, with the largest gain on RGZ DR1. Relative to DINOv2, gains are selective rather than universal: the best two-stage models achieve higher mean Macro-F1 than the strongest DINOv2 baseline on LoTSS DR2 and RGZ DR1 under linear probing, and on MiraBest and RGZ DR1 under fine-tuning. A targeted DINOv2 initialization ablation further indicates that the adaptation recipe is not specific to the ViT-MAE starting point and that, under the same recipe. The ViT-MAE-based STRADAViT checkpoint is retained as the released checkpoint because it combines competitive transfer with substantially lower token count and downstream cost than the DINOv2-based alternative. These results indicate that radio astronomy-aware view generation and staged continued pretraining can provide a stronger domain-adapted starting point than off-the-shelf ViT checkpoints for radio astronomy transfer.
Abstract（参考訳）: 次世代の電波天文学調査は、何百万もの解決された情報源を提供しているが、頑丈でスケーラブルな形態解析は、異種望遠鏡や撮像パイプラインで難しいままである。本稿では、無線天文学画像から転送可能なエンコーダを学習するための自己教師型ビジョントランスフォーマーであるSTRADAViTについて述べる。このフレームワークは、混合サーベイデータキュレーション、無線天文学対応のトレーニングビュー生成、ViT-MAE初期化エンコーダファミリーとオプションレジスタトークンを組み合わせる。復元のみ、コントラストのみ、および2段階のブランチをサポートする。予備訓練データセットは4つの補完源から抽出された電波天文学のカットアウトからなる。線形探索と微調整により,2次および複数クラス設定にまたがる3つの形態素ベンチマークの転送を評価する。 ViT-MAEの初期化と異なり、最も優れた2段階のモデルでは、すべての報告されたリニアプローブ設定と3つの微調整設定でマクロF1が改善され、RGZ DR1で最大の利益を得た。 DINOv2とは対照的に、ゲインは普遍性よりも選択的であり、最高の2段階モデルでは、リニアプローブ下でのLOTSS DR2とRGZ DR1、微調整時にMiraBestとRGZ DR1において最強のDINOv2ベースラインよりも高い平均のマクロ-F1を達成する。標的とするDINOv2初期化アブレーションは、適応レシピがViT-MAE開始点に特異的でないことを示し、同じレシピの下ではそれを示す。 ViT-MAEベースのSTRADAViTチェックポイントは、競合転送と、DINOv2ベースの代替機よりもかなり低いトークン数とダウンストリームコストを組み合わせているため、リリースされたチェックポイントとして保持されている。これらの結果から, 電波天文学を意識した視線生成と連続事前訓練により, 既設のVTチェックポイントよりも強い領域適応開始点が得られることが示唆された。

論文の概要: STRADAViT: Towards a Foundational Model for Radio Astronomy through Self-Supervised Transfer

関連論文リスト