Fugu-MT 論文翻訳(概要): Beyond Masks: The Case for Medical Image Parsing

論文の概要: Beyond Masks: The Case for Medical Image Parsing

arxiv url: http://arxiv.org/abs/2605.11438v1
Date: Tue, 12 May 2026 02:47:53 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-13 21:48:56.535018
Title: Beyond Masks: The Case for Medical Image Parsing
Title（参考訳）: マスクを超えて:医療画像解析の事例
Authors: Siddharth Gupta, Alan L. Yuille, Zongwei Zhou,
Abstract要約: 医用画像研究は、医用画像解析を中心的出力とするべきであると論じる。属性は、それらのエンティティを記述し、マージンの規則性、エンハンスメントパターン、グレードなどのものをキャプチャする。このような出力を生成するためのフィールドがどの程度近いかをテストするために、3つのパースプリミティブとクロージャに対して11の代表的なシステムを監査する。
参考スコア（独自算出の注目度）: 55.19291862464811
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Medical imaging research has spent a decade getting very good at one thing: producing per-voxel masks. Masks tell us size, volume, and location, and a decade of clinical infrastructure rests on those outputs. Yet the report a radiologist writes contains almost nothing a mask can express. We argue that medical imaging research should adopt medical image parsing as its central output: a structured representation in which entities, attributes, and relationships are emitted together and mutually consistent. Entities are the named structures and findings, present or absent. Attributes describe those entities, capturing things like margin regularity, enhancement pattern, or severity grade. Relationships connect them, naming where one structure sits relative to another, what abuts what, and what has changed since the prior scan. A good parse satisfies three properties, in order: (1) decision (the parse names the right things in the current image), (2) reconstruction (its content is rich enough to regenerate that image), and (3) prediction (its content is rich enough to forecast how the patient state will evolve). Quantitative measurements are derived from this content; they are not predicted alongside it. To test how close the field is to producing such an output, we audit eleven representative systems against the three parsing primitives plus closure. None emits a well-formed parse. Entities are largely solved. Attributes, relationships, and closure remain near-empty. The path forward is not a new architecture. It is a commitment to a richer output, and to training signals that reward it. Segmentation taught models to measure. Parsing asks them to explain.
Abstract（参考訳）: 医療画像研究は10年もの間、1ボクセル当たりのマスクの製作に長けてきました。マスクはサイズ、体積、位置を教えてくれ、臨床インフラの10年はそれらの出力に依存している。しかし、放射線学者が書いた報告書には、マスクが表現できるものはほとんどない。我々は、医用画像解析を中心的な出力として、実体、属性、関係が互いに一致して放出される構造的表現として、医療画像解析を採用するべきであると論じている。エンティティは、現在または欠落している名前の付いた構造と発見である。属性はそれらのエンティティを記述し、マージンの規則性、強化パターン、重大度等をキャプチャする。関係はそれらを結び付け、ある構造が他の構造と相対的に位置し、何と何に似ていて、前回のスキャンで何が変わったかを指定する。良いパースは、(1)決定(パースは現在の画像に正しいものを命名する)、(2)再構成(イメージを再生するのに十分な内容)、(3)予測(患者の状態がどのように進化するかを予測するのに十分な内容)の3つの特性を満たす。定量的な測定は、この内容から導かれる。このような出力を生成するためのフィールドがどの程度近いかをテストするために、3つのパースプリミティブとクロージャに対して11の代表的なシステムを監査する。整形されたパースを出力する人はいません。エンティティは大部分が解決されている。属性、関係、閉鎖は、ほとんど空白のままである。前進する道は新しいアーキテクチャではありません。それは、よりリッチなアウトプットへのコミットメントであり、それに報いる信号のトレーニングです。セグメンテーションは測定するモデルを教えた。パーシングは彼らに説明を求める。

論文の概要: Beyond Masks: The Case for Medical Image Parsing

関連論文リスト