Fugu-MT 論文翻訳(概要): SEMv2: Table Separation Line Detection Based on Instance Segmentation

論文の概要: SEMv2: Table Separation Line Detection Based on Instance Segmentation

arxiv url: http://arxiv.org/abs/2303.04384v2
Date: Fri, 12 Jan 2024 07:00:30 GMT
ステータス: 翻訳完了
システム内更新日: 2024-01-16 00:32:38.707390
Title: SEMv2: Table Separation Line Detection Based on Instance Segmentation
Title（参考訳）: SEMv2:インスタンスセグメンテーションに基づくテーブル分離線検出
Authors: Zhenrong Zhang, Pengfei Hu, Jiefeng Ma, Jun Du, Jianshu Zhang, Huihui Zhu, Baocai Yin, Bing Yin and Cong Liu
Abstract要約: SEMv2(SEM: Split, Embed, Merge)と呼ばれるテーブル構造認識器を提案する。本稿では,テーブル分離ラインのインスタンスレベルの識別問題に対処し,条件付き畳み込みに基づくテーブル分離ライン検出戦略を提案する。 SEMv2を包括的に評価するために、iFLYTABと呼ばれるテーブル構造認識のためのより困難なデータセットも提示する。
参考スコア（独自算出の注目度）: 96.36188168694781
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Table structure recognition is an indispensable element for enabling machines to comprehend tables. Its primary purpose is to identify the internal structure of a table. Nevertheless, due to the complexity and diversity of their structure and style, it is highly challenging to parse the tabular data into a structured format that machines can comprehend. In this work, we adhere to the principle of the split-and-merge based methods and propose an accurate table structure recognizer, termed SEMv2 (SEM: Split, Embed and Merge). Unlike the previous works in the ``split'' stage, we aim to address the table separation line instance-level discrimination problem and introduce a table separation line detection strategy based on conditional convolution. Specifically, we design the ``split'' in a top-down manner that detects the table separation line instance first and then dynamically predicts the table separation line mask for each instance. The final table separation line shape can be accurately obtained by processing the table separation line mask in a row-wise/column-wise manner. To comprehensively evaluate the SEMv2, we also present a more challenging dataset for table structure recognition, dubbed iFLYTAB, which encompasses multiple style tables in various scenarios such as photos, scanned documents, etc. Extensive experiments on publicly available datasets (e.g. SciTSR, PubTabNet and iFLYTAB) demonstrate the efficacy of our proposed approach. The code and iFLYTAB dataset are available at https://github.com/ZZR8066/SEMv2.
Abstract（参考訳）: テーブル構造認識は、機械がテーブルを理解するために欠かせない要素である。その主な目的はテーブルの内部構造を特定することである。それでも、その構造とスタイルの複雑さと多様性のため、表形式のデータを機械が理解できる構造化形式に解析することは極めて困難である。本研究では,スプリット・アンド・マージ方式の原理に従い,semv2 (sem: split, embedded and merge) と呼ばれる正確な表構造認識器を提案する。従来の「スプリット」段階とは違って、テーブル分離ラインのインスタンスレベルの識別問題に対処し、条件付き畳み込みに基づくテーブル分離ライン検出戦略を導入することを目指している。具体的には、``split''をトップダウンで設計し、まずテーブル分離ラインインスタンスを検出し、次に各インスタンスのテーブル分離ラインマスクを動的に予測する。テーブル分離線マスクを行方向/列方向に加工することにより、最終テーブル分離線形状を正確に得ることができる。また,semv2を包括的に評価するために,iflytabと呼ばれるテーブル構造認識のためのより難解なデータセットを提案する。公開データセット(SciTSR、PubTabNet、iFLYTABなど)に関する大規模な実験は、提案手法の有効性を実証している。コードとiFLYTABデータセットはhttps://github.com/ZZR8066/SEMv2で公開されている。

論文の概要: SEMv2: Table Separation Line Detection Based on Instance Segmentation

関連論文リスト