SEMv2: Table Separation Line Detection Based on Instance Segmentation
- URL: http://arxiv.org/abs/2303.04384v2
- Date: Fri, 12 Jan 2024 07:00:30 GMT
- Title: SEMv2: Table Separation Line Detection Based on Instance Segmentation
- Authors: Zhenrong Zhang, Pengfei Hu, Jiefeng Ma, Jun Du, Jianshu Zhang, Huihui
Zhu, Baocai Yin, Bing Yin and Cong Liu
- Abstract summary: We propose an accurate table structure recognizer, termed SEMv2 (SEM: Split, Embed and Merge)
We address the table separation line instance-level discrimination problem and introduce a table separation line detection strategy based on conditional convolution.
To comprehensively evaluate the SEMv2, we also present a more challenging dataset for table structure recognition, dubbed iFLYTAB.
- Score: 96.36188168694781
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Table structure recognition is an indispensable element for enabling machines
to comprehend tables. Its primary purpose is to identify the internal structure
of a table. Nevertheless, due to the complexity and diversity of their
structure and style, it is highly challenging to parse the tabular data into a
structured format that machines can comprehend. In this work, we adhere to the
principle of the split-and-merge based methods and propose an accurate table
structure recognizer, termed SEMv2 (SEM: Split, Embed and Merge). Unlike the
previous works in the ``split'' stage, we aim to address the table separation
line instance-level discrimination problem and introduce a table separation
line detection strategy based on conditional convolution. Specifically, we
design the ``split'' in a top-down manner that detects the table separation
line instance first and then dynamically predicts the table separation line
mask for each instance. The final table separation line shape can be accurately
obtained by processing the table separation line mask in a row-wise/column-wise
manner. To comprehensively evaluate the SEMv2, we also present a more
challenging dataset for table structure recognition, dubbed iFLYTAB, which
encompasses multiple style tables in various scenarios such as photos, scanned
documents, etc. Extensive experiments on publicly available datasets (e.g.
SciTSR, PubTabNet and iFLYTAB) demonstrate the efficacy of our proposed
approach. The code and iFLYTAB dataset are available at
https://github.com/ZZR8066/SEMv2.
Related papers
- 2D-TPE: Two-Dimensional Positional Encoding Enhances Table Understanding for Large Language Models [48.123582712115336]
Existing methods often flatten the two-dimensional (2D) table structure into a sequence of tokens.
2D-TPE effectively mitigates the risk of losing essential spatial information while preserving computational efficiency.
Extensive experiments across five benchmarks demonstrate that 2D-TPE outperforms strong baselines.
arXiv Detail & Related papers (2024-09-29T13:16:37Z) - SEMv3: A Fast and Robust Approach to Table Separation Line Detection [48.75713662571455]
Table structure recognition (TSR) aims to parse the inherent structure of a table from its input image.
"Split-and-merge" paradigm is a pivotal approach to parse table structure, where the table separation line detection is crucial.
We propose SEMv3 (SEM: Split, Embed and Merge), a method that is both fast and robust for detecting table separation lines.
arXiv Detail & Related papers (2024-05-20T08:13:46Z) - TRUST: An Accurate and End-to-End Table structure Recognizer Using
Splitting-based Transformers [56.56591337457137]
We propose an accurate and end-to-end transformer-based table structure recognition method, referred to as TRUST.
Transformers are suitable for table structure recognition because of their global computations, perfect memory, and parallel computation.
We conduct experiments on several popular benchmarks including PubTabNet and SynthTable, our method achieves new state-of-the-art results.
arXiv Detail & Related papers (2022-08-31T08:33:36Z) - Split, embed and merge: An accurate table structure recognizer [42.579215135672094]
We introduce Split, Embed and Merge (SEM) as an accurate table structure recognizer.
SEM can achieve an average F-Measure of $96.9%$ on the SciTSR dataset.
arXiv Detail & Related papers (2021-07-12T06:26:19Z) - TGRNet: A Table Graph Reconstruction Network for Table Structure
Recognition [76.06530816349763]
We propose an end-to-end trainable table graph reconstruction network (TGRNet) for table structure recognition.
Specifically, the proposed method has two main branches, a cell detection branch and a cell logical location branch, to jointly predict the spatial location and the logical location of different cells.
arXiv Detail & Related papers (2021-06-20T01:57:05Z) - Identifying Table Structure in Documents using Conditional Generative
Adversarial Networks [0.0]
In many industries and in academic research, information is primarily transmitted in the form of unstructured documents.
We propose a top-down approach, first using a conditional generative adversarial network to map a table image into a standardised skeleton' table form.
We then deriving latent table structure using xy-cut projection and Genetic Algorithm optimisation.
arXiv Detail & Related papers (2020-01-13T20:42:40Z) - Table Structure Extraction with Bi-directional Gated Recurrent Unit
Networks [5.350788087718877]
This paper proposes a robust deep learning based approach to extract rows and columns from a detected table in document images with a high precision.
We have benchmarked our system on publicly available UNLV as well as ICDAR 2013 datasets on which it outperformed the state-of-the-art table structure extraction systems by a significant margin.
arXiv Detail & Related papers (2020-01-08T13:17:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.