TableFormer: Table Structure Understanding with Transformers
- URL: http://arxiv.org/abs/2203.01017v1
- Date: Wed, 2 Mar 2022 10:46:24 GMT
- Title: TableFormer: Table Structure Understanding with Transformers
- Authors: Ahmed Nassar, Nikolaos Livathinos, Maksym Lysak, Peter Staar
- Abstract summary: We present a new table-structure identification model.
New object detection decoder for table-cells.
Second, we replace the LSTM decoders with transformer based decoders.
- Score: 2.121963121603413
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Tables organize valuable content in a concise and compact representation.
This content is extremely valuable for systems such as search engines,
Knowledge Graph's, etc, since they enhance their predictive capabilities.
Unfortunately, tables come in a large variety of shapes and sizes. Furthermore,
they can have complex column/row-header configurations, multiline rows,
different variety of separation lines, missing entries, etc. As such, the
correct identification of the table-structure from an image is a non-trivial
task. In this paper, we present a new table-structure identification model. The
latter improves the latest end-to-end deep learning model (i.e.
encoder-dual-decoder from PubTabNet) in two significant ways. First, we
introduce a new object detection decoder for table-cells. In this way, we can
obtain the content of the table-cells from programmatic PDF's directly from the
PDF source and avoid the training of the custom OCR decoders. This
architectural change leads to more accurate table-content extraction and allows
us to tackle non-english tables. Second, we replace the LSTM decoders with
transformer based decoders. This upgrade improves significantly the previous
state-of-the-art tree-editing-distance-score (TEDS) from 91% to 98.5% on simple
tables and from 88.7% to 95% on complex tables.
Related papers
- SEMv2: Table Separation Line Detection Based on Instance Segmentation [96.36188168694781]
We propose an accurate table structure recognizer, termed SEMv2 (SEM: Split, Embed and Merge)
We address the table separation line instance-level discrimination problem and introduce a table separation line detection strategy based on conditional convolution.
To comprehensively evaluate the SEMv2, we also present a more challenging dataset for table structure recognition, dubbed iFLYTAB.
arXiv Detail & Related papers (2023-03-08T05:15:01Z) - TRUST: An Accurate and End-to-End Table structure Recognizer Using
Splitting-based Transformers [56.56591337457137]
We propose an accurate and end-to-end transformer-based table structure recognition method, referred to as TRUST.
Transformers are suitable for table structure recognition because of their global computations, perfect memory, and parallel computation.
We conduct experiments on several popular benchmarks including PubTabNet and SynthTable, our method achieves new state-of-the-art results.
arXiv Detail & Related papers (2022-08-31T08:33:36Z) - Table Retrieval May Not Necessitate Table-specific Model Design [83.27735758203089]
We focus on the task of table retrieval, and ask: "is table-specific model design necessary for table retrieval?"
Based on an analysis on a table-based portion of the Natural Questions dataset (NQ-table), we find that structure plays a negligible role in more than 70% of the cases.
We then experiment with three modules to explicitly encode table structures, namely auxiliary row/column embeddings, hard attention masks, and soft relation-based attention biases.
None of these yielded significant improvements, suggesting that table-specific model design may not be necessary for table retrieval.
arXiv Detail & Related papers (2022-05-19T20:35:23Z) - TableParser: Automatic Table Parsing with Weak Supervision from
Spreadsheets [5.5347995556789105]
We devise a system capable of parsing tables in both native PDFs and scanned images with high precision.
We also create TableAnnotator and ExcelAnnotator, which constitute a spreadsheet-based weak supervision mechanism.
arXiv Detail & Related papers (2022-01-05T15:21:06Z) - MATE: Multi-view Attention for Table Transformer Efficiency [21.547074431324024]
More than 20% of relational tables on the web have 20 or more rows.
Current Transformer models are typically limited to 512 tokens.
We propose MATE, a novel Transformer architecture designed to model the structure of web tables.
arXiv Detail & Related papers (2021-09-09T14:39:30Z) - Split, embed and merge: An accurate table structure recognizer [42.579215135672094]
We introduce Split, Embed and Merge (SEM) as an accurate table structure recognizer.
SEM can achieve an average F-Measure of $96.9%$ on the SciTSR dataset.
arXiv Detail & Related papers (2021-07-12T06:26:19Z) - TGRNet: A Table Graph Reconstruction Network for Table Structure
Recognition [76.06530816349763]
We propose an end-to-end trainable table graph reconstruction network (TGRNet) for table structure recognition.
Specifically, the proposed method has two main branches, a cell detection branch and a cell logical location branch, to jointly predict the spatial location and the logical location of different cells.
arXiv Detail & Related papers (2021-06-20T01:57:05Z) - Multi-Type-TD-TSR -- Extracting Tables from Document Images using a
Multi-stage Pipeline for Table Detection and Table Structure Recognition:
from OCR to Structured Table Representations [63.98463053292982]
The recognition of tables consists of two main tasks, namely table detection and table structure recognition.
Recent work shows a clear trend towards deep learning approaches coupled with the use of transfer learning for the task of table structure recognition.
We present a multistage pipeline named Multi-Type-TD-TSR, which offers an end-to-end solution for the problem of table recognition.
arXiv Detail & Related papers (2021-05-23T21:17:18Z) - A Graph Representation of Semi-structured Data for Web Question
Answering [96.46484690047491]
We propose a novel graph representation of Web tables and lists based on a systematic categorization of the components in semi-structured data as well as their relations.
Our method improves F1 score by 3.90 points over the state-of-the-art baselines.
arXiv Detail & Related papers (2020-10-14T04:01:54Z) - Identifying Table Structure in Documents using Conditional Generative
Adversarial Networks [0.0]
In many industries and in academic research, information is primarily transmitted in the form of unstructured documents.
We propose a top-down approach, first using a conditional generative adversarial network to map a table image into a standardised skeleton' table form.
We then deriving latent table structure using xy-cut projection and Genetic Algorithm optimisation.
arXiv Detail & Related papers (2020-01-13T20:42:40Z) - Table Structure Extraction with Bi-directional Gated Recurrent Unit
Networks [5.350788087718877]
This paper proposes a robust deep learning based approach to extract rows and columns from a detected table in document images with a high precision.
We have benchmarked our system on publicly available UNLV as well as ICDAR 2013 datasets on which it outperformed the state-of-the-art table structure extraction systems by a significant margin.
arXiv Detail & Related papers (2020-01-08T13:17:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.