Tabular Structure Detection from Document Images for Resource
Constrained Devices Using A Row Based Similarity Measure
- URL: http://arxiv.org/abs/2008.11842v1
- Date: Wed, 26 Aug 2020 21:59:27 GMT
- Title: Tabular Structure Detection from Document Images for Resource
Constrained Devices Using A Row Based Similarity Measure
- Authors: Soumyadeep Dey, Jayanta Mukhopadhyay, Shamik Sural
- Abstract summary: Tabular structures are used to present crucial information in a structured and crisp manner.
Most of the existing techniques detect tables from a document image by using prior knowledge of the structures of the tables.
- Score: 0.9814898713780167
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Tabular structures are used to present crucial information in a structured
and crisp manner. Detection of such regions is of great importance for proper
understanding of a document. Tabular structures can be of various layouts and
types. Therefore, detection of these regions is a hard problem. Most of the
existing techniques detect tables from a document image by using prior
knowledge of the structures of the tables. However, these methods are not
applicable for generalized tabular structures. In this work, we propose a
similarity measure to find similarities between pairs of rows in a tabular
structure. This similarity measure is utilized to identify a tabular region.
Since the tabular regions are detected exploiting the similarities among all
rows, the method is inherently independent of layouts of the tabular regions
present in the training data. Moreover, the proposed similarity measure can be
used to identify tabular regions without using large sets of parameters
associated with recent deep learning based methods. Thus, the proposed method
can easily be used with resource constrained devices such as mobile devices
without much of an overhead.
Related papers
- A Closer Look at Deep Learning on Tabular Data [52.50778536274327]
Tabular data is prevalent across various domains in machine learning.
Deep Neural Network (DNN)-based methods have shown promising performance comparable to tree-based ones.
arXiv Detail & Related papers (2024-07-01T04:24:07Z) - SEMv2: Table Separation Line Detection Based on Instance Segmentation [96.36188168694781]
We propose an accurate table structure recognizer, termed SEMv2 (SEM: Split, Embed and Merge)
We address the table separation line instance-level discrimination problem and introduce a table separation line detection strategy based on conditional convolution.
To comprehensively evaluate the SEMv2, we also present a more challenging dataset for table structure recognition, dubbed iFLYTAB.
arXiv Detail & Related papers (2023-03-08T05:15:01Z) - TRUST: An Accurate and End-to-End Table structure Recognizer Using
Splitting-based Transformers [56.56591337457137]
We propose an accurate and end-to-end transformer-based table structure recognition method, referred to as TRUST.
Transformers are suitable for table structure recognition because of their global computations, perfect memory, and parallel computation.
We conduct experiments on several popular benchmarks including PubTabNet and SynthTable, our method achieves new state-of-the-art results.
arXiv Detail & Related papers (2022-08-31T08:33:36Z) - Evaluating Table Structure Recognition: A New Perspective [2.1067139116005595]
Existing metrics used to evaluate table structure recognition algorithms have shortcomings with regard to capturing text and empty cells alignment.
In this paper, we propose a new metric - TEDS based IOU similarity (TEDS (IOU)) for table structure recognition which uses bounding boxes instead of text while simultaneously being robust against the above disadvantages.
arXiv Detail & Related papers (2022-07-31T07:48:36Z) - Split, embed and merge: An accurate table structure recognizer [42.579215135672094]
We introduce Split, Embed and Merge (SEM) as an accurate table structure recognizer.
SEM can achieve an average F-Measure of $96.9%$ on the SciTSR dataset.
arXiv Detail & Related papers (2021-07-12T06:26:19Z) - TGRNet: A Table Graph Reconstruction Network for Table Structure
Recognition [76.06530816349763]
We propose an end-to-end trainable table graph reconstruction network (TGRNet) for table structure recognition.
Specifically, the proposed method has two main branches, a cell detection branch and a cell logical location branch, to jointly predict the spatial location and the logical location of different cells.
arXiv Detail & Related papers (2021-06-20T01:57:05Z) - Multi-Type-TD-TSR -- Extracting Tables from Document Images using a
Multi-stage Pipeline for Table Detection and Table Structure Recognition:
from OCR to Structured Table Representations [63.98463053292982]
The recognition of tables consists of two main tasks, namely table detection and table structure recognition.
Recent work shows a clear trend towards deep learning approaches coupled with the use of transfer learning for the task of table structure recognition.
We present a multistage pipeline named Multi-Type-TD-TSR, which offers an end-to-end solution for the problem of table recognition.
arXiv Detail & Related papers (2021-05-23T21:17:18Z) - Table Structure Recognition using Top-Down and Bottom-Up Cues [28.65687982486627]
We present an approach for table structure recognition that combines cell detection and interaction modules.
We empirically validate our method on the publicly available real-world datasets.
arXiv Detail & Related papers (2020-10-09T13:32:53Z) - A Comparative Study on Structural and Semantic Properties of Sentence
Embeddings [77.34726150561087]
We propose a set of experiments using a widely-used large-scale data set for relation extraction.
We show that different embedding spaces have different degrees of strength for the structural and semantic properties.
These results provide useful information for developing embedding-based relation extraction methods.
arXiv Detail & Related papers (2020-09-23T15:45:32Z) - Identifying Table Structure in Documents using Conditional Generative
Adversarial Networks [0.0]
In many industries and in academic research, information is primarily transmitted in the form of unstructured documents.
We propose a top-down approach, first using a conditional generative adversarial network to map a table image into a standardised skeleton' table form.
We then deriving latent table structure using xy-cut projection and Genetic Algorithm optimisation.
arXiv Detail & Related papers (2020-01-13T20:42:40Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.