Efficient Information Sharing in ICT Supply Chain Social Network via
Table Structure Recognition
- URL: http://arxiv.org/abs/2211.02128v1
- Date: Thu, 3 Nov 2022 20:03:07 GMT
- Title: Efficient Information Sharing in ICT Supply Chain Social Network via
Table Structure Recognition
- Authors: Bin Xiao, Yakup Akkaya, Murat Simsek, Burak Kantarci, Ala Abu Alkheir
- Abstract summary: Table Structure Recognition (TSR) aims to represent tables with complex structures in a machine-interpretable format.
We implement our proposed method based on Faster-RCNN and achieve 94.79% on mean Average Precision (AP)
- Score: 12.79419287446918
- License: http://creativecommons.org/publicdomain/zero/1.0/
- Abstract: The global Information and Communications Technology (ICT) supply chain is a
complex network consisting of all types of participants. It is often formulated
as a Social Network to discuss the supply chain network's relations,
properties, and development in supply chain management. Information sharing
plays a crucial role in improving the efficiency of the supply chain, and
datasheets are the most common data format to describe e-component commodities
in the ICT supply chain because of human readability. However, with the surging
number of electronic documents, it has been far beyond the capacity of human
readers, and it is also challenging to process tabular data automatically
because of the complex table structures and heterogeneous layouts. Table
Structure Recognition (TSR) aims to represent tables with complex structures in
a machine-interpretable format so that the tabular data can be processed
automatically. In this paper, we formulate TSR as an object detection problem
and propose to generate an intuitive representation of a complex table
structure to enable structuring of the tabular data related to the commodities.
To cope with border-less and small layouts, we propose a cost-sensitive loss
function by considering the detection difficulty of each class. Besides, we
propose a novel anchor generation method using the character of tables that
columns in a table should share an identical height, and rows in a table should
share the same width. We implement our proposed method based on Faster-RCNN and
achieve 94.79% on mean Average Precision (AP), and consistently improve more
than 1.5% AP for different benchmark models.
Related papers
- A Survey on Deep Tabular Learning [0.0]
Tabular data presents unique challenges for deep learning due to its heterogeneous nature and lack of spatial structure.
This survey reviews the evolution of deep learning models for Tabular data, from early fully connected networks (FCNs) to advanced architectures like TabNet, SAINT, TabTranSELU, and MambaNet.
arXiv Detail & Related papers (2024-10-15T20:08:08Z) - TC-OCR: TableCraft OCR for Efficient Detection & Recognition of Table Structure & Content [39.34067105360439]
We propose an end-to-end pipeline that integrates deep learning models, including DETR, CascadeTabNet, and PP OCR v2, to achieve comprehensive image-based table recognition.
Our system achieves simultaneous table detection (TD), table structure recognition (TSR), and table content recognition (TCR)
Our proposed approach achieves an IOU of 0.96 and an OCR Accuracy of 78%, showcasing a remarkable improvement of approximately 25% in the OCR Accuracy compared to the previous Table Transformer approach.
arXiv Detail & Related papers (2024-04-16T06:24:53Z) - Zero-shot Composed Text-Image Retrieval [72.43790281036584]
We consider the problem of composed image retrieval (CIR)
It aims to train a model that can fuse multi-modal information, e.g., text and images, to accurately retrieve images that match the query, extending the user's expression ability.
arXiv Detail & Related papers (2023-06-12T17:56:01Z) - TRUST: An Accurate and End-to-End Table structure Recognizer Using
Splitting-based Transformers [56.56591337457137]
We propose an accurate and end-to-end transformer-based table structure recognition method, referred to as TRUST.
Transformers are suitable for table structure recognition because of their global computations, perfect memory, and parallel computation.
We conduct experiments on several popular benchmarks including PubTabNet and SynthTable, our method achieves new state-of-the-art results.
arXiv Detail & Related papers (2022-08-31T08:33:36Z) - Handling big tabular data of ICT supply chains: a multi-task,
machine-interpretable approach [13.976736586808308]
We define a Table Structure Recognition (TSR) task and a Table Cell Type Classification (CTC) task.
Our proposed method can outperform state-of-the-art methods on ICDAR2013 and UNLV datasets.
arXiv Detail & Related papers (2022-08-11T20:29:45Z) - Split, embed and merge: An accurate table structure recognizer [42.579215135672094]
We introduce Split, Embed and Merge (SEM) as an accurate table structure recognizer.
SEM can achieve an average F-Measure of $96.9%$ on the SciTSR dataset.
arXiv Detail & Related papers (2021-07-12T06:26:19Z) - TGRNet: A Table Graph Reconstruction Network for Table Structure
Recognition [76.06530816349763]
We propose an end-to-end trainable table graph reconstruction network (TGRNet) for table structure recognition.
Specifically, the proposed method has two main branches, a cell detection branch and a cell logical location branch, to jointly predict the spatial location and the logical location of different cells.
arXiv Detail & Related papers (2021-06-20T01:57:05Z) - Multi-Type-TD-TSR -- Extracting Tables from Document Images using a
Multi-stage Pipeline for Table Detection and Table Structure Recognition:
from OCR to Structured Table Representations [63.98463053292982]
The recognition of tables consists of two main tasks, namely table detection and table structure recognition.
Recent work shows a clear trend towards deep learning approaches coupled with the use of transfer learning for the task of table structure recognition.
We present a multistage pipeline named Multi-Type-TD-TSR, which offers an end-to-end solution for the problem of table recognition.
arXiv Detail & Related papers (2021-05-23T21:17:18Z) - TCN: Table Convolutional Network for Web Table Interpretation [52.32515851633981]
We propose a novel table representation learning approach considering both the intra- and inter-table contextual information.
Our method can outperform competitive baselines by +4.8% of F1 for column type prediction and by +4.1% of F1 for column pairwise relation prediction.
arXiv Detail & Related papers (2021-02-17T02:18:10Z) - A Graph Representation of Semi-structured Data for Web Question
Answering [96.46484690047491]
We propose a novel graph representation of Web tables and lists based on a systematic categorization of the components in semi-structured data as well as their relations.
Our method improves F1 score by 3.90 points over the state-of-the-art baselines.
arXiv Detail & Related papers (2020-10-14T04:01:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.