Web Table Classification based on Visual Features
- URL: http://arxiv.org/abs/2103.05110v1
- Date: Thu, 25 Feb 2021 07:39:19 GMT
- Title: Web Table Classification based on Visual Features
- Authors: Babette B\"uhler and Heiko Paulheim
- Abstract summary: We propose an approach for web table classification by exploiting the full visual appearance of a table.
The evaluation of CNN image classification with fine tuned ResNet50 shows that this approach achieves results comparable to previous solutions.
- Score: 1.52292571922932
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Tables on the web constitute a valuable data source for many applications,
like factual search and knowledge base augmentation. However, as genuine tables
containing relational knowledge only account for a small proportion of tables
on the web, reliable genuine web table classification is a crucial first step
of table extraction. Previous works usually rely on explicit feature
construction from the HTML code. In contrast, we propose an approach for web
table classification by exploiting the full visual appearance of a table, which
works purely by applying a convolutional neural network on the rendered image
of the web table. Since these visual features can be extracted automatically,
our approach circumvents the need for explicit feature construction. A new hand
labeled gold standard dataset containing HTML source code and images for 13,112
tables was generated for this task. Transfer learning techniques are applied to
well known VGG16 and ResNet50 architectures. The evaluation of CNN image
classification with fine tuned ResNet50 (F1 93.29%) shows that this approach
achieves results comparable to previous solutions using explicitly defined HTML
code based features. By combining visual and explicit features, an F-measure of
93.70% can be achieved by Random Forest classification, which beats current
state of the art methods.
Related papers
- UniTabNet: Bridging Vision and Language Models for Enhanced Table Structure Recognition [55.153629718464565]
We introduce UniTabNet, a novel framework for table structure parsing based on the image-to-text model.
UniTabNet employs a divide-and-conquer'' strategy, utilizing an image-to-text model to decouple table cells and integrating both physical and logical decoders to reconstruct the complete table structure.
arXiv Detail & Related papers (2024-09-20T01:26:32Z) - TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy [81.76462101465354]
We present a novel large vision-hugging model, TabPedia, equipped with a concept synergy mechanism.
This unified framework allows TabPedia to seamlessly integrate VTU tasks, such as table detection, table structure recognition, table querying, and table question answering.
To better evaluate the VTU task in real-world scenarios, we establish a new and comprehensive table VQA benchmark, ComTQA.
arXiv Detail & Related papers (2024-06-03T13:54:05Z) - Rethinking Image-based Table Recognition Using Weakly Supervised Methods [3.9993134366218857]
We propose a weakly supervised model named WSTabNet for table recognition that relies only on HTML (or) code-level annotations of table images.
To facilitate table recognition with deep learning, we create and release WikiTableSet, the largest publicly available table image-based dataset built from Wikipedia.
arXiv Detail & Related papers (2023-03-14T06:03:57Z) - Data augmentation on graphs for table type classification [1.1859913430860336]
We address the classification of tables using a Graph Neural Network, exploiting the table structure for the message passing algorithm in use.
We achieve promising preliminary results, proposing a data augmentation method suitable for graph-based table representation.
arXiv Detail & Related papers (2022-08-23T21:54:46Z) - Generating Table Vector Representations [11.092714216647245]
This paper is an evaluation of methods for table-to-class annotation.
We provide a formal definition for table classification as a machine learning task.
arXiv Detail & Related papers (2021-10-28T14:05:21Z) - TGRNet: A Table Graph Reconstruction Network for Table Structure
Recognition [76.06530816349763]
We propose an end-to-end trainable table graph reconstruction network (TGRNet) for table structure recognition.
Specifically, the proposed method has two main branches, a cell detection branch and a cell logical location branch, to jointly predict the spatial location and the logical location of different cells.
arXiv Detail & Related papers (2021-06-20T01:57:05Z) - Multi-Type-TD-TSR -- Extracting Tables from Document Images using a
Multi-stage Pipeline for Table Detection and Table Structure Recognition:
from OCR to Structured Table Representations [63.98463053292982]
The recognition of tables consists of two main tasks, namely table detection and table structure recognition.
Recent work shows a clear trend towards deep learning approaches coupled with the use of transfer learning for the task of table structure recognition.
We present a multistage pipeline named Multi-Type-TD-TSR, which offers an end-to-end solution for the problem of table recognition.
arXiv Detail & Related papers (2021-05-23T21:17:18Z) - TCN: Table Convolutional Network for Web Table Interpretation [52.32515851633981]
We propose a novel table representation learning approach considering both the intra- and inter-table contextual information.
Our method can outperform competitive baselines by +4.8% of F1 for column type prediction and by +4.1% of F1 for column pairwise relation prediction.
arXiv Detail & Related papers (2021-02-17T02:18:10Z) - A Graph Representation of Semi-structured Data for Web Question
Answering [96.46484690047491]
We propose a novel graph representation of Web tables and lists based on a systematic categorization of the components in semi-structured data as well as their relations.
Our method improves F1 score by 3.90 points over the state-of-the-art baselines.
arXiv Detail & Related papers (2020-10-14T04:01:54Z) - TURL: Table Understanding through Representation Learning [29.6016859927782]
TURL is a novel framework that introduces the pre-training/finetuning paradigm to relational Web tables.
During pre-training, our framework learns deep contextualized representations on relational tables in an unsupervised manner.
We show that TURL generalizes well to all tasks and substantially outperforms existing methods in almost all instances.
arXiv Detail & Related papers (2020-06-26T05:44:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.