TabEAno: Table to Knowledge Graph Entity Annotation
- URL: http://arxiv.org/abs/2010.01829v1
- Date: Mon, 5 Oct 2020 07:39:02 GMT
- Title: TabEAno: Table to Knowledge Graph Entity Annotation
- Authors: Phuc Nguyen and Natthawut Kertkeidkachorn and Ryutaro Ichise and
Hideaki Takeda
- Abstract summary: We propose a novel approach, namely TabEAno, to semantically annotate table rows toward knowledge graph entities.
We introduce a "two-cells" lookup strategy bases on the assumption that there is an existing logical relation occurring in the knowledge graph between the two closed cells in the same row of the table.
Despite the simplicity of the approach, TabEAno outperforms the state of the art approaches in the two standard datasets.
- Score: 7.451544182579802
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In the Open Data era, a large number of table resources have been made
available on the Web and data portals. However, it is difficult to directly
utilize such data due to the ambiguity of entities, name variations,
heterogeneous schema, missing, or incomplete metadata. To address these issues,
we propose a novel approach, namely TabEAno, to semantically annotate table
rows toward knowledge graph entities. Specifically, we introduce a "two-cells"
lookup strategy bases on the assumption that there is an existing logical
relation occurring in the knowledge graph between the two closed cells in the
same row of the table. Despite the simplicity of the approach, TabEAno
outperforms the state of the art approaches in the two standard datasets e.g,
T2D, Limaye with, and in the large-scale Wikipedia tables dataset.
Related papers
- TRACE: Table Reconstruction Aligned to Corner and Edges [7.536220920052911]
We analyze the natural characteristics of a table, where a table is composed of cells and each cell is made up of borders consisting of edges.
We propose a novel method to reconstruct the table in a bottom-up manner.
A simple design makes the model easier to train and requires less computation than previous two-stage methods.
arXiv Detail & Related papers (2023-05-01T02:26:15Z) - SEMv2: Table Separation Line Detection Based on Instance Segmentation [96.36188168694781]
We propose an accurate table structure recognizer, termed SEMv2 (SEM: Split, Embed and Merge)
We address the table separation line instance-level discrimination problem and introduce a table separation line detection strategy based on conditional convolution.
To comprehensively evaluate the SEMv2, we also present a more challenging dataset for table structure recognition, dubbed iFLYTAB.
arXiv Detail & Related papers (2023-03-08T05:15:01Z) - Tab2KG: Semantic Table Interpretation with Lightweight Semantic Profiles [3.655021726150368]
This article proposes Tab2KG - a novel method that targets at the semantic interpretation of tables with previously unseen data.
We introduce original semantic profiles that enrich a domain's concepts and relations and represent domain and table characteristics.
In contrast to the existing semantic table interpretation approaches, Tab2KG relies on the semantic profiles only and does not require any instance lookup.
arXiv Detail & Related papers (2023-02-02T15:12:30Z) - TRUST: An Accurate and End-to-End Table structure Recognizer Using
Splitting-based Transformers [56.56591337457137]
We propose an accurate and end-to-end transformer-based table structure recognition method, referred to as TRUST.
Transformers are suitable for table structure recognition because of their global computations, perfect memory, and parallel computation.
We conduct experiments on several popular benchmarks including PubTabNet and SynthTable, our method achieves new state-of-the-art results.
arXiv Detail & Related papers (2022-08-31T08:33:36Z) - TGRNet: A Table Graph Reconstruction Network for Table Structure
Recognition [76.06530816349763]
We propose an end-to-end trainable table graph reconstruction network (TGRNet) for table structure recognition.
Specifically, the proposed method has two main branches, a cell detection branch and a cell logical location branch, to jointly predict the spatial location and the logical location of different cells.
arXiv Detail & Related papers (2021-06-20T01:57:05Z) - Retrieving Complex Tables with Multi-Granular Graph Representation
Learning [20.72341939868327]
The task of natural language table retrieval seeks to retrieve semantically relevant tables based on natural language queries.
Existing learning systems treat tables as plain text based on the assumption that tables are structured as dataframes.
We propose Graph-based Table Retrieval (GTR), a generalizable NLTR framework with multi-granular graph representation learning.
arXiv Detail & Related papers (2021-05-04T20:19:03Z) - TCN: Table Convolutional Network for Web Table Interpretation [52.32515851633981]
We propose a novel table representation learning approach considering both the intra- and inter-table contextual information.
Our method can outperform competitive baselines by +4.8% of F1 for column type prediction and by +4.1% of F1 for column pairwise relation prediction.
arXiv Detail & Related papers (2021-02-17T02:18:10Z) - A Graph Representation of Semi-structured Data for Web Question
Answering [96.46484690047491]
We propose a novel graph representation of Web tables and lists based on a systematic categorization of the components in semi-structured data as well as their relations.
Our method improves F1 score by 3.90 points over the state-of-the-art baselines.
arXiv Detail & Related papers (2020-10-14T04:01:54Z) - DART: Open-Domain Structured Data Record to Text Generation [91.23798751437835]
We present DART, an open domain structured DAta Record to Text generation dataset with over 82k instances (DARTs)
We propose a procedure of extracting semantic triples from tables that encode their structures by exploiting the semantic dependencies among table headers and the table title.
Our dataset construction framework effectively merged heterogeneous sources from open domain semantic parsing and dialogue-act-based meaning representation tasks.
arXiv Detail & Related papers (2020-07-06T16:35:30Z) - Identifying Table Structure in Documents using Conditional Generative
Adversarial Networks [0.0]
In many industries and in academic research, information is primarily transmitted in the form of unstructured documents.
We propose a top-down approach, first using a conditional generative adversarial network to map a table image into a standardised skeleton' table form.
We then deriving latent table structure using xy-cut projection and Genetic Algorithm optimisation.
arXiv Detail & Related papers (2020-01-13T20:42:40Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.