CoVA: Context-aware Visual Attention for Webpage Information Extraction
- URL: http://arxiv.org/abs/2110.12320v1
- Date: Sun, 24 Oct 2021 00:21:46 GMT
- Title: CoVA: Context-aware Visual Attention for Webpage Information Extraction
- Authors: Anurendra Kumar, Keval Morabia, Jingjin Wang, Kevin Chen-Chuan Chang,
Alexander Schwing
- Abstract summary: We propose to reformulate WIE as a context-aware Webpage Object Detection task.
We develop a Context-aware Visual Attention-based (CoVA) detection pipeline which combines appearance features with syntactical structure from the DOM tree.
We show that the proposed CoVA approach is a new challenging baseline which improves upon prior state-of-the-art methods.
- Score: 65.11609398029783
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Webpage information extraction (WIE) is an important step to create knowledge
bases. For this, classical WIE methods leverage the Document Object Model (DOM)
tree of a website. However, use of the DOM tree poses significant challenges as
context and appearance are encoded in an abstract manner. To address this
challenge we propose to reformulate WIE as a context-aware Webpage Object
Detection task. Specifically, we develop a Context-aware Visual Attention-based
(CoVA) detection pipeline which combines appearance features with syntactical
structure from the DOM tree. To study the approach we collect a new large-scale
dataset of e-commerce websites for which we manually annotate every web element
with four labels: product price, product title, product image and background.
On this dataset we show that the proposed CoVA approach is a new challenging
baseline which improves upon prior state-of-the-art methods.
Related papers
- Augmented Commonsense Knowledge for Remote Object Grounding [67.30864498454805]
We propose an augmented commonsense knowledge model (ACK) to leverage commonsense information as atemporal knowledge graph for improving agent navigation.
ACK consists of knowledge graph-aware cross-modal and concept aggregation modules to enhance visual representation and visual-textual data alignment.
We add a new pipeline for the commonsense-based decision-making process which leads to more accurate local action prediction.
arXiv Detail & Related papers (2024-06-03T12:12:33Z) - Open-Vocabulary Camouflaged Object Segmentation [66.94945066779988]
We introduce a new task, open-vocabulary camouflaged object segmentation (OVCOS)
We construct a large-scale complex scene dataset (textbfOVCamo) containing 11,483 hand-selected images with fine annotations and corresponding object classes.
By integrating the guidance of class semantic knowledge and the supplement of visual structure cues from the edge and depth information, the proposed method can efficiently capture camouflaged objects.
arXiv Detail & Related papers (2023-11-19T06:00:39Z) - WebFormer: The Web-page Transformer for Structure Information Extraction [44.46531405460861]
Structure information extraction refers to the task of extracting structured text fields from web pages.
Recent natural language models with sequence modeling have demonstrated state-of-the-art performance on web information extraction.
We introduce WebFormer, a Web-page transFormer model for structure information extraction from web documents.
arXiv Detail & Related papers (2022-02-01T04:44:02Z) - Learning Context-Aware Representations of Subtrees [0.0]
This thesis tackles the problem of learning efficient representations of complex, structured data with a natural application to web page and element classification.
We hypothesise that the context around the element inside the web page is of high value to the problem and is currently under exploited.
This thesis aims to solve the problem of classifying web elements as subtrees of a DOM tree by also considering their context.
arXiv Detail & Related papers (2021-11-08T07:43:14Z) - Knowledge Graph Anchored Information-Extraction for Domain-Specific
Insights [1.6308268213252761]
We use a task-based approach for fulfilling specific information needs within a new domain.
A pipeline constructed of state of the art NLP technologies is used to automatically extract an instance level semantic structure.
arXiv Detail & Related papers (2021-04-18T19:28:10Z) - Simplified DOM Trees for Transferable Attribute Extraction from the Web [15.728164692696689]
Given a web page, extracting a structured object along with various attributes of interest can facilitate a variety of downstream applications.
Existing approaches formulate the problem as a DOM tree node tagging task.
We propose a novel transferable method, SimpDOM, to tackle the problem by efficiently retrieving useful context for each node.
arXiv Detail & Related papers (2021-01-07T07:41:55Z) - ZeroShotCeres: Zero-Shot Relation Extraction from Semi-Structured
Webpages [66.45377533562417]
We propose a solution for "zero-shot" open-domain relation extraction from webpages with a previously unseen template.
Our model uses a graph neural network-based approach to build a rich representation of text fields on a webpage.
arXiv Detail & Related papers (2020-05-14T16:15:58Z) - SciREX: A Challenge Dataset for Document-Level Information Extraction [56.83748634747753]
It is challenging to create a large-scale information extraction dataset at the document level.
We introduce SciREX, a document level IE dataset that encompasses multiple IE tasks.
We develop a neural model as a strong baseline that extends previous state-of-the-art IE models to document-level IE.
arXiv Detail & Related papers (2020-05-01T17:30:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.