Page Segmentation using Visual Adjacency Analysis
- URL: http://arxiv.org/abs/2112.11975v1
- Date: Sat, 11 Dec 2021 00:20:30 GMT
- Title: Page Segmentation using Visual Adjacency Analysis
- Authors: Mohammad Bajammal, Ali Mesbah
- Abstract summary: We propose a novel page segmentation approach based on visual analysis of localized adjacency regions.
It combines DOM attributes and visual analysis to build features of a given page and guide an unsupervised clustering.
We evaluate our approach on 35 real-world web pages, and examine the effectiveness and efficiency of segmentation.
- Score: 5.9521013526545925
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Page segmentation is a web page analysis process that divides a page into
cohesive segments, such as sidebars, headers, and footers. Current page
segmentation approaches use either the DOM, textual content, or rendering style
information of the page. However, these approaches have a number of drawbacks,
such as a large number of parameters and rigid assumptions about the page,
which negatively impact their segmentation accuracy. We propose a novel page
segmentation approach based on visual analysis of localized adjacency regions.
It combines DOM attributes and visual analysis to build features of a given
page and guide an unsupervised clustering. We evaluate our approach on 35
real-world web pages, and examine the effectiveness and efficiency of
segmentation. The results show that, compared with state-of-the-art, our
approach achieves an average of 156% increase in precision and 249% improvement
in F-measure.
Related papers
- Visual Prompt Selection for In-Context Learning Segmentation [77.15684360470152]
In this paper, we focus on rethinking and improving the example selection strategy.
We first demonstrate that ICL-based segmentation models are sensitive to different contexts.
Furthermore, empirical evidence indicates that the diversity of contextual prompts plays a crucial role in guiding segmentation.
arXiv Detail & Related papers (2024-07-14T15:02:54Z) - Language Guided Domain Generalized Medical Image Segmentation [68.93124785575739]
Single source domain generalization holds promise for more reliable and consistent image segmentation across real-world clinical settings.
We propose an approach that explicitly leverages textual information by incorporating a contrastive learning mechanism guided by the text encoder features.
Our approach achieves favorable performance against existing methods in literature.
arXiv Detail & Related papers (2024-04-01T17:48:15Z) - Page Layout Analysis of Text-heavy Historical Documents: a Comparison of
Textual and Visual Approaches [0.0]
Page layout analysis is a fundamental step in document processing which enables to segment a page into regions of interest.
With highly complex layouts and mixed scripts, scholarly annotated are text-heavy documents which remain challenging for state-of-the-art models.
arXiv Detail & Related papers (2022-12-12T10:10:29Z) - Semantic Parsing of Interpage Relations [0.0]
We formalize the task as semantic parsing of interpage relations and we propose an end-to-end approach for interpage dependency extraction.
We also design a multi-task training approach to jointly optimize for page embeddings to be used in segmentation, classification, and parsing of the page dependencies.
Our experimental results show that the proposed method increased LAS by 41 percentage points for semantic parsing, increased accuracy by 33 percentage points for page stream segmentation, and 45 percentage points for page classification over a naive baseline.
arXiv Detail & Related papers (2022-05-26T17:50:43Z) - Unsupervised Deep Learning for Handwritten Page Segmentation [0.0]
We present an unsupervised deep learning method for page segmentation.
A siamese neural network is trained to differentiate between patches using their measurable properties.
Our experiments show that the proposed unsupervised method is as effective as typical supervised methods.
arXiv Detail & Related papers (2021-01-19T07:13:38Z) - Rethinking Interactive Image Segmentation: Feature Space Annotation [68.8204255655161]
We propose interactive and simultaneous segment annotation from multiple images guided by feature space projection.
We show that our approach can surpass the accuracy of state-of-the-art methods in foreground segmentation datasets.
arXiv Detail & Related papers (2021-01-12T10:13:35Z) - WikiAsp: A Dataset for Multi-domain Aspect-based Summarization [69.13865812754058]
We propose WikiAsp, a large-scale dataset for multi-domain aspect-based summarization.
Specifically, we build the dataset using Wikipedia articles from 20 different domains, using the section titles and boundaries of each article as a proxy for aspect annotation.
Results highlight key challenges that existing summarization models face in this setting, such as proper pronoun handling of quoted sources and consistent explanation of time-sensitive events.
arXiv Detail & Related papers (2020-11-16T10:02:52Z) - Improving Semantic Segmentation via Decoupled Body and Edge Supervision [89.57847958016981]
Existing semantic segmentation approaches either aim to improve the object's inner consistency by modeling the global context, or refine objects detail along their boundaries by multi-scale feature fusion.
In this paper, a new paradigm for semantic segmentation is proposed.
Our insight is that appealing performance of semantic segmentation requires textitexplicitly modeling the object textitbody and textitedge, which correspond to the high and low frequency of the image.
We show that the proposed framework with various baselines or backbone networks leads to better object inner consistency and object boundaries.
arXiv Detail & Related papers (2020-07-20T12:11:22Z) - Webpage Segmentation for Extracting Images and Their Surrounding
Contextual Information [0.0]
We propose a webpage segmentation algorithm targeting the extraction of web images and their contextual information based on their characteristics as they appear on webpages.
We conducted a user study to obtain a human-labeled dataset to validate the effectiveness of our method and experiments demonstrated that our method can achieve better results than an existing segmentation algorithm.
arXiv Detail & Related papers (2020-05-18T19:00:03Z) - An Evaluation of DNN Architectures for Page Segmentation of Historical
Newspapers [0.0]
We evaluate 11 different published Deep Neural Networks backbone architectures and 9 different tiling and scaling configurations for separating text, tables or table column lines.
We show the influence of the number of labels and the number of training pages on the segmentation quality, which we measure using the Matthews Correlation Coefficient.
Our results show that (depending on the task) Inception-ResNet-v2 and EfficientNet backbones work best, vertical tiling is generally preferable to other tiling approaches.
arXiv Detail & Related papers (2020-04-15T20:05:54Z) - SceneEncoder: Scene-Aware Semantic Segmentation of Point Clouds with A
Learnable Scene Descriptor [51.298760338410624]
We propose a SceneEncoder module to impose a scene-aware guidance to enhance the effect of global information.
The module predicts a scene descriptor, which learns to represent the categories of objects existing in the scene.
We also design a region similarity loss to propagate distinguishing features to their own neighboring points with the same label.
arXiv Detail & Related papers (2020-01-24T16:53:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.