An extensible point-based method for data chart value detection
- URL: http://arxiv.org/abs/2308.11788v1
- Date: Tue, 22 Aug 2023 21:03:58 GMT
- Title: An extensible point-based method for data chart value detection
- Authors: Carlos Soto, Shinjae Yoo
- Abstract summary: We present a method for identifying semantic points to reverse engineer data charts.
Our method uses a point proposal network to directly predict the position of points of interest in a chart.
We focus on complex bar charts in the scientific literature, on which our model is able to detect salient points with an accuracy of 0.8705 F1.
- Score: 7.9137747195666455
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We present an extensible method for identifying semantic points to reverse
engineer (i.e. extract the values of) data charts, particularly those in
scientific articles. Our method uses a point proposal network (akin to region
proposal networks for object detection) to directly predict the position of
points of interest in a chart, and it is readily extensible to multiple chart
types and chart elements. We focus on complex bar charts in the scientific
literature, on which our model is able to detect salient points with an
accuracy of 0.8705 F1 (@1.5-cell max deviation); it achieves 0.9810 F1 on
synthetically-generated charts similar to those used in prior works. We also
explore training exclusively on synthetic data with novel augmentations,
reaching surprisingly competent performance in this way (0.6621 F1) on real
charts with widely varying appearance, and we further demonstrate our unchanged
method applied directly to synthetic pie charts (0.8343 F1). Datasets, trained
models, and evaluation code are available at
https://github.com/BNLNLP/PPN_model.
Related papers
- ChartEye: A Deep Learning Framework for Chart Information Extraction [2.4936576553283287]
In this study, we propose a deep learning-based framework that provides a solution for key steps in the chart information extraction pipeline.
The proposed framework utilizes hierarchal vision transformers for the tasks of chart-type and text-role classification, while YOLOv7 for text detection.
Our proposed framework achieves excellent performance at every stage with F1-scores of 0.97 for chart-type classification, 0.91 for text-role classification, and a mean Average Precision of 0.95 for text detection.
arXiv Detail & Related papers (2024-08-28T20:22:39Z) - Deep Generative Models for Subgraph Prediction [10.56335881963895]
This paper introduces subgraph queries as a new task for deep graph learning.
Subgraph queries jointly predict the components of a target subgraph based on evidence that is represented by an observed subgraph.
We utilize a probabilistic deep Graph Generative Model to answer subgraph queries.
arXiv Detail & Related papers (2024-08-07T19:24:02Z) - ChartDETR: A Multi-shape Detection Network for Visual Chart Recognition [33.89209291115389]
Current methods rely on keypoint detection to estimate data element shapes in charts but suffer from grouping errors in post-processing.
We propose ChartDETR, a transformer-based multi-shape detector that localizes keypoints at the corners of regular shapes to reconstruct multiple data elements in a single chart image.
Our method predicts all data element shapes at once by introducing query groups in set prediction, eliminating the need for further postprocessing.
arXiv Detail & Related papers (2023-08-15T12:50:06Z) - CorrMatch: Label Propagation via Correlation Matching for
Semi-Supervised Semantic Segmentation [73.89509052503222]
This paper presents a simple but performant semi-supervised semantic segmentation approach, called CorrMatch.
We observe that the correlation maps not only enable clustering pixels of the same category easily but also contain good shape information.
We propose to conduct pixel propagation by modeling the pairwise similarities of pixels to spread the high-confidence pixels and dig out more.
Then, we perform region propagation to enhance the pseudo labels with accurate class-agnostic masks extracted from the correlation maps.
arXiv Detail & Related papers (2023-06-07T10:02:29Z) - ChartReader: A Unified Framework for Chart Derendering and Comprehension
without Heuristic Rules [89.75395046894809]
We present ChartReader, a unified framework that seamlessly integrates chart derendering and comprehension tasks.
Our approach includes a transformer-based chart component detection module and an extended pre-trained vision-language model for chart-to-X tasks.
Our proposed framework can significantly reduce the manual effort involved in chart analysis, providing a step towards a universal chart understanding model.
arXiv Detail & Related papers (2023-04-05T00:25:27Z) - Learnable Graph Matching: A Practical Paradigm for Data Association [74.28753343714858]
We propose a general learnable graph matching method to address these issues.
Our method achieves state-of-the-art performance on several MOT datasets.
For image matching, our method outperforms state-of-the-art methods on a popular indoor dataset, ScanNet.
arXiv Detail & Related papers (2023-03-27T17:39:00Z) - VizExtract: Automatic Relation Extraction from Data Visualizations [7.2241069295727955]
This paper presents a framework for automatically extracting compared variables from statistical charts.
We leverage a computer vision based framework to automatically identify and localize visualization facets in line graphs, scatter plots, or bar graphs.
In controlled experiments, our framework is able to classify, with 87.5% accuracy, the correlation between variables for graphs with 1-3 series per graph, varying colors, and solid line styles.
arXiv Detail & Related papers (2021-12-07T04:27:08Z) - Learnable Graph Matching: Incorporating Graph Partitioning with Deep
Feature Learning for Multiple Object Tracking [58.30147362745852]
Data association across frames is at the core of Multiple Object Tracking (MOT) task.
Existing methods mostly ignore the context information among tracklets and intra-frame detections.
We propose a novel learnable graph matching method to address these issues.
arXiv Detail & Related papers (2021-03-30T08:58:45Z) - Robust Optimization as Data Augmentation for Large-scale Graphs [117.2376815614148]
We propose FLAG (Free Large-scale Adversarial Augmentation on Graphs), which iteratively augments node features with gradient-based adversarial perturbations during training.
FLAG is a general-purpose approach for graph data, which universally works in node classification, link prediction, and graph classification tasks.
arXiv Detail & Related papers (2020-10-19T21:51:47Z) - Graph-PCNN: Two Stage Human Pose Estimation with Graph Pose Refinement [54.29252286561449]
We propose a two-stage graph-based and model-agnostic framework, called Graph-PCNN.
In the first stage, heatmap regression network is applied to obtain a rough localization result, and a set of proposal keypoints, called guided points, are sampled.
In the second stage, for each guided point, different visual feature is extracted by the localization.
The relationship between guided points is explored by the graph pose refinement module to get more accurate localization results.
arXiv Detail & Related papers (2020-07-21T04:59:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.