Plot2Spectra: an Automatic Spectra Extraction Tool
- URL: http://arxiv.org/abs/2107.02827v1
- Date: Tue, 6 Jul 2021 18:17:28 GMT
- Title: Plot2Spectra: an Automatic Spectra Extraction Tool
- Authors: Weixin Jiang, Eric Schwenker, Trevor Spreadbury, Kai Li, Maria K.Y.
Chan, Oliver Cossairt
- Abstract summary: This paper develops a plot digitizer, named Plot2Spectra, to extract data points from spectroscopy graph images in an automatic fashion.
In the first axis alignment stage, we adopt an anchor-free detector to detect the plot region and then refine the detected bounding boxes.
In the second plot data extraction stage, we first employ semantic segmentation to separate pixels belonging to plot lines from the background.
- Score: 10.64947007982639
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Different types of spectroscopies, such as X-ray absorption near edge
structure (XANES) and Raman spectroscopy, play a very important role in
analyzing the characteristics of different materials. In scientific literature,
XANES/Raman data are usually plotted in line graphs which is a visually
appropriate way to represent the information when the end-user is a human
reader. However, such graphs are not conducive to direct programmatic analysis
due to the lack of automatic tools. In this paper, we develop a plot digitizer,
named Plot2Spectra, to extract data points from spectroscopy graph images in an
automatic fashion, which makes it possible for large scale data acquisition and
analysis. Specifically, the plot digitizer is a two-stage framework. In the
first axis alignment stage, we adopt an anchor-free detector to detect the plot
region and then refine the detected bounding boxes with an edge-based
constraint to locate the position of two axes. We also apply scene text
detector to extract and interpret all tick information below the x-axis. In the
second plot data extraction stage, we first employ semantic segmentation to
separate pixels belonging to plot lines from the background, and from there,
incorporate optical flow constraints to the plot line pixels to assign them to
the appropriate line (data instance) they encode. Extensive experiments are
conducted to validate the effectiveness of the proposed plot digitizer, which
shows that such a tool could help accelerate the discovery and machine learning
of materials properties.
Related papers
- GenPlot: Increasing the Scale and Diversity of Chart Derendering Data [0.0]
We propose GenPlot, a plot generator that can generate billions of additional plots for chart-derendering using synthetic data.
OCR-free chart-to-text translation has achieved state-of-the-art results on visual language tasks.
arXiv Detail & Related papers (2023-06-20T17:25:53Z) - Graph Classification Gaussian Processes via Spectral Features [7.474662887810221]
Graph classification aims to categorise graphs based on their structure and node attributes.
In this work, we propose to tackle this task using tools from graph signal processing by deriving spectral features.
We show that even such a simple approach, having no learned parameters, can yield competitive performance compared to strong neural network and graph kernel baselines.
arXiv Detail & Related papers (2023-06-06T15:31:05Z) - Multiscale Analysis for Improving Texture Classification [62.226224120400026]
This paper employs the Gaussian-Laplacian pyramid to treat different spatial frequency bands of a texture separately.
We aggregate features extracted from gray and color texture images using bio-inspired texture descriptors, information-theoretic measures, gray-level co-occurrence matrix features, and Haralick statistical features into a single feature vector.
arXiv Detail & Related papers (2022-04-21T01:32:22Z) - Tracking perovskite crystallization via deep learning-based feature
detection on 2D X-ray scattering data [137.47124933818066]
We propose an automated pipeline for the analysis of X-ray diffraction images based on the Faster R-CNN deep learning architecture.
We demonstrate our method on real-time tracking of organic-inorganic perovskite structure crystallization and test it on two applications.
arXiv Detail & Related papers (2022-02-22T15:39:00Z) - Unsupervised Machine Learning for Exploratory Data Analysis of Exoplanet
Transmission Spectra [68.8204255655161]
We focus on unsupervised techniques for analyzing spectral data from transiting exoplanets.
We show that there is a high degree of correlation in the spectral data, which calls for appropriate low-dimensional representations.
We uncover interesting structures in the principal component basis, namely, well-defined branches corresponding to different chemical regimes.
arXiv Detail & Related papers (2022-01-07T22:26:33Z) - Weakly Supervised Change Detection Using Guided Anisotropic Difusion [97.43170678509478]
We propose original ideas that help us to leverage such datasets in the context of change detection.
First, we propose the guided anisotropic diffusion (GAD) algorithm, which improves semantic segmentation results.
We then show its potential in two weakly-supervised learning strategies tailored for change detection.
arXiv Detail & Related papers (2021-12-31T10:03:47Z) - CHARTER: heatmap-based multi-type chart data extraction [7.838284602257369]
We present a method and a system for end-to-end conversion of document charts into machine readable data format.
Our approach extracts and analyses charts along with their graphical elements and supporting structures.
Our detection system is based on neural networks, trained solely on synthetic data.
arXiv Detail & Related papers (2021-11-28T11:01:21Z) - SOLD2: Self-supervised Occlusion-aware Line Description and Detection [95.8719432775724]
We introduce the first joint detection and description of line segments in a single deep network.
Our method does not require any annotated line labels and can therefore generalize to any dataset.
We evaluate our approach against previous line detection and description methods on several multi-view datasets.
arXiv Detail & Related papers (2021-04-07T19:27:17Z) - Tensor Fields for Data Extraction from Chart Images: Bar Charts and
Scatter Plots [0.0]
Automated chart reading involves data extraction and contextual understanding of the data from chart images.
We identify an appropriate tensor field as the model and propose a methodology for the use of its degenerate point extraction for data extraction from chart images.
Our results show that tensor voting is effective for data extraction from bar charts and scatter plots, and histograms, as a special case of bar charts.
arXiv Detail & Related papers (2020-10-05T20:19:40Z) - Localized Spectral Graph Filter Frames: A Unifying Framework, Survey of
Design Considerations, and Numerical Comparison (Extended Cut) [1.52292571922932]
Representing data residing on a graph as a linear combination of building block signals can enable efficient and insightful visual or statistical analysis of the data.
We survey a particular class of dictionaries called localized spectral graph filter frames.
We emphasize computationally efficient methods that ensure the resulting transforms and their inverses can be applied to data residing on large, sparse graphs.
arXiv Detail & Related papers (2020-06-19T16:49:33Z) - Hypergraph Spectral Analysis and Processing in 3D Point Cloud [80.25162983501308]
3D point clouds have become a fundamental data structure to characterize 3D objects and surroundings.
To process 3D point clouds efficiently, a suitable model for the underlying structure and outlier noises is always critical.
We propose a hypergraph-based new point cloud model that is amenable to efficient analysis and processing.
arXiv Detail & Related papers (2020-01-08T05:30:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.