Providing Meaningful Data Summarizations Using Examplar-based Clustering
in Industry 4.0
- URL: http://arxiv.org/abs/2105.12026v1
- Date: Tue, 25 May 2021 15:55:14 GMT
- Title: Providing Meaningful Data Summarizations Using Examplar-based Clustering
in Industry 4.0
- Authors: Philipp-Jan Honysz and Alexander Schulze-Struchtrup and Sebastian
Buschj\"ager and Katharina Morik
- Abstract summary: We show, that our GPU implementation provides speedups of up to 72x using single-precision and up to 452x using half-precision compared to conventional CPU algorithms.
We apply our algorithm to real-world data from injection molding manufacturing processes and discuss how found summaries help with steering this specific process to cut costs and reduce the manufacturing of bad parts.
- Score: 67.80123919697971
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Data summarizations are a valuable tool to derive knowledge from large data
streams and have proven their usefulness in a great number of applications.
Summaries can be found by optimizing submodular functions. These functions map
subsets of data to real values, which indicate their "representativeness" and
which should be maximized to find a diverse summary of the underlying data. In
this paper, we studied Exemplar-based clustering as a submodular function and
provide a GPU algorithm to cope with its high computational complexity. We
show, that our GPU implementation provides speedups of up to 72x using
single-precision and up to 452x using half-precision computation compared to
conventional CPU algorithms. We also show, that the GPU algorithm not only
provides remarkable runtime benefits with workstation-grade GPUs but also with
low-power embedded computation units for which speedups of up to 35x are
possible. Furthermore, we apply our algorithm to real-world data from injection
molding manufacturing processes and discuss how found summaries help with
steering this specific process to cut costs and reduce the manufacturing of bad
parts. Beyond pure speedup considerations, we show, that our approach can
provide summaries within reasonable time frames for this kind of industrial,
real-world data.
Related papers
- Exploiting Local Features and Range Images for Small Data Real-Time Point Cloud Semantic Segmentation [4.02235104503587]
In this paper, we harness the information from the three-dimensional representation to proficiently capture local features.
A GPU-based KDTree allows for rapid building, querying, and enhancing projection with straightforward operations.
We show that a reduced version of our model not only demonstrates strong competitiveness against full-scale state-of-the-art models but also operates in real-time.
arXiv Detail & Related papers (2024-10-14T13:49:05Z) - Randomized Polar Codes for Anytime Distributed Machine Learning [66.46612460837147]
We present a novel distributed computing framework that is robust to slow compute nodes, and is capable of both approximate and exact computation of linear operations.
We propose a sequential decoding algorithm designed to handle real valued data while maintaining low computational complexity for recovery.
We demonstrate the potential applications of this framework in various contexts, such as large-scale matrix multiplication and black-box optimization.
arXiv Detail & Related papers (2023-09-01T18:02:04Z) - INR-Arch: A Dataflow Architecture and Compiler for Arbitrary-Order
Gradient Computations in Implicit Neural Representation Processing [66.00729477511219]
Given a function represented as a computation graph, traditional architectures face challenges in efficiently computing its nth-order gradient.
We introduce INR-Arch, a framework that transforms the computation graph of an nth-order gradient into a hardware-optimized dataflow architecture.
We present results that demonstrate 1.8-4.8x and 1.5-3.6x speedup compared to CPU and GPU baselines respectively.
arXiv Detail & Related papers (2023-08-11T04:24:39Z) - Efficient Dataset Distillation Using Random Feature Approximation [109.07737733329019]
We propose a novel algorithm that uses a random feature approximation (RFA) of the Neural Network Gaussian Process (NNGP) kernel.
Our algorithm provides at least a 100-fold speedup over KIP and can run on a single GPU.
Our new method, termed an RFA Distillation (RFAD), performs competitively with KIP and other dataset condensation algorithms in accuracy over a range of large-scale datasets.
arXiv Detail & Related papers (2022-10-21T15:56:13Z) - Efficient GPU implementation of randomized SVD and its applications [17.71779625877989]
Matrix decompositions are ubiquitous in machine learning, including applications in dimensionality data compression and deep learning algorithms.
Typical solutions for matrix decompositions have complexity which significantly increases their computational cost and time.
We leverage efficient processing operations that can be run in parallel on modern Graphical Processing Units (GPUs) to reduce the computational burden of computing matrix decompositions.
arXiv Detail & Related papers (2021-10-05T07:42:41Z) - GPU-Accelerated Optimizer-Aware Evaluation of Submodular Exemplar
Clustering [5.897728689802829]
optimization of submodular functions constitutes a viable way to perform clustering.
Strong approximation guarantees and feasible optimization w.r.t. streaming data make this clustering approach favorable.
Exemplar-based clustering is one of the possible submodular functions, but suffers from high computational complexity.
Half-precision GPU computation led to large speedups of up to 452x compared to single-precision, single-thread CPU computations.
arXiv Detail & Related papers (2021-01-21T18:23:44Z) - Kernel methods through the roof: handling billions of points efficiently [94.31450736250918]
Kernel methods provide an elegant and principled approach to nonparametric learning, but so far could hardly be used in large scale problems.
Recent advances have shown the benefits of a number of algorithmic ideas, for example combining optimization, numerical linear algebra and random projections.
Here, we push these efforts further to develop and test a solver that takes full advantage of GPU hardware.
arXiv Detail & Related papers (2020-06-18T08:16:25Z) - Fast Object Classification and Meaningful Data Representation of
Segmented Lidar Instances [0.966840768820136]
We propose a way to facilitate real-time Lidar object classification on CPU.
We show how our approach uses segmented object instances to extract important features.
We show, that our algorithm is capable of producing good results on public data, while running in real time on CPU without using specific optimisation.
arXiv Detail & Related papers (2020-06-17T17:16:38Z) - Real-Time Regression with Dividing Local Gaussian Processes [62.01822866877782]
Local Gaussian processes are a novel, computationally efficient modeling approach based on Gaussian process regression.
Due to an iterative, data-driven division of the input space, they achieve a sublinear computational complexity in the total number of training points in practice.
A numerical evaluation on real-world data sets shows their advantages over other state-of-the-art methods in terms of accuracy as well as prediction and update speed.
arXiv Detail & Related papers (2020-06-16T18:43:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.