Binary Classification as a Phase Separation Process
- URL: http://arxiv.org/abs/2009.02467v3
- Date: Sat, 18 Sep 2021 02:17:28 GMT
- Title: Binary Classification as a Phase Separation Process
- Authors: Rafael Monteiro
- Abstract summary: We propose a new binary classification model called Phase Separation Binary (PSBC)
It consists of a discretization of a nonlinear reaction-diffusion equation coupled with an Ordinary Differential Equation.
PSBC's equations can be seen as a dynamical system whose coefficients are trainable weights, with a similar architecture to that of a Recurrent Neural Network.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We propose a new binary classification model called Phase Separation Binary
Classifier (PSBC). It consists of a discretization of a nonlinear
reaction-diffusion equation coupled with an Ordinary Differential Equation, and
is inspired by fluids behavior, namely, on how binary fluids phase separate.
Thus, parameters and hyperparameters have physical meaning, whose effects are
studied in several different scenarios.
PSBC's equations can be seen as a dynamical system whose coefficients are
trainable weights, with a similar architecture to that of a Recurrent Neural
Network. As such, forward propagation amounts to an initial value problem.
Boundary conditions are also present, bearing similarity with figure padding
techniques in Computer Vision. Model compression is exploited in several ways,
with weight sharing taking place both across and within layers.
The model is tested on pairs of digits of the classical MNIST database. An
associated multiclass classifier is also constructed using a combination of
Ensemble Learning and one versus one techniques. It is also shown how the PSBC
can be combined with other methods - like aggregation and PCA - in order to
construct better binary classifiers. The role of boundary conditions and
viscosity is thoroughly studied in the case of digits ``0'' and ``1''.
Related papers
- Latent Schrodinger Bridge: Prompting Latent Diffusion for Fast Unpaired Image-to-Image Translation [58.19676004192321]
Diffusion models (DMs), which enable both image generation from noise and inversion from data, have inspired powerful unpaired image-to-image (I2I) translation algorithms.
We tackle this problem with Schrodinger Bridges (SBs), which are differential equations (SDEs) between distributions with minimal transport cost.
Inspired by this observation, we propose Latent Schrodinger Bridges (LSBs) that approximate the SB ODE via pre-trained Stable Diffusion.
We demonstrate that our algorithm successfully conduct competitive I2I translation in unsupervised setting with only a fraction of cost required by previous DM-
arXiv Detail & Related papers (2024-11-22T11:24:14Z) - Learning-based Multi-continuum Model for Multiscale Flow Problems [24.93423649301792]
We propose a learning-based multi-continuum model to enrich the homogenized equation and improve the accuracy of the single model for multiscale problems.
Our proposed learning-based multi-continuum model can resolve multiple interacted media within each coarse grid block and describe the mass transfer among them.
arXiv Detail & Related papers (2024-03-21T02:30:56Z) - Solving the Discretised Multiphase Flow Equations with Interface
Capturing on Structured Grids Using Machine Learning Libraries [0.6299766708197884]
This paper solves the discretised multiphase flow equations using tools and methods from machine-learning libraries.
For the first time, finite element discretisations of multiphase flows can be solved using an approach based on (untrained) convolutional neural networks.
arXiv Detail & Related papers (2024-01-12T18:42:42Z) - Tractable Bounding of Counterfactual Queries by Knowledge Compilation [51.47174989680976]
We discuss the problem of bounding partially identifiable queries, such as counterfactuals, in Pearlian structural causal models.
A recently proposed iterated EM scheme yields an inner approximation of those bounds by sampling the initialisation parameters.
We show how a single symbolic knowledge compilation allows us to obtain the circuit structure with symbolic parameters to be replaced by their actual values.
arXiv Detail & Related papers (2023-10-05T07:10:40Z) - Thermodynamics of bidirectional associative memories [0.0]
We investigate the equilibrium properties of bidirectional associative memories (BAMs)
introduced by Kosko in 1988 as a generalization of the Hopfield model to a bipartite structure.
We characterize the computational capabilities of a extension of this model in the thermodynamic limit.
arXiv Detail & Related papers (2022-11-17T17:35:37Z) - Linear Connectivity Reveals Generalization Strategies [54.947772002394736]
Some pairs of finetuned models have large barriers of increasing loss on the linear paths between them.
We find distinct clusters of models which are linearly connected on the test loss surface, but are disconnected from models outside the cluster.
Our work demonstrates how the geometry of the loss surface can guide models towards different functions.
arXiv Detail & Related papers (2022-05-24T23:43:02Z) - Dynamically-Scaled Deep Canonical Correlation Analysis [77.34726150561087]
Canonical Correlation Analysis (CCA) is a method for feature extraction of two views by finding maximally correlated linear projections of them.
We introduce a novel dynamic scaling method for training an input-dependent canonical correlation model.
arXiv Detail & Related papers (2022-03-23T12:52:49Z) - The Separation Capacity of Random Neural Networks [78.25060223808936]
We show that a sufficiently large two-layer ReLU-network with standard Gaussian weights and uniformly distributed biases can solve this problem with high probability.
We quantify the relevant structure of the data in terms of a novel notion of mutual complexity.
arXiv Detail & Related papers (2021-07-31T10:25:26Z) - Learning second order coupled differential equations that are subject to
non-conservative forces [0.0]
We introduce a network that incorporates a difference approximation for the second order derivative in terms of residual connections between convolutional blocks.
We optimize this map together with the solver network, while sharing their weights, to form a powerful framework capable of learning the complex physical properties of a dissipative dynamical system.
arXiv Detail & Related papers (2020-10-17T23:31:31Z) - Provably Efficient Neural Estimation of Structural Equation Model: An
Adversarial Approach [144.21892195917758]
We study estimation in a class of generalized Structural equation models (SEMs)
We formulate the linear operator equation as a min-max game, where both players are parameterized by neural networks (NNs), and learn the parameters of these neural networks using a gradient descent.
For the first time we provide a tractable estimation procedure for SEMs based on NNs with provable convergence and without the need for sample splitting.
arXiv Detail & Related papers (2020-07-02T17:55:47Z) - Reduced Dilation-Erosion Perceptron for Binary Classification [1.3706331473063877]
Dilation-erosion perceptron (DEP) is a neural network obtained by a convex combination of a dilation and an erosion.
This paper introduces the reduced dilation-erosion (r-DEP) classifier.
arXiv Detail & Related papers (2020-03-04T19:50:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.