Input Space Mode Connectivity in Deep Neural Networks
- URL: http://arxiv.org/abs/2409.05800v1
- Date: Mon, 9 Sep 2024 17:03:43 GMT
- Title: Input Space Mode Connectivity in Deep Neural Networks
- Authors: Jakub Vrabel, Ori Shem-Ur, Yaron Oz, David Krueger
- Abstract summary: We extend the concept of loss landscape mode connectivity to the input space of deep neural networks.
We present theoretical and empirical evidence of its presence in the input space of deep networks.
We exploit mode connectivity to obtain new insights about adversarial examples and demonstrate its potential for adversarial detection.
- Score: 5.8470747480006695
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We extend the concept of loss landscape mode connectivity to the input space of deep neural networks. Mode connectivity was originally studied within parameter space, where it describes the existence of low-loss paths between different solutions (loss minimizers) obtained through gradient descent. We present theoretical and empirical evidence of its presence in the input space of deep networks, thereby highlighting the broader nature of the phenomenon. We observe that different input images with similar predictions are generally connected, and for trained models, the path tends to be simple, with only a small deviation from being a linear path. Our methodology utilizes real, interpolated, and synthetic inputs created using the input optimization technique for feature visualization. We conjecture that input space mode connectivity in high-dimensional spaces is a geometric effect that takes place even in untrained models and can be explained through percolation theory. We exploit mode connectivity to obtain new insights about adversarial examples and demonstrate its potential for adversarial detection. Additionally, we discuss applications for the interpretability of deep networks.
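To make the idea concrete, the following is a minimal PyTorch sketch of finding a low-loss input-space path: it optimizes the interior points of a polygonal chain between two inputs so that every point keeps a low loss for a shared label. `model`, `x_a`, `x_b`, and `label` are hypothetical placeholders; the authors' actual procedure (which also uses real, interpolated, and feature-visualization inputs) may differ.
```python
import torch
import torch.nn.functional as F

def connect_inputs(model, x_a, x_b, label, n_points=8, steps=200, lr=0.01):
    """Optimize the interior points of a polygonal chain from x_a to x_b so
    that every point on the path keeps a low loss for `label`. A sketch, not
    the paper's exact method."""
    model.eval()
    # Start the free points on the straight line between the endpoints.
    alphas = torch.linspace(0, 1, n_points + 2)[1:-1]
    points = torch.stack([(1 - a) * x_a + a * x_b for a in alphas])
    points = points.clone().requires_grad_(True)
    targets = torch.full((n_points,), label, dtype=torch.long)
    opt = torch.optim.Adam([points], lr=lr)
    for _ in range(steps):
        opt.zero_grad()
        F.cross_entropy(model(points), targets).backward()
        opt.step()
    # Full path, endpoints included; a near-linear result would match the
    # paper's observation that deviations from the linear path are small.
    return torch.cat([x_a.unsqueeze(0), points.detach(), x_b.unsqueeze(0)])
```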
Related papers
- Landscaping Linear Mode Connectivity [76.39694196535996]
Linear mode connectivity (LMC) has garnered interest on both theoretical and practical fronts.
We take a step towards understanding it by providing a model of how the loss landscape needs to behave topographically for LMC to hold.
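For context, LMC is typically probed by evaluating the loss along the straight segment between two solutions in parameter space. The sketch below is that generic barrier check, not this paper's topographic model; `loader` and `loss_fn` are placeholders, and architectures with batch-norm running statistics would need extra care.
```python
import copy
import torch

@torch.no_grad()
def lmc_barrier(model_a, model_b, loss_fn, loader, n_alphas=11):
    """Evaluate the loss along the straight line between two trained models'
    parameters; LMC holds if no point rises far above the endpoints."""
    params_a = [p.detach().clone() for p in model_a.parameters()]
    params_b = [p.detach().clone() for p in model_b.parameters()]
    probe = copy.deepcopy(model_a)
    probe.eval()
    losses = []
    for alpha in torch.linspace(0, 1, n_alphas):
        # Load the interpolated weights into the probe model.
        for p, pa, pb in zip(probe.parameters(), params_a, params_b):
            p.copy_((1 - alpha) * pa + alpha * pb)
        total, count = 0.0, 0
        for x, y in loader:
            total += loss_fn(probe(x), y).item() * len(x)
            count += len(x)
        losses.append(total / count)
    # Barrier height: peak loss minus the mean of the endpoint losses.
    return max(losses) - 0.5 * (losses[0] + losses[-1])
```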
arXiv Detail & Related papers (2024-06-24T03:53:30Z)
- Exploring Neural Network Landscapes: Star-Shaped and Geodesic Connectivity [4.516746821973374]
We show that for two typical global minima, there exists a path connecting them without a barrier.
For a finite number of typical minima, there exists a center on the minima manifold that connects all of them simultaneously.
Results are provably valid for linear networks and two-layer ReLU networks under a teacher-student setup.
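Although the paper proves existence rather than prescribing an algorithm, a star-shaped center can in principle be searched for directly: each step samples a random point on the segment from a candidate center to one of the minima and lowers its loss. The sketch below assumes the minima are given as floating-point parameter state dicts and relies on `torch.func.functional_call` so that gradients reach the center.
```python
import torch
from torch.func import functional_call

def find_star_center(model, minima_state_dicts, loss_fn, batches,
                     steps=500, lr=1e-3):
    """Optimize a 'center' in parameter space so that the straight segment
    from it to each given minimum keeps a low loss. A sketch; assumes the
    state dicts contain only floating-point parameter tensors."""
    names = list(minima_state_dicts[0].keys())
    # Start the center at the average of the minima.
    center = {n: torch.stack([sd[n] for sd in minima_state_dicts]).mean(0)
                   .requires_grad_(True) for n in names}
    opt = torch.optim.Adam(list(center.values()), lr=lr)
    for step in range(steps):
        sd = minima_state_dicts[step % len(minima_state_dicts)]
        alpha = torch.rand(())  # random point on the center-to-minimum segment
        point = {n: (1 - alpha) * center[n] + alpha * sd[n] for n in names}
        x, y = batches[step % len(batches)]
        loss = loss_fn(functional_call(model, point, (x,)), y)
        opt.zero_grad()
        loss.backward()
        opt.step()
    return {n: p.detach() for n, p in center.items()}
```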
arXiv Detail & Related papers (2024-04-09T15:35:02Z)
- Geodesic Mode Connectivity [4.096453902709292]
Mode connectivity is a phenomenon where trained models are connected by a path of low loss.
We propose an algorithm to approximate geodesics and demonstrate that they achieve mode connectivity.
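A generic way to approximate such geodesics is a string (bead) method: discretize the path between the two endpoint models and minimize the summed discrepancy between neighboring beads' outputs on a probe batch. In the sketch below the discrepancy is a KL divergence on a batch `x`, which is an assumption; the paper's metric and update rule may differ.
```python
import torch
import torch.nn.functional as F
from torch.func import functional_call

def approximate_geodesic(model, sd_a, sd_b, x, n_beads=8, steps=300, lr=1e-3):
    """Discretize the path between two parameter state dicts into 'beads' and
    minimize the summed output KL between neighbors on probe batch x."""
    names = list(sd_a.keys())
    alphas = torch.linspace(0, 1, n_beads + 2)[1:-1]
    # Initialize the beads on the straight line between the endpoints.
    beads = [{n: ((1 - a) * sd_a[n] + a * sd_b[n]).requires_grad_(True)
              for n in names} for a in alphas]
    opt = torch.optim.Adam([p for b in beads for p in b.values()], lr=lr)

    def log_probs(sd):
        return F.log_softmax(functional_call(model, sd, (x,)), dim=-1)

    for _ in range(steps):
        chain = [sd_a] + beads + [sd_b]
        logps = [log_probs(sd) for sd in chain]
        # Path 'energy': discrepancy between each pair of neighboring beads.
        energy = sum(F.kl_div(lp1, lp0, log_target=True, reduction="batchmean")
                     for lp0, lp1 in zip(logps[:-1], logps[1:]))
        opt.zero_grad()
        energy.backward()
        opt.step()
    return beads
```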
arXiv Detail & Related papers (2023-08-24T09:18:43Z)
- Quiver neural networks [5.076419064097734]
We develop a uniform theoretical approach towards the analysis of various neural network connectivity architectures.
Inspired by quiver representation theory in mathematics, this approach gives a compact way to capture elaborate data flows.
arXiv Detail & Related papers (2022-07-26T09:42:45Z)
- Deep Architecture Connectivity Matters for Its Convergence: A Fine-Grained Analysis [94.64007376939735]
We theoretically characterize the impact of connectivity patterns on the convergence of deep neural networks (DNNs) under gradient descent training.
We show that by a simple filtration on "unpromising" connectivity patterns, we can trim down the number of models to evaluate.
arXiv Detail & Related papers (2022-05-11T17:43:54Z)
- FuNNscope: Visual microscope for interactively exploring the loss landscape of fully connected neural networks [77.34726150561087]
We show how to explore high-dimensional landscape characteristics of neural networks.
We generalize observations on small neural networks to more complex systems.
An interactive dashboard opens up a number of possible applications.
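Dashboards of this kind typically render a local view by scanning the loss over a random two-dimensional slice through the current weights. The following is a minimal, non-interactive sketch of that standard rendering step, not FuNNscope's actual implementation; `loss_fn`, `x`, and `y` are placeholders.
```python
import torch

@torch.no_grad()
def loss_surface_slice(model, loss_fn, x, y, span=1.0, grid=25):
    """Evaluate the loss on a random 2-D plane through the current weights,
    with layer-wise normalized directions. Returns a grid x grid tensor."""
    model.eval()
    params = list(model.parameters())
    origin = [p.detach().clone() for p in params]
    # Two random directions, rescaled layer-wise to the weights' norms.
    dirs = []
    for _ in range(2):
        d = [torch.randn_like(p) for p in params]
        d = [di * (oi.norm() / (di.norm() + 1e-12)) for di, oi in zip(d, origin)]
        dirs.append(d)
    coords = torch.linspace(-span, span, grid)
    surface = torch.zeros(grid, grid)
    for i, a in enumerate(coords):
        for j, b in enumerate(coords):
            for p, o, d0, d1 in zip(params, origin, dirs[0], dirs[1]):
                p.copy_(o + a * d0 + b * d1)
            surface[i, j] = loss_fn(model(x), y)
    for p, o in zip(params, origin):  # restore the original weights
        p.copy_(o)
    return surface  # e.g. plt.contourf(coords, coords, surface.T)
```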
arXiv Detail & Related papers (2022-04-09T16:41:53Z)
- Deep Networks on Toroids: Removing Symmetries Reveals the Structure of Flat Regions in the Landscape Geometry [3.712728573432119]
We develop a standardized parameterization in which all symmetries are removed, resulting in a toroidal topology.
We derive a meaningful notion of the flatness of minimizers and of the geodesic paths connecting them.
We also find that minimizers found by variants of gradient descent can be connected by zero-error paths with a single bend.
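To illustrate the idea of a standardized parameterization on the simplest case, the sketch below removes the scaling and permutation symmetries of a two-layer ReLU network by rescaling each hidden unit's incoming weights to unit norm and fixing a canonical unit order. The paper's construction is more general and is what yields the toroidal topology.
```python
import torch

def standardize_two_layer_relu(w1, b1, w2):
    """Map f(x) = w2 @ relu(w1 @ x + b1) to a canonical representative, so
    functionally identical networks get identical weights. A minimal sketch;
    assumes no hidden unit has all-zero incoming weights."""
    # 1. Scaling symmetry: relu(z/s)*s == relu(z) for s > 0, so rescale each
    #    unit's incoming weights to unit norm and push the factor into the
    #    outgoing weights.
    scale = torch.sqrt(w1.pow(2).sum(dim=1) + b1.pow(2))      # (h,)
    w1 = w1 / scale[:, None]
    b1 = b1 / scale
    w2 = w2 * scale[None, :]
    # 2. Permutation symmetry: order hidden units by a canonical key
    #    (here, their outgoing weight norm).
    order = torch.argsort(w2.norm(dim=0), descending=True)
    return w1[order], b1[order], w2[:, order]
```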
arXiv Detail & Related papers (2022-02-07T09:57:54Z)
- Learning Connectivity of Neural Networks from a Topological Perspective [80.35103711638548]
We propose a topological perspective that represents a network as a complete graph for analysis.
By assigning learnable parameters that reflect the magnitude of connections to the edges, the learning process can be performed in a differentiable manner.
This learning process is compatible with existing networks and adapts to larger search spaces and different tasks.
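A minimal sketch of this idea: treat nodes as feature transforms on a complete DAG and place a learnable sigmoid gate on every edge, so the connectivity pattern itself is trained by ordinary backpropagation. The module below is illustrative; the paper's node operations and gating differ in detail.
```python
import torch
import torch.nn as nn

class LearnableDAG(nn.Module):
    """Complete DAG over feature-transform nodes; each edge (i -> j) carries
    a learnable scalar gate, so connectivity is learned by gradient descent."""
    def __init__(self, n_nodes=4, dim=64):
        super().__init__()
        self.nodes = nn.ModuleList(
            [nn.Sequential(nn.Linear(dim, dim), nn.ReLU())
             for _ in range(n_nodes)])
        # Gate logits for every edge; only the upper triangle is used.
        self.edge_logits = nn.Parameter(torch.zeros(n_nodes + 1, n_nodes + 1))

    def forward(self, x):
        feats = [x]  # node 0 is the input
        for j, node in enumerate(self.nodes, start=1):
            gates = torch.sigmoid(self.edge_logits[:j, j])  # edges into node j
            agg = sum(g * f for g, f in zip(gates, feats))  # gated predecessor sum
            feats.append(node(agg))
        return feats[-1]

# Usage: y = LearnableDAG()(torch.randn(2, 64))
```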
arXiv Detail & Related papers (2020-08-19T04:53:31Z)
- Bridging Mode Connectivity in Loss Landscapes and Adversarial Robustness [97.67477497115163]
We use mode connectivity to study the adversarial robustness of deep neural networks.
Our experiments cover various types of adversarial attacks applied to different network architectures and datasets.
Our results suggest that mode connectivity offers a holistic tool and practical means for evaluating and improving adversarial robustness.
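A simple version of such a probe evaluates adversarial accuracy at points along a parameter path between two models. The sketch below substitutes naive linear interpolation and a single-step FGSM attack for the paper's trained low-loss curves and stronger attack suite.
```python
import copy
import torch
import torch.nn.functional as F

def robustness_along_path(model_a, model_b, loader, eps=0.03, n_alphas=11):
    """FGSM accuracy at points on the straight parameter path between two
    models. Returns a list of (alpha, adversarial accuracy) pairs."""
    pa = [p.detach().clone() for p in model_a.parameters()]
    pb = [p.detach().clone() for p in model_b.parameters()]
    probe = copy.deepcopy(model_a).eval()
    results = []
    for alpha in torch.linspace(0, 1, n_alphas):
        with torch.no_grad():
            for p, a, b in zip(probe.parameters(), pa, pb):
                p.copy_((1 - alpha) * a + alpha * b)
        correct, total = 0, 0
        for x, y in loader:
            x = x.clone().requires_grad_(True)
            loss = F.cross_entropy(probe(x), y)
            grad, = torch.autograd.grad(loss, x)
            # Single-step FGSM perturbation, clipped to valid pixel range.
            x_adv = (x + eps * grad.sign()).clamp(0, 1).detach()
            with torch.no_grad():
                correct += (probe(x_adv).argmax(1) == y).sum().item()
            total += len(y)
        results.append((alpha.item(), correct / total))
    return results
```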
arXiv Detail & Related papers (2020-04-30T19:12:50Z)
- Neural Operator: Graph Kernel Network for Partial Differential Equations [57.90284928158383]
This work generalizes neural networks so that they can learn mappings between infinite-dimensional spaces (operators).
We formulate approximation of the infinite-dimensional mapping by composing nonlinear activation functions and a class of integral operators.
Experiments confirm that the proposed graph kernel network has the desired properties and shows competitive performance compared to state-of-the-art solvers.
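The core building block is a kernel integral operator, roughly v'(x) = σ(W v(x) + ∫ κ(x, y) v(y) dy), with the kernel κ parameterized by a small network and the integral estimated over sampled points. Below is a minimal dense sketch of one such layer; the actual graph kernel network uses message passing on sampled graphs for efficiency.
```python
import torch
import torch.nn as nn

class KernelIntegralLayer(nn.Module):
    """One layer of the form v'(x) = relu(W v(x) + mean_y kappa(x, y) v(y)),
    where kappa is an MLP emitting a dim x dim matrix per point pair.
    Dense O(n^2) sketch of the operator-learning idea."""
    def __init__(self, dim=32, coord_dim=2, hidden=64):
        super().__init__()
        self.w = nn.Linear(dim, dim)
        self.kernel = nn.Sequential(
            nn.Linear(2 * coord_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, dim * dim))
        self.dim = dim

    def forward(self, coords, v):
        # coords: (n, coord_dim) point locations; v: (n, dim) features.
        n = coords.size(0)
        pairs = torch.cat([coords[:, None].expand(n, n, -1),
                           coords[None, :].expand(n, n, -1)], dim=-1)
        k = self.kernel(pairs).view(n, n, self.dim, self.dim)
        # Monte Carlo estimate of the integral over the sampled points.
        integral = torch.einsum("xyij,yj->xi", k, v) / n
        return torch.relu(self.w(v) + integral)
```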
arXiv Detail & Related papers (2020-03-07T01:56:20Z)
This list is automatically generated from the titles and abstracts of the papers on this site.