Symphony in the Latent Space: Provably Integrating High-dimensional
Techniques with Non-linear Machine Learning Models
- URL: http://arxiv.org/abs/2212.00852v1
- Date: Thu, 1 Dec 2022 20:18:26 GMT
- Title: Symphony in the Latent Space: Provably Integrating High-dimensional
Techniques with Non-linear Machine Learning Models
- Authors: Qiong Wu, Jian Li, Zhenming Liu, Yanhua Li, Mihai Cucuringu
- Abstract summary: This paper revisits building machine learning algorithms that involve interactions between entities.
We show that it is possible to decouple the learning of high-dimensional interactions from the learning of non-linear feature interactions.
- Score: 19.824998167546298
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper revisits building machine learning algorithms that involve
interactions between entities, such as those between financial assets in an
actively managed portfolio, or interactions between users in a social network.
Our goal is to forecast the future evolution of ensembles of multivariate time
series in such applications (e.g., the future return of a financial asset or
the future popularity of a Twitter account). Designing ML algorithms for such
systems requires addressing the challenges of high-dimensional interactions and
non-linearity. Existing approaches usually adopt an ad-hoc approach to
integrating high-dimensional techniques into non-linear models and recent
studies have shown these approaches have questionable efficacy in time-evolving
interacting systems.
To this end, we propose a novel framework, which we dub as the additive
influence model. Under our modeling assumption, we show that it is possible to
decouple the learning of high-dimensional interactions from the learning of
non-linear feature interactions. To learn the high-dimensional interactions, we
leverage kernel-based techniques, with provable guarantees, to embed the
entities in a low-dimensional latent space. To learn the non-linear
feature-response interactions, we generalize prominent machine learning
techniques, including designing a new statistically sound non-parametric method
and an ensemble learning algorithm optimized for vector regressions. Extensive
experiments on two common applications demonstrate that our new algorithms
deliver significantly stronger forecasting power compared to standard and
recently proposed methods.
Related papers
- Robustness Reprogramming for Representation Learning [18.466637575445024]
Given a well-trained deep learning model, can it be reprogrammed to enhance its robustness against adversarial or noisy input perturbations without altering its parameters?
We propose a novel non-linear robust pattern matching technique as a robust alternative.
arXiv Detail & Related papers (2024-10-06T18:19:02Z) - Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities [89.40778301238642]
Model merging is an efficient empowerment technique in the machine learning community.
There is a significant gap in the literature regarding a systematic and thorough review of these techniques.
arXiv Detail & Related papers (2024-08-14T16:58:48Z) - Dynamic and Adaptive Feature Generation with LLM [10.142660254703225]
We propose a dynamic and adaptive feature generation method that enhances the interpretability of the feature generation process.
Our approach broadens the applicability across various data types and tasks and draws advantages over strategic flexibility.
arXiv Detail & Related papers (2024-06-04T20:32:14Z) - Unified Explanations in Machine Learning Models: A Perturbation Approach [0.0]
Inconsistencies between XAI and modeling techniques can have the undesirable effect of casting doubt upon the efficacy of these explainability approaches.
We propose a systematic, perturbation-based analysis against a popular, model-agnostic method in XAI, SHapley Additive exPlanations (Shap)
We devise algorithms to generate relative feature importance in settings of dynamic inference amongst a suite of popular machine learning and deep learning methods, and metrics that allow us to quantify how well explanations generated under the static case hold.
arXiv Detail & Related papers (2024-05-30T16:04:35Z) - Multi-Agent Dynamic Relational Reasoning for Social Robot Navigation [50.01551945190676]
Social robot navigation can be helpful in various contexts of daily life but requires safe human-robot interactions and efficient trajectory planning.
We propose a systematic relational reasoning approach with explicit inference of the underlying dynamically evolving relational structures.
We demonstrate its effectiveness for multi-agent trajectory prediction and social robot navigation.
arXiv Detail & Related papers (2024-01-22T18:58:22Z) - VIRT: Improving Representation-based Models for Text Matching through
Virtual Interaction [50.986371459817256]
We propose a novel textitVirtual InteRacTion mechanism, termed as VIRT, to enable full and deep interaction modeling in representation-based models.
VIRT asks representation-based encoders to conduct virtual interactions to mimic the behaviors as interaction-based models do.
arXiv Detail & Related papers (2021-12-08T09:49:28Z) - Supervised DKRC with Images for Offline System Identification [77.34726150561087]
Modern dynamical systems are becoming increasingly non-linear and complex.
There is a need for a framework to model these systems in a compact and comprehensive representation for prediction and control.
Our approach learns these basis functions using a supervised learning approach.
arXiv Detail & Related papers (2021-09-06T04:39:06Z) - Model-Based Deep Learning [155.063817656602]
Signal processing, communications, and control have traditionally relied on classical statistical modeling techniques.
Deep neural networks (DNNs) use generic architectures which learn to operate from data, and demonstrate excellent performance.
We are interested in hybrid techniques that combine principled mathematical models with data-driven systems to benefit from the advantages of both approaches.
arXiv Detail & Related papers (2020-12-15T16:29:49Z) - Human Trajectory Forecasting in Crowds: A Deep Learning Perspective [89.4600982169]
We present an in-depth analysis of existing deep learning-based methods for modelling social interactions.
We propose two knowledge-based data-driven methods to effectively capture these social interactions.
We develop a large scale interaction-centric benchmark TrajNet++, a significant yet missing component in the field of human trajectory forecasting.
arXiv Detail & Related papers (2020-07-07T17:19:56Z) - Robust Unsupervised Learning of Temporal Dynamic Interactions [21.928675010305543]
In this paper we introduce a model-free metric based on the Procrustes distance for robust representation learning of interactions.
We also introduce an optimal transport based distance metric for comparing between distributions of interaction primitives.
Their usefulness will be demonstrated in unsupervised learning of vehicle-to-vechicle interactions extracted from the Safety Pilot database.
arXiv Detail & Related papers (2020-06-18T02:39:45Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.