Learning Neural Set Functions Under the Optimal Subset Oracle
- URL: http://arxiv.org/abs/2203.01693v4
- Date: Tue, 23 May 2023 09:16:35 GMT
- Title: Learning Neural Set Functions Under the Optimal Subset Oracle
- Authors: Zijing Ou, Tingyang Xu, Qinliang Su, Yingzhen Li, Peilin Zhao, Yatao
Bian
- Abstract summary: We present a principled yet practical maximum likelihood learning framework, termed as EquiVSet.
Our framework meets the following desideratas of learning set functions under the OS oracle.
Thanks to the elegant combination of these advanced architectures, empirical studies on three real-world applications demonstrate that EquiVSet outperforms the baselines by a large margin.
- Score: 48.20868958542155
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Learning neural set functions becomes increasingly more important in many
applications like product recommendation and compound selection in AI-aided
drug discovery. The majority of existing works study methodologies of set
function learning under the function value oracle, which, however, requires
expensive supervision signals. This renders it impractical for applications
with only weak supervisions under the Optimal Subset (OS) oracle, the study of
which is surprisingly overlooked. In this work, we present a principled yet
practical maximum likelihood learning framework, termed as EquiVSet, that
simultaneously meets the following desiderata of learning set functions under
the OS oracle: i) permutation invariance of the set mass function being
modeled; ii) permission of varying ground set; iii) minimum prior; and iv)
scalability. The main components of our framework involve: an energy-based
treatment of the set mass function, DeepSet-style architectures to handle
permutation invariance, mean-field variational inference, and its amortized
variants. Thanks to the elegant combination of these advanced architectures,
empirical studies on three real-world applications (including Amazon product
recommendation, set anomaly detection, and compound selection for virtual
screening) demonstrate that EquiVSet outperforms the baselines by a large
margin.
Related papers
- "FRAME: Forward Recursive Adaptive Model Extraction-A Technique for Advance Feature Selection" [0.0]
This study introduces a novel hybrid approach, the Forward Recursive Adaptive Model Extraction Technique (FRAME)
FRAME combines Forward Selection and Recursive Feature Elimination to enhance feature selection across diverse datasets.
The results demonstrate that FRAME consistently delivers superior predictive performance based on downstream machine learning evaluation metrics.
arXiv Detail & Related papers (2025-01-21T08:34:10Z) - Enhancing Unsupervised Graph Few-shot Learning via Set Functions and Optimal Transport [23.36436403062214]
Recent advancements in graph few-shot learning models have exhibited superior performance across diverse applications.
We propose a novel model named STAR, which enhances unsupervised graph few-shot learning.
arXiv Detail & Related papers (2025-01-10T00:42:27Z) - Adaptive Real-Time Multi-Loss Function Optimization Using Dynamic Memory Fusion Framework: A Case Study on Breast Cancer Segmentation [0.0]
We propose a novel framework called dynamic memory fusion for adaptive multi-loss function penalizing in real-time.
Experiments on breast ultrasound datasets demonstrate that the framework improves segmentation performance across various metrics.
arXiv Detail & Related papers (2024-10-10T11:23:04Z) - Enhancing Neural Subset Selection: Integrating Background Information into Set Representations [53.15923939406772]
We show that when the target value is conditioned on both the input set and subset, it is essential to incorporate an textitinvariant sufficient statistic of the superset into the subset of interest.
This ensures that the output value remains invariant to permutations of the subset and its corresponding superset, enabling identification of the specific superset from which the subset originated.
arXiv Detail & Related papers (2024-02-05T16:09:35Z) - Learning Objective-Specific Active Learning Strategies with Attentive
Neural Processes [72.75421975804132]
Learning Active Learning (LAL) suggests to learn the active learning strategy itself, allowing it to adapt to the given setting.
We propose a novel LAL method for classification that exploits symmetry and independence properties of the active learning problem.
Our approach is based on learning from a myopic oracle, which gives our model the ability to adapt to non-standard objectives.
arXiv Detail & Related papers (2023-09-11T14:16:37Z) - FAStEN: An Efficient Adaptive Method for Feature Selection and Estimation in High-Dimensional Functional Regressions [7.674715791336311]
We propose a new, flexible and ultra-efficient approach to perform feature selection in a sparse function-on-function regression problem.
We show how to extend it to the scalar-on-function framework.
We present an application to brain fMRI data from the AOMIC PIOP1 study.
arXiv Detail & Related papers (2023-03-26T19:41:17Z) - Context-Aware Ensemble Learning for Time Series [11.716677452529114]
We introduce a new approach using a meta learner that effectively combines the base model predictions via using a superset of the features that is the union of the base models' feature vectors instead of the predictions themselves.
Our model does not use the predictions of the base models as inputs to a machine learning algorithm, but choose the best possible combination at each time step based on the state of the problem.
arXiv Detail & Related papers (2022-11-30T10:36:13Z) - Efficient Model-Based Multi-Agent Mean-Field Reinforcement Learning [89.31889875864599]
We propose an efficient model-based reinforcement learning algorithm for learning in multi-agent systems.
Our main theoretical contributions are the first general regret bounds for model-based reinforcement learning for MFC.
We provide a practical parametrization of the core optimization problem.
arXiv Detail & Related papers (2021-07-08T18:01:02Z) - Rank-Based Multi-task Learning for Fair Regression [9.95899391250129]
We develop a novel learning approach for multi-taskart regression models based on a biased dataset.
We use a popular non-parametric oracle-based non-world multipliers dataset.
arXiv Detail & Related papers (2020-09-23T22:32:57Z) - A Trainable Optimal Transport Embedding for Feature Aggregation and its
Relationship to Attention [96.77554122595578]
We introduce a parametrized representation of fixed size, which embeds and then aggregates elements from a given input set according to the optimal transport plan between the set and a trainable reference.
Our approach scales to large datasets and allows end-to-end training of the reference, while also providing a simple unsupervised learning mechanism with small computational cost.
arXiv Detail & Related papers (2020-06-22T08:35:58Z) - Learning to Learn Kernels with Variational Random Features [118.09565227041844]
We introduce kernels with random Fourier features in the meta-learning framework to leverage their strong few-shot learning ability.
We formulate the optimization of MetaVRF as a variational inference problem.
We show that MetaVRF delivers much better, or at least competitive, performance compared to existing meta-learning alternatives.
arXiv Detail & Related papers (2020-06-11T18:05:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.