Unifying Likelihood-free Inference with Black-box Sequence Design and
Beyond
- URL: http://arxiv.org/abs/2110.03372v1
- Date: Wed, 6 Oct 2021 02:41:50 GMT
- Title: Unifying Likelihood-free Inference with Black-box Sequence Design and
Beyond
- Authors: Dinghuai Zhang, Jie Fu, Yoshua Bengio, Aaron Courville
- Abstract summary: We propose to unify two seemingly distinct worlds: likelihood-free inference and black-box sequence design.
We show how previous drug discovery approaches can be "reinvented" in our framework, and further propose new probabilistic sequence design algorithms.
- Score: 87.92360111463825
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Black-box optimization formulations for biological sequence design have drawn
recent attention due to their promising potential impact on the pharmaceutical
industry. In this work, we propose to unify two seemingly distinct worlds:
likelihood-free inference and black-box sequence design, under one
probabilistic framework. In tandem, we provide a recipe for constructing
various sequence design methods based on this framework. We show how previous
drug discovery approaches can be "reinvented" in our framework, and further
propose new probabilistic sequence design algorithms. Extensive experiments
illustrate the benefits of the proposed methodology.
Related papers
- Seeing Unseen: Discover Novel Biomedical Concepts via
Geometry-Constrained Probabilistic Modeling [53.7117640028211]
We present a geometry-constrained probabilistic modeling treatment to resolve the identified issues.
We incorporate a suite of critical geometric properties to impose proper constraints on the layout of constructed embedding space.
A spectral graph-theoretic method is devised to estimate the number of potential novel classes.
arXiv Detail & Related papers (2024-03-02T00:56:05Z) - Nesting Particle Filters for Experimental Design in Dynamical Systems [11.179154197435954]
We develop a nested sequential Monte Carlo technique to infer optimal designs, and embed it into a particle Markov chain Monte Carlo framework to perform gradient-based policy amortization.
Our approach is distinct from other amortized experimental design techniques, as it does not rely on contrastive estimators.
arXiv Detail & Related papers (2024-02-12T18:29:17Z) - Beyond Concept Bottleneck Models: How to Make Black Boxes Intervenable? [8.391254800873599]
We introduce a method to perform concept-based interventions on pretrained neural networks, which are not interpretable by design.
We formalise the notion of intervenability as a measure of the effectiveness of concept-based interventions and leverage this definition to fine-tune black boxes.
arXiv Detail & Related papers (2024-01-24T16:02:14Z) - Protein Design with Guided Discrete Diffusion [67.06148688398677]
A popular approach to protein design is to combine a generative model with a discriminative model for conditional sampling.
We propose diffusioN Optimized Sampling (NOS), a guidance method for discrete diffusion models.
NOS makes it possible to perform design directly in sequence space, circumventing significant limitations of structure-based methods.
arXiv Detail & Related papers (2023-05-31T16:31:24Z) - Differentiable Multi-Target Causal Bayesian Experimental Design [43.76697029708785]
We introduce a gradient-based approach for the problem of Bayesian optimal experimental design to learn causal models in a batch setting.
Existing methods rely on greedy approximations to construct a batch of experiments.
We propose a conceptually simple end-to-end gradient-based optimization procedure to acquire a set of optimal intervention target-state pairs.
arXiv Detail & Related papers (2023-02-21T11:32:59Z) - Designing Biological Sequences via Meta-Reinforcement Learning and
Bayesian Optimization [68.28697120944116]
We train an autoregressive generative model via Meta-Reinforcement Learning to propose promising sequences for selection.
We pose this problem as that of finding an optimal policy over a distribution of MDPs induced by sampling subsets of the data.
Our in-silico experiments show that meta-learning over such ensembles provides robustness against reward misspecification and achieves competitive results.
arXiv Detail & Related papers (2022-09-13T18:37:27Z) - Hybrid Predictive Coding: Inferring, Fast and Slow [62.997667081978825]
We propose a hybrid predictive coding network that combines both iterative and amortized inference in a principled manner.
We demonstrate that our model is inherently sensitive to its uncertainty and adaptively balances balances to obtain accurate beliefs using minimum computational expense.
arXiv Detail & Related papers (2022-04-05T12:52:45Z) - Biological Sequence Design with GFlowNets [75.1642973538266]
Design of de novo biological sequences with desired properties often involves an active loop with several rounds of molecule ideation and expensive wet-lab evaluations.
This makes the diversity of proposed candidates a key consideration in the ideation phase.
We propose an active learning algorithm leveraging uncertainty estimation and the recently proposed GFlowNets as a generator of diverse candidate solutions.
arXiv Detail & Related papers (2022-03-02T15:53:38Z) - Efficient Balanced Treatment Assignments for Experimentation [17.3699850124029]
We provide an algorithm that is optimal with respect to the minimum tree test of Friedman and Rafsky.
We provide a novel formulation of estimation as transductive inference and show how the tree structures used in design can also be used in an adjustment estimator.
arXiv Detail & Related papers (2020-10-21T22:06:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.