Self-contained Beta-with-Spikes Approximation for Inference Under a
Wright-Fisher Model
- URL: http://arxiv.org/abs/2303.04691v2
- Date: Thu, 11 May 2023 15:59:00 GMT
- Title: Self-contained Beta-with-Spikes Approximation for Inference Under a
Wright-Fisher Model
- Authors: Juan Guerrero Montero, Richard A. Blythe
- Abstract summary: We construct a reliable estimation of evolutionary parameters within the Wright-Fisher model.
Our method of analysis builds on a Beta-with-Spikes approximation to the distribution of allele frequencies.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We construct a reliable estimation of evolutionary parameters within the
Wright-Fisher model, which describes changes in allele frequencies due to
selection and genetic drift, from time-series data. Such data exists for
biological populations, for example via artificial evolution experiments, and
for the cultural evolution of behavior, such as linguistic corpora that
document historical usage of different words with similar meanings. Our method
of analysis builds on a Beta-with-Spikes approximation to the distribution of
allele frequencies predicted by the Wright-Fisher model. We introduce a
self-contained scheme for estimating the parameters in the approximation, and
demonstrate its robustness with synthetic data, especially in the
strong-selection and near-extinction regimes where previous approaches fail. We
further apply to allele frequency data for baker's yeast (Saccharomyces
cerevisiae), finding a significant signal of selection in cases where
independent evidence supports such a conclusion. We further demonstrate the
possibility of detecting time-points at which evolutionary parameters change in
the context of a historical spelling reform in the Spanish language.
Related papers
- Evolving Voices Based on Temporal Poisson Factorisation [0.0]
We propose the temporal Poisson factorisation (TPF) model as an extension to the factorisation model to model sparse count data matrices.
We discuss in detail results of the TPF model when analysing speeches from 18 sessions in the U.S. Senate (1981-2016)
arXiv Detail & Related papers (2024-10-24T07:21:33Z) - Diffusion posterior sampling for simulation-based inference in tall data settings [53.17563688225137]
Simulation-based inference ( SBI) is capable of approximating the posterior distribution that relates input parameters to a given observation.
In this work, we consider a tall data extension in which multiple observations are available to better infer the parameters of the model.
We compare our method to recently proposed competing approaches on various numerical experiments and demonstrate its superiority in terms of numerical stability and computational cost.
arXiv Detail & Related papers (2024-04-11T09:23:36Z) - Towards Theoretical Understandings of Self-Consuming Generative Models [56.84592466204185]
This paper tackles the emerging challenge of training generative models within a self-consuming loop.
We construct a theoretical framework to rigorously evaluate how this training procedure impacts the data distributions learned by future models.
We present results for kernel density estimation, delivering nuanced insights such as the impact of mixed data training on error propagation.
arXiv Detail & Related papers (2024-02-19T02:08:09Z) - Model Assessment and Selection under Temporal Distribution Shift [1.024113475677323]
We develop an adaptive rolling window approach to estimate the generalization error of a given model.
We also integrate pairwise comparisons into a single-elimination tournament, achieving near-optimal model selection from a collection of candidates.
arXiv Detail & Related papers (2024-02-13T18:54:08Z) - Gluformer: Transformer-Based Personalized Glucose Forecasting with
Uncertainty Quantification [7.451722745955049]
We propose to model the future glucose trajectory conditioned on the past as an infinite mixture of basis distributions.
This change allows us to learn the uncertainty and predict more accurately in the cases when the trajectory has a heterogeneous or multi-modal distribution.
We empirically demonstrate the superiority of our method over existing state-of-the-art techniques both in terms of accuracy and uncertainty on the synthetic and benchmark glucose data sets.
arXiv Detail & Related papers (2022-09-09T21:03:43Z) - TACTiS: Transformer-Attentional Copulas for Time Series [76.71406465526454]
estimation of time-varying quantities is a fundamental component of decision making in fields such as healthcare and finance.
We propose a versatile method that estimates joint distributions using an attention-based decoder.
We show that our model produces state-of-the-art predictions on several real-world datasets.
arXiv Detail & Related papers (2022-02-07T21:37:29Z) - Phylogenetic typology [0.913755431537592]
We propose a novel method to estimate the frequency distribution of linguistic variables.
Unlike previous approaches, our technique uses all available data.
As a case study, we investigate a series of potential word-order correlations across the languages of the world.
arXiv Detail & Related papers (2021-03-18T12:03:49Z) - Leveraging Global Parameters for Flow-based Neural Posterior Estimation [90.21090932619695]
Inferring the parameters of a model based on experimental observations is central to the scientific method.
A particularly challenging setting is when the model is strongly indeterminate, i.e., when distinct sets of parameters yield identical observations.
We present a method for cracking such indeterminacy by exploiting additional information conveyed by an auxiliary set of observations sharing global parameters.
arXiv Detail & Related papers (2021-02-12T12:23:13Z) - Balance-Subsampled Stable Prediction [55.13512328954456]
We propose a novel balance-subsampled stable prediction (BSSP) algorithm based on the theory of fractional factorial design.
A design-theoretic analysis shows that the proposed method can reduce the confounding effects among predictors induced by the distribution shift.
Numerical experiments on both synthetic and real-world data sets demonstrate that our BSSP algorithm significantly outperforms the baseline methods for stable prediction across unknown test data.
arXiv Detail & Related papers (2020-06-08T07:01:38Z) - Decision-Making with Auto-Encoding Variational Bayes [71.44735417472043]
We show that a posterior approximation distinct from the variational distribution should be used for making decisions.
Motivated by these theoretical results, we propose learning several approximate proposals for the best model.
In addition to toy examples, we present a full-fledged case study of single-cell RNA sequencing.
arXiv Detail & Related papers (2020-02-17T19:23:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.