Related papers: Scaling up Continuous-Time Markov Chains Helps Resolve Underspecification

Scaling up Continuous-Time Markov Chains Helps Resolve Underspecification

URL: http://arxiv.org/abs/2107.02911v1
Date: Tue, 6 Jul 2021 21:14:49 GMT
Title: Scaling up Continuous-Time Markov Chains Helps Resolve Underspecification
Authors: Alkis Gotovos, Rebekka Burkholz, John Quackenbush, and Stefanie Jegelka
Abstract summary: We develop an approximate likelihood method for learning continuous-time Markov chains, which can scale to hundreds of items and is orders of magnitude faster than previous methods. We demonstrate the effectiveness of our approach on synthetic and real cancer data.
Score: 42.97840843148334
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Modeling the time evolution of discrete sets of items (e.g., genetic mutations) is a fundamental problem in many biomedical applications. We approach this problem through the lens of continuous-time Markov chains, and show that the resulting learning task is generally underspecified in the usual setting of cross-sectional data. We explore a perhaps surprising remedy: including a number of additional independent items can help determine time order, and hence resolve underspecification. This is in sharp contrast to the common practice of limiting the analysis to a small subset of relevant items, which is followed largely due to poor scaling of existing methods. To put our theoretical insight into practice, we develop an approximate likelihood maximization method for learning continuous-time Markov chains, which can scale to hundreds of items and is orders of magnitude faster than previous methods. We demonstrate the effectiveness of our approach on synthetic and real cancer data.

Related papers

ULTRA-MC: A Unified Approach to Learning Mixtures of Markov Chains via Hitting Times [13.299820337462833]
We introduce a unifying strategy for learning mixtures of discrete and continuous-time Markov chains. Specifically, we design a reconstruction algorithm that outputs a mixture which accurately reflects the estimated hitting times.
arXiv Detail & Related papers (2024-05-23T22:57:15Z)
Seeing Unseen: Discover Novel Biomedical Concepts via Geometry-Constrained Probabilistic Modeling [53.7117640028211]
We present a geometry-constrained probabilistic modeling treatment to resolve the identified issues. We incorporate a suite of critical geometric properties to impose proper constraints on the layout of constructed embedding space. A spectral graph-theoretic method is devised to estimate the number of potential novel classes.
arXiv Detail & Related papers (2024-03-02T00:56:05Z)
Markovletics: Methods and A Novel Application for Learning Continuous-Time Markov Chain Mixtures [11.131861804842886]
We study learning mixtures of continuous-time Markov chains (CTMCs) CTMCs could model intricate continuous-time processes prevalent in various fields including social media, finance, and biology. We introduce a novel framework for exploring CTMCs, emphasizing the influence of observed trails' length and mixture parameters on problem regimes. We apply our algorithms on an extensive collection of Lastfm's user-generated trails spanning three years, demonstrating the capability of our algorithms to differentiate diverse user preferences.
arXiv Detail & Related papers (2024-02-27T18:04:59Z)
A Dynamical View of the Question of Why [10.243488468625783]
We present two key lemmas to compute causal contributions and frame them as reinforcement learning problems. Our approach offers formal and computational tools for uncovering and quantifying causal relationships in diffusion processes. In fairly intricate experiments and through sheer learning, our framework reveals and quantifies causal links, which otherwise seem inexplicable.
arXiv Detail & Related papers (2024-02-14T18:44:05Z)
Scalable Structure Learning for Sparse Context-Specific Systems [0.0]
We present an algorithm for learning context-specific models that scales to hundreds of variables. Our method is shown to perform well on synthetic data and real world examples.
arXiv Detail & Related papers (2024-02-12T16:28:52Z)
Multi-Label Quantification [78.83284164605473]
Quantification, variously called "labelled prevalence estimation" or "learning to quantify", is the supervised learning task of generating predictors of the relative frequencies of the classes of interest in unsupervised data samples. We propose methods for inferring estimators of class prevalence values that strive to leverage the dependencies among the classes of interest in order to predict their relative frequencies more accurately.
arXiv Detail & Related papers (2022-11-15T11:29:59Z)
Provable Reinforcement Learning with a Short-Term Memory [68.00677878812908]
We study a new subclass of POMDPs, whose latent states can be decoded by the most recent history of a short length $m$. In particular, in the rich-observation setting, we develop new algorithms using a novel "moment matching" approach with a sample complexity that scales exponentially. Our results show that a short-term memory suffices for reinforcement learning in these environments.
arXiv Detail & Related papers (2022-02-08T16:39:57Z)
Contrastive learning of strong-mixing continuous-time stochastic processes [53.82893653745542]
Contrastive learning is a family of self-supervised methods where a model is trained to solve a classification task constructed from unlabeled data. We show that a properly constructed contrastive learning task can be used to estimate the transition kernel for small-to-mid-range intervals in the diffusion case.
arXiv Detail & Related papers (2021-03-03T23:06:47Z)
Meta Cyclical Annealing Schedule: A Simple Approach to Avoiding Meta-Amortization Error [50.83356836818667]
We develop a novel meta-regularization objective using it cyclical annealing schedule and it maximum mean discrepancy (MMD) criterion. The experimental results show that our approach substantially outperforms standard meta-learning algorithms.
arXiv Detail & Related papers (2020-03-04T04:43:16Z)

This list is automatically generated from the titles and abstracts of the papers in this site.