Using Sequences of Life-events to Predict Human Lives
- URL: http://arxiv.org/abs/2306.03009v1
- Date: Mon, 5 Jun 2023 16:19:48 GMT
- Title: Using Sequences of Life-events to Predict Human Lives
- Authors: Germans Savcisens, Tina Eliassi-Rad, Lars Kai Hansen, Laust Mortensen,
Lau Lilleholt, Anna Rogers, Ingo Zettler, Sune Lehmann
- Abstract summary: We draw on the most comprehensive registry data in existence to examine the evolution and predictability of human lives.
We create embeddings of life-events in a single vector space showing that this embedding space is robust and highly structured.
Our models allow us to predict diverse outcomes ranging from early mortality to personality nuances, outperforming state-of-the-art models by a wide margin.
- Score: 16.77071145203099
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Over the past decade, machine learning has revolutionized computers' ability
to analyze text through flexible computational models. Due to their structural
similarity to written language, transformer-based architectures have also shown
promise as tools to make sense of a range of multi-variate sequences from
protein-structures, music, electronic health records to weather-forecasts. We
can also represent human lives in a way that shares this structural similarity
to language. From one perspective, lives are simply sequences of events: People
are born, visit the pediatrician, start school, move to a new location, get
married, and so on. Here, we exploit this similarity to adapt innovations from
natural language processing to examine the evolution and predictability of
human lives based on detailed event sequences. We do this by drawing on
arguably the most comprehensive registry data in existence, available for an
entire nation of more than six million individuals across decades. Our data
include information about life-events related to health, education, occupation,
income, address, and working hours, recorded with day-to-day resolution. We
create embeddings of life-events in a single vector space showing that this
embedding space is robust and highly structured. Our models allow us to predict
diverse outcomes ranging from early mortality to personality nuances,
outperforming state-of-the-art models by a wide margin. Using methods for
interpreting deep learning models, we probe the algorithm to understand the
factors that enable our predictions. Our framework allows researchers to
identify new potential mechanisms that impact life outcomes and associated
possibilities for personalized interventions.
Related papers
- Conserving Human Creativity with Evolutionary Generative Algorithms: A Case Study in Music Generation [0.0]
This study explores the application of evolutionary generative algorithms in music production to preserve and enhance human creativity.
By integrating human feedback into Differential Evolution algorithms, we produced six songs that were submitted to international record labels, all of which received contract offers.
arXiv Detail & Related papers (2024-06-09T18:11:05Z) - Large Language Models for Scientific Synthesis, Inference and
Explanation [56.41963802804953]
We show how large language models can perform scientific synthesis, inference, and explanation.
We show that the large language model can augment this "knowledge" by synthesizing from the scientific literature.
This approach has the further advantage that the large language model can explain the machine learning system's predictions.
arXiv Detail & Related papers (2023-10-12T02:17:59Z) - Learning signatures of decision making from many individuals playing the
same game [54.33783158658077]
We design a predictive framework that learns representations to encode an individual's 'behavioral style'
We apply our method to a large-scale behavioral dataset from 1,000 humans playing a 3-armed bandit task.
arXiv Detail & Related papers (2023-02-21T21:41:53Z) - StyleGAN-Human: A Data-Centric Odyssey of Human Generation [96.7080874757475]
This work takes a data-centric perspective and investigates multiple critical aspects in "data engineering"
We collect and annotate a large-scale human image dataset with over 230K samples capturing diverse poses and textures.
We rigorously investigate three essential factors in data engineering for StyleGAN-based human generation, namely data size, data distribution, and data alignment.
arXiv Detail & Related papers (2022-04-25T17:55:08Z) - A-ACT: Action Anticipation through Cycle Transformations [89.83027919085289]
We take a step back to analyze how the human capability to anticipate the future can be transferred to machine learning algorithms.
A recent study on human psychology explains that, in anticipating an occurrence, the human brain counts on both systems.
In this work, we study the impact of each system for the task of action anticipation and introduce a paradigm to integrate them in a learning framework.
arXiv Detail & Related papers (2022-04-02T21:50:45Z) - Practical Challenges in Differentially-Private Federated Survival
Analysis of Medical Data [57.19441629270029]
In this paper, we take advantage of the inherent properties of neural networks to federate the process of training of survival analysis models.
In the realistic setting of small medical datasets and only a few data centers, this noise makes it harder for the models to converge.
We propose DPFed-post which adds a post-processing stage to the private federated learning scheme.
arXiv Detail & Related papers (2022-02-08T10:03:24Z) - Physion: Evaluating Physical Prediction from Vision in Humans and
Machines [46.19008633309041]
We present a visual and physical prediction benchmark that precisely measures this capability.
We compare an array of algorithms on their ability to make diverse physical predictions.
We find that graph neural networks with access to the physical state best capture human behavior.
arXiv Detail & Related papers (2021-06-15T16:13:39Z) - Interpretable machine learning for high-dimensional trajectories of
aging health [0.0]
We have built a computational model for individual aging trajectories of health and survival.
It contains physical, functional, and biological variables, and is conditioned on demographic, lifestyle, and medical background information.
Our model is scalable to large longitudinal data sets and infers an interpretable network of directed interactions between the health variables.
arXiv Detail & Related papers (2021-05-07T17:42:15Z) - Health State Estimation [2.463876252896007]
dissertation presents an approach to build a personal model and dynamically estimate the health state of an individual.
The system is stitched together from four essential abstraction elements.
Experiments demonstrate the use of dense and heterogeneous real-world data to monitor individual cardiovascular health state.
arXiv Detail & Related papers (2020-03-16T21:06:32Z) - Uncovering life-course patterns with causal discovery and survival
analysis [5.627346969563955]
We provide a novel approach to modelling life event choices and occurrence from a probabilistic perspective.
Data comes from a survey conducted in Dortmund, Germany.
arXiv Detail & Related papers (2020-01-30T15:30:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.