Design-unbiased statistical learning in survey sampling
- URL: http://arxiv.org/abs/2003.11423v1
- Date: Wed, 25 Mar 2020 14:27:39 GMT
- Title: Design-unbiased statistical learning in survey sampling
- Authors: Luis Sanguiao Sande and Li-Chun Zhang
- Abstract summary: We propose a subsampling Rao-Blackwell method, and develop a statistical learning theory for exactly design-unbiased estimation.
Our approach makes use of classic ideas from Statistical Science as well as the rapidly growing field of Machine Learning.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Design-consistent model-assisted estimation has become the standard practice
in survey sampling. However, a general theory is lacking so far, which allows
one to incorporate modern machine-learning techniques that can lead to
potentially much more powerful assisting models. We propose a subsampling
Rao-Blackwell method, and develop a statistical learning theory for exactly
design-unbiased estimation with the help of linear or non-linear prediction
models. Our approach makes use of classic ideas from Statistical Science as
well as the rapidly growing field of Machine Learning. Provided rich auxiliary
information, it can yield considerable efficiency gains over standard linear
model-assisted methods, while ensuring valid estimation for the given target
population, which is robust against potential mis-specifications of the
assisting model at the individual level.
Related papers
- On the use of Statistical Learning Theory for model selection in Structural Health Monitoring [0.0]
This paper focusses on how well models can generalise in Structural Health Monitoring (SHM)
It is demonstrated that incorporating domain knowledge into the regression problem yields a lower guaranteed risk, thereby enhancing generalisation.
arXiv Detail & Related papers (2025-01-14T11:56:05Z) - Can Strategic Data Collection Improve the Performance of Poverty
Prediction Models? [4.444335418188173]
We test whether adaptive sampling strategies for ground truth data collection can improve the performance of poverty prediction models.
We find that none of these active learning methods improve over uniform-at-random sampling.
We discuss how these results can help shape future efforts to refine machine learning-based estimates of poverty.
arXiv Detail & Related papers (2022-11-16T07:50:56Z) - Design Amortization for Bayesian Optimal Experimental Design [70.13948372218849]
We build off of successful variational approaches, which optimize a parameterized variational model with respect to bounds on the expected information gain (EIG)
We present a novel neural architecture that allows experimenters to optimize a single variational model that can estimate the EIG for potentially infinitely many designs.
arXiv Detail & Related papers (2022-10-07T02:12:34Z) - Planning with Diffusion for Flexible Behavior Synthesis [125.24438991142573]
We consider what it would look like to fold as much of the trajectory optimization pipeline as possible into the modeling problem.
The core of our technical approach lies in a diffusion probabilistic model that plans by iteratively denoising trajectories.
arXiv Detail & Related papers (2022-05-20T07:02:03Z) - Sample-Efficient Reinforcement Learning via Conservative Model-Based
Actor-Critic [67.00475077281212]
Model-based reinforcement learning algorithms are more sample efficient than their model-free counterparts.
We propose a novel approach that achieves high sample efficiency without the strong reliance on accurate learned models.
We show that CMBAC significantly outperforms state-of-the-art approaches in terms of sample efficiency on several challenging tasks.
arXiv Detail & Related papers (2021-12-16T15:33:11Z) - ALT-MAS: A Data-Efficient Framework for Active Testing of Machine
Learning Algorithms [58.684954492439424]
We propose a novel framework to efficiently test a machine learning model using only a small amount of labeled test data.
The idea is to estimate the metrics of interest for a model-under-test using Bayesian neural network (BNN)
arXiv Detail & Related papers (2021-04-11T12:14:04Z) - Learning the Truth From Only One Side of the Story [58.65439277460011]
We focus on generalized linear models and show that without adjusting for this sampling bias, the model may converge suboptimally or even fail to converge to the optimal solution.
We propose an adaptive approach that comes with theoretical guarantees and show that it outperforms several existing methods empirically.
arXiv Detail & Related papers (2020-06-08T18:20:28Z) - Goal-Directed Planning for Habituated Agents by Active Inference Using a
Variational Recurrent Neural Network [5.000272778136268]
This study shows that the predictive coding (PC) and active inference (AIF) frameworks can develop better generalization by learning a prior distribution in a low dimensional latent state space.
In our proposed model, learning is carried out by inferring optimal latent variables as well as synaptic weights for maximizing the evidence lower bound.
Our proposed model was evaluated with both simple and complex robotic tasks in simulation, which demonstrated sufficient generalization in learning with limited training data.
arXiv Detail & Related papers (2020-05-27T06:43:59Z) - Towards Interpretable Deep Learning Models for Knowledge Tracing [62.75876617721375]
We propose to adopt the post-hoc method to tackle the interpretability issue for deep learning based knowledge tracing (DLKT) models.
Specifically, we focus on applying the layer-wise relevance propagation (LRP) method to interpret RNN-based DLKT model.
Experiment results show the feasibility using the LRP method for interpreting the DLKT model's predictions.
arXiv Detail & Related papers (2020-05-13T04:03:21Z) - Structural Regularization [0.0]
We propose a novel method for modeling data by using structural models based on economic theory as regularizers for statistical models.
We show that our method can outperform both the (misspecified) structural model and un-structural-regularized statistical models.
arXiv Detail & Related papers (2020-04-27T06:47:07Z) - Introduction to Rare-Event Predictive Modeling for Inferential
Statisticians -- A Hands-On Application in the Prediction of Breakthrough
Patents [0.0]
We introduce a machine learning (ML) approach to quantitative analysis geared towards optimizing the predictive performance.
We discuss the potential synergies between the two fields against the backdrop of this, at first glance, target-incompatibility.
We are providing a hands-on predictive modeling introduction for a quantitative social science audience while aiming at demystifying computer science jargon.
arXiv Detail & Related papers (2020-03-30T13:06:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.