Related papers: xtdml: Double Machine Learning Estimation to Static Panel Data Models with Fixed Effects in R

URL: http://arxiv.org/abs/2512.15965v1
Date: Wed, 17 Dec 2025 20:48:40 GMT
Title: xtdml: Double Machine Learning Estimation to Static Panel Data Models with Fixed Effects in R
Authors: Annalivia Polselli
Abstract summary: The paper presents the R package xtdml, which implements DML methods for partially linear panel regression models. The package provides functionalities to: (a) learn nuisance functions with machine learning algorithms from the mlr3 ecosystem. We showcase the use of xtdml with both simulated and real longitudinal data.
Score: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The double machine learning (DML) method combines the predictive power of machine learning with statistical estimation to conduct inference about the structural parameter of interest. This paper presents the R package `xtdml`, which implements the DML methods proposed by Clarke and Polselli (2025) for partially linear panel regression models with low-dimensional fixed effects and high-dimensional confounding variables. The package provides functionalities to: (a) learn nuisance functions with machine learning algorithms from the `mlr3` ecosystem; (b) handle unobserved individual heterogeneity by choosing among the first-difference transformation, the within-group transformation, and correlated random effects; and (c) transform the covariates with min-max normalization and polynomial expansion to improve learning performance. We showcase the use of `xtdml` with both simulated and real longitudinal data.
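
The transformations named in items (b) and (c) of the abstract can be illustrated with a minimal sketch. This is not the `xtdml` API: the function names `within_transform`, `first_difference` and `min_max` and the toy panel are hypothetical, written in plain Python only to show what each transformation does to a single variable.

```python
from collections import defaultdict

# Hypothetical helpers for illustration only; they are NOT part of xtdml.

def within_transform(ids, values):
    """Within-group transformation: subtract each unit's own time mean."""
    sums = defaultdict(float)
    counts = defaultdict(int)
    for unit, v in zip(ids, values):
        sums[unit] += v
        counts[unit] += 1
    return [v - sums[unit] / counts[unit] for unit, v in zip(ids, values)]

def first_difference(ids, values):
    """First-difference transformation within each unit.

    Assumes rows are sorted by unit and then by time; the first
    observation of every unit is dropped.
    """
    return [
        values[k] - values[k - 1]
        for k in range(1, len(values))
        if ids[k] == ids[k - 1]
    ]

def min_max(values):
    """Min-max normalization to [0, 1]; a constant column maps to zeros."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]

# Toy panel (assumed data): two units, three periods, y_it = alpha_i + t.
ids = [1, 1, 1, 2, 2, 2]
y = [11.0, 12.0, 13.0, 51.0, 52.0, 53.0]

y_within = within_transform(ids, y)  # unit-specific level removed
y_fd = first_difference(ids, y)      # unit-specific level removed
y_scaled = min_max(y)                # rescaled to the unit interval
```

In this toy panel both the within-group and the first-difference transformation remove the unit-specific level (10 for unit 1, 50 for unit 2), which is the sense in which they handle time-invariant unobserved heterogeneity. Min-max normalization only rescales the variable.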

Related papers

Functional effects models: Accounting for preference heterogeneity in panel data with machine learning [0.0]
We present a general specification for Functional Effects Models, which use Machine Learning (ML) methodologies to learn individual-specific preference parameters from socio-demographic characteristics. We identify three specific advantages of the Functional Effects Model over traditional fixed and random/mixed effects models.
arXiv Detail & Related papers (2025-09-22T17:22:18Z)
LOCAL: Learning with Orientation Matrix to Infer Causal Structure from Time Series Data [51.47827479376251]
LOCAL is a highly efficient, easy-to-implement, and constraint-free method for recovering dynamic causal structures. Asymptotic Causal Learning Mask (ACML) and Dynamic Graph Learning (DGPL). Experiments on synthetic and real-world datasets demonstrate that LOCAL significantly outperforms existing methods.
arXiv Detail & Related papers (2024-10-25T10:48:41Z)
Scaling and renormalization in high-dimensional regression [72.59731158970894]
We present a unifying perspective on recent results on ridge regression. We use the basic tools of random matrix theory and free probability, aimed at readers with backgrounds in physics and deep learning. Our results extend and provide a unifying perspective on earlier models of scaling laws.
arXiv Detail & Related papers (2024-05-01T15:59:00Z)
Sample Complexity Characterization for Linear Contextual MDPs [67.79455646673762]
Contextual decision processes (CMDPs) describe a class of reinforcement learning problems in which the transition kernels and reward functions can change over time with different MDPs indexed by a context variable. CMDPs serve as an important framework to model many real-world applications with time-varying environments. We study CMDPs under two linear function approximation models: Model I with context-varying representations and common linear weights for all contexts; and Model II with common representations for all contexts and context-varying linear weights.
arXiv Detail & Related papers (2024-02-05T03:25:04Z)
Double Machine Learning for Static Panel Models with Fixed Effects [0.0]
We develop novel machine learning procedures for panel data. New procedures are extensions of the well-known correlated random effects, within-group and first-difference estimators. We use our procedures to re-estimate the impact of minimum wage on voting behaviour in the UK.
arXiv Detail & Related papers (2023-12-13T14:34:12Z)
Sparse high-dimensional linear regression with a partitioned empirical Bayes ECM algorithm [62.997667081978825]
We propose a computationally efficient and powerful Bayesian approach for sparse high-dimensional linear regression. Minimal prior assumptions on the parameters are used through the use of plug-in empirical Bayes estimates. The proposed approach is implemented in the R package probe.
arXiv Detail & Related papers (2022-09-16T19:15:50Z)
Learning to Refit for Convex Learning Problems [11.464758257681197]
We propose a framework to learn to estimate optimized model parameters for different training sets using neural networks. We rigorously characterize the power of neural networks to approximate convex problems.
arXiv Detail & Related papers (2021-11-24T15:28:50Z)
Double Machine Learning for Partially Linear Mixed-Effects Models with Repeated Measurements [0.0]
We use machine learning algorithms to incorporate more complex interaction structures and high-dimensional variables. The adjusted variables satisfy a linear mixed-effects model, where the linear coefficient can be estimated with standard linear mixed-effects techniques.
arXiv Detail & Related papers (2021-08-31T07:41:36Z)
DoubleML -- An Object-Oriented Implementation of Double Machine Learning in R [4.830430752756141]
R package DoubleML implements the double/debiased machine learning framework. It provides functionalities to estimate parameters in causal models based on machine learning methods.
arXiv Detail & Related papers (2021-03-17T12:42:41Z)
Sparse PCA via $l_{2,p}$-Norm Regularization for Unsupervised Feature Selection [138.97647716793333]
We propose a simple and efficient unsupervised feature selection method, by combining reconstruction error with $l_{2,p}$-norm regularization. We present an efficient optimization algorithm to solve the proposed unsupervised model, and analyse the convergence and computational complexity of the algorithm theoretically.
arXiv Detail & Related papers (2020-12-29T04:08:38Z)
Generalized Matrix Factorization: efficient algorithms for fitting generalized linear latent variable models to large data arrays [62.997667081978825]
Generalized Linear Latent Variable models (GLLVMs) generalize such factor models to non-Gaussian responses. Current algorithms for estimating model parameters in GLLVMs require intensive computation and do not scale to large datasets. We propose a new approach for fitting GLLVMs to high-dimensional datasets, based on approximating the model using penalized quasi-likelihood.
arXiv Detail & Related papers (2020-10-06T04:28:19Z)

This list is automatically generated from the titles and abstracts of the papers on this site.