Word embeddings for topic modeling: an application to the estimation of
the economic policy uncertainty index
- URL: http://arxiv.org/abs/2111.00057v1
- Date: Fri, 29 Oct 2021 19:31:03 GMT
- Title: Word embeddings for topic modeling: an application to the estimation of
the economic policy uncertainty index
- Authors: Hairo U. Miranda Belmonte and Victor Mu\~niz-S\'anchez and Francisco
Corona
- Abstract summary: Quantification of economic uncertainty is a key concept for the prediction of macro economic variables such as GDP.
Economic policy uncertainty (EPU) index is the most used newspaper-based indicator to quantify the uncertainty.
We propose a methodology to estimate the EPU index, which incorporates a fast and efficient method for topic modeling of digital news.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Quantification of economic uncertainty is a key concept for the prediction of
macro economic variables such as gross domestic product (GDP), and it becomes
particularly relevant on real-time or short-time predictions methodologies,
such as nowcasting, where it is required a large amount of time series data,
commonly with different structures and frequencies. Most of the data comes from
the official agencies statistics and non-public institutions, however, relying
our estimates in just the traditional data mentioned before, have some
disadvantages. One of them is that economic uncertainty could not be
represented or measured in a proper way based solely in financial or
macroeconomic data, another one, is that they are susceptible to lack of
information due to extraordinary events, such as the current COVID-19 pandemic.
For these reasons, it is very common nowadays to use some non-traditional data
from different sources, such as social networks or digital newspapers, in
addition to the traditional data from official sources. The economic policy
uncertainty (EPU) index, is the most used newspaper-based indicator to quantify
the uncertainty, and is based on topic modeling of newspapers. In this paper,
we propose a methodology to estimate the EPU index, which incorporates a fast
and efficient method for topic modeling of digital news based on semantic
clustering with word embeddings, allowing to update the index in real-time,
which is a drawback with another proposals that use computationally intensive
methods for topic modeling, such as Latent Dirichlet Allocation (LDA). We show
that our proposal allow us to update the index and significantly reduces the
time required for new document assignation into topics.
Related papers
- Context is Key: A Benchmark for Forecasting with Essential Textual Information [87.3175915185287]
"Context is Key" (CiK) is a time series forecasting benchmark that pairs numerical data with diverse types of carefully crafted textual context.
We evaluate a range of approaches, including statistical models, time series foundation models, and LLM-based forecasters.
Our experiments highlight the importance of incorporating contextual information, demonstrate surprising performance when using LLM-based forecasting models, and also reveal some of their critical shortcomings.
arXiv Detail & Related papers (2024-10-24T17:56:08Z) - Editable Fairness: Fine-Grained Bias Mitigation in Language Models [52.66450426729818]
We propose a novel debiasing approach, Fairness Stamp (FAST), which enables fine-grained calibration of individual social biases.
FAST surpasses state-of-the-art baselines with superior debiasing performance.
This highlights the potential of fine-grained debiasing strategies to achieve fairness in large language models.
arXiv Detail & Related papers (2024-08-07T17:14:58Z) - Fast Model Debias with Machine Unlearning [54.32026474971696]
Deep neural networks might behave in a biased manner in many real-world scenarios.
Existing debiasing methods suffer from high costs in bias labeling or model re-training.
We propose a fast model debiasing framework (FMD) which offers an efficient approach to identify, evaluate and remove biases.
arXiv Detail & Related papers (2023-10-19T08:10:57Z) - Conservative Predictions on Noisy Financial Data [6.300716661852326]
Price movements in financial markets are well known to be very noisy.
Traditional rule-learning techniques would seek only high precision rules and refrain from making predictions where their antecedents did not apply.
We apply a similar approach, where a model abstains from making a prediction on data points that it is uncertain on.
arXiv Detail & Related papers (2023-10-18T09:14:19Z) - Advancing Counterfactual Inference through Nonlinear Quantile Regression [77.28323341329461]
We propose a framework for efficient and effective counterfactual inference implemented with neural networks.
The proposed approach enhances the capacity to generalize estimated counterfactual outcomes to unseen data.
Empirical results conducted on multiple datasets offer compelling support for our theoretical assertions.
arXiv Detail & Related papers (2023-06-09T08:30:51Z) - Macroeconomic Predictions using Payments Data and Machine Learning [0.0]
This paper aims to demonstrate that non-traditional and timely data can provide policymakers with sophisticated models to accurately estimate key macroeconomic indicators in near real-time.
We provide a set of econometric tools to mitigate overfitting and interpretability challenges in machine learning models to improve their effectiveness for policy use.
Our models with payments data, nonlinear methods, and tailored cross-validation approaches help improve macroeconomic nowcasting accuracy up to 40% -- with higher gains during the COVID-19 period.
arXiv Detail & Related papers (2022-09-02T11:12:10Z) - Weak Supervision in Analysis of News: Application to Economic Policy
Uncertainty [0.0]
Our work focuses on studying the potential of textual data, in particular news pieces, for measuring economic policy uncertainty (EPU)
Economic policy uncertainty is defined as the public's inability to predict the outcomes of their decisions under new policies and future economic fundamentals.
Our work proposes a machine learning based solution involving weak supervision to classify news articles with regards to economic policy uncertainty.
arXiv Detail & Related papers (2022-08-10T09:08:29Z) - Benchmarking Econometric and Machine Learning Methodologies in
Nowcasting [0.0]
Nowcasting can play a key role in giving policymakers timelier insight to data published with a significant time lag.
This paper examines the performance of 12 different methodologies in nowcasting US quarterly GDP growth.
Performance was assessed on three different tumultuous periods in US economic history.
arXiv Detail & Related papers (2022-05-06T15:51:31Z) - Bayesian Bilinear Neural Network for Predicting the Mid-price Dynamics
in Limit-Order Book Markets [84.90242084523565]
Traditional time-series econometric methods often appear incapable of capturing the true complexity of the multi-level interactions driving the price dynamics.
By adopting a state-of-the-art second-order optimization algorithm, we train a Bayesian bilinear neural network with temporal attention.
By addressing the use of predictive distributions to analyze errors and uncertainties associated with the estimated parameters and model forecasts, we thoroughly compare our Bayesian model with traditional ML alternatives.
arXiv Detail & Related papers (2022-03-07T18:59:54Z) - Aggregate Learning for Mixed Frequency Data [0.0]
We propose a mixed-temporal aggregate learning model that predicts economic indicators for smaller areas in real-time.
We find that the proposed model predicts (i) the regional heterogeneity of the labor market condition and (ii) the rapidly changing economic status.
The model can be applied to various tasks, especially economic analysis.
arXiv Detail & Related papers (2021-05-20T08:12:43Z) - Super-App Behavioral Patterns in Credit Risk Models: Financial,
Statistical and Regulatory Implications [110.54266632357673]
We present the impact of alternative data that originates from an app-based marketplace, in contrast to traditional bureau data, upon credit scoring models.
Our results, validated across two countries, show that these new sources of data are particularly useful for predicting financial behavior in low-wealth and young individuals.
arXiv Detail & Related papers (2020-05-09T01:32:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.