FinEntity: Entity-level Sentiment Classification for Financial Texts
- URL: http://arxiv.org/abs/2310.12406v1
- Date: Thu, 19 Oct 2023 01:38:40 GMT
- Title: FinEntity: Entity-level Sentiment Classification for Financial Texts
- Authors: Yixuan Tang, Yi Yang, Allen H Huang, Andy Tam, Justin Z Tang
- Abstract summary: In the financial domain, conducting entity-level sentiment analysis is crucial for accurately assessing the sentiment directed toward a specific financial entity.
We introduce an entity-level sentiment classification dataset, called textbfFinEntity, that annotates financial entity spans and their sentiment in financial news.
- Score: 15.467477195487763
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In the financial domain, conducting entity-level sentiment analysis is
crucial for accurately assessing the sentiment directed toward a specific
financial entity. To our knowledge, no publicly available dataset currently
exists for this purpose. In this work, we introduce an entity-level sentiment
classification dataset, called \textbf{FinEntity}, that annotates financial
entity spans and their sentiment (positive, neutral, and negative) in financial
news. We document the dataset construction process in the paper. Additionally,
we benchmark several pre-trained models (BERT, FinBERT, etc.) and ChatGPT on
entity-level sentiment classification. In a case study, we demonstrate the
practical utility of using FinEntity in monitoring cryptocurrency markets. The
data and code of FinEntity is available at
\url{https://github.com/yixuantt/FinEntity}
Related papers
- Advancing Anomaly Detection: Non-Semantic Financial Data Encoding with LLMs [49.57641083688934]
We introduce a novel approach to anomaly detection in financial data using Large Language Models (LLMs) embeddings.
Our experiments demonstrate that LLMs contribute valuable information to anomaly detection as our models outperform the baselines.
arXiv Detail & Related papers (2024-06-05T20:19:09Z) - EFSA: Towards Event-Level Financial Sentiment Analysis [27.45063543020029]
We extend financial sentiment analysis(FSA) to event-level since events usually serve as the subject of the sentiment in financial text.
We design a categorization comprising coarse-grained and fine-grained event categories.
A large-scale Chinese dataset containing $12,160$ news articles and $13,725$ quintuples is publicized as a brand new testbed for our task.
arXiv Detail & Related papers (2024-04-08T07:36:26Z) - AlphaFin: Benchmarking Financial Analysis with Retrieval-Augmented Stock-Chain Framework [48.3060010653088]
We release AlphaFin datasets, combining traditional research datasets, real-time financial data, and handwritten chain-of-thought (CoT) data.
We then use AlphaFin datasets to benchmark a state-of-the-art method, called Stock-Chain, for effectively tackling the financial analysis task.
arXiv Detail & Related papers (2024-03-19T09:45:33Z) - FinBen: A Holistic Financial Benchmark for Large Language Models [75.09474986283394]
FinBen is the first extensive open-source evaluation benchmark, including 36 datasets spanning 24 financial tasks.
FinBen offers several key innovations: a broader range of tasks and datasets, the first evaluation of stock trading, novel agent and Retrieval-Augmented Generation (RAG) evaluation, and three novel open-source evaluation datasets for text summarization, question answering, and stock trading.
arXiv Detail & Related papers (2024-02-20T02:16:16Z) - Fin-Fact: A Benchmark Dataset for Multimodal Financial Fact Checking and Explanation Generation [26.573578326262307]
Fin-Fact is a benchmark dataset for multimodal fact-checking within the financial domain.
It includes professional fact-checker annotations and justifications, providing expertise and credibility.
arXiv Detail & Related papers (2023-09-15T22:24:00Z) - FinTree: Financial Dataset Pretrain Transformer Encoder for Relation
Extraction [0.0]
We pretrain FinTree on the financial dataset, adapting the model in financial tasks.
FinTree stands out with its novel structure that predicts a masked token instead of the conventional domain [an] token.
Our experiments demonstrate that FinTree outperforms on the REFinD, a large-scale financial relation extraction dataset.
arXiv Detail & Related papers (2023-07-26T01:48:52Z) - PIXIU: A Large Language Model, Instruction Data and Evaluation Benchmark
for Finance [63.51545277822702]
PIXIU is a comprehensive framework including the first financial large language model (LLMs) based on fine-tuning LLaMA with instruction data.
We propose FinMA by fine-tuning LLaMA with the constructed dataset to be able to follow instructions for various financial tasks.
We conduct a detailed analysis of FinMA and several existing LLMs, uncovering their strengths and weaknesses in handling critical financial tasks.
arXiv Detail & Related papers (2023-06-08T14:20:29Z) - SEntFiN 1.0: Entity-Aware Sentiment Analysis for Financial News [0.03018439717785794]
We make publicly available SEntFiN 1.0, a human-annotated dataset of 10,753 news headlines with entity-sentiment annotations.
We propose a framework that enables the extraction of entity-relevant sentiments using a feature-based approach rather than an expression-based approach.
arXiv Detail & Related papers (2023-05-20T18:20:39Z) - Entity Cloze By Date: What LMs Know About Unseen Entities [79.34707800653597]
Language models (LMs) are typically trained once on a large-scale corpus and used for years without being updated.
We propose a framework to analyze what LMs can infer about new entities that did not exist when the LMs were pretrained.
We derive a dataset of entities indexed by their origination date and paired with their English Wikipedia articles, from which we can find sentences about each entity.
arXiv Detail & Related papers (2022-05-05T17:59:31Z) - FinQA: A Dataset of Numerical Reasoning over Financial Data [52.7249610894623]
We focus on answering deep questions over financial data, aiming to automate the analysis of a large corpus of financial documents.
We propose a new large-scale dataset, FinQA, with Question-Answering pairs over Financial reports, written by financial experts.
The results demonstrate that popular, large, pre-trained models fall far short of expert humans in acquiring finance knowledge.
arXiv Detail & Related papers (2021-09-01T00:08:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.