Learning Semantic Text Similarity to rank Hypernyms of Financial Terms
- URL: http://arxiv.org/abs/2303.13475v2
- Date: Sat, 12 Aug 2023 23:51:53 GMT
- Title: Learning Semantic Text Similarity to rank Hypernyms of Financial Terms
- Authors: Sohom Ghosh, Ankush Chopra, Sudip Kumar Naskar
- Abstract summary: We propose a system capable of extracting and ranking hypernyms for a given financial term.
The system has been trained with financial text corpora obtained from various sources like DBpedia.
A novel approach has been used to augment the training set with negative samples.
- Score: 0.23940819037450983
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Over the years, there has been a paradigm shift in how users access financial
services. With the advancement of digitalization more users have been
preferring the online mode of performing financial activities. This has led to
the generation of a huge volume of financial content. Most investors prefer to
go through these contents before making decisions. Every industry has terms
that are specific to the domain it operates in. Banking and Financial Services
are not an exception to this. In order to fully comprehend these contents, one
needs to have a thorough understanding of the financial terms. Getting a basic
idea about a term becomes easy when it is explained with the help of the broad
category to which it belongs. This broad category is referred to as hypernym.
For example, "bond" is a hypernym of the financial term "alternative
debenture". In this paper, we propose a system capable of extracting and
ranking hypernyms for a given financial term. The system has been trained with
financial text corpora obtained from various sources like DBpedia [4],
Investopedia, Financial Industry Business Ontology (FIBO), prospectus and so
on. Embeddings of these terms have been extracted using FinBERT [3], FinISH [1]
and fine-tuned using SentenceBERT [54]. A novel approach has been used to
augment the training set with negative samples. It uses the hierarchy present
in FIBO. Finally, we benchmark the system performance with that of the existing
ones. We establish that it performs better than the existing ones and is also
scalable.
Related papers
- A Survey of Large Language Models for Financial Applications: Progress, Prospects and Challenges [60.546677053091685]
Large language models (LLMs) have unlocked novel opportunities for machine learning applications in the financial domain.
We explore the application of LLMs on various financial tasks, focusing on their potential to transform traditional practices and drive innovation.
We highlight this survey for categorizing the existing literature into key application areas, including linguistic tasks, sentiment analysis, financial time series, financial reasoning, agent-based modeling, and other applications.
arXiv Detail & Related papers (2024-06-15T16:11:35Z) - AlphaFin: Benchmarking Financial Analysis with Retrieval-Augmented Stock-Chain Framework [48.3060010653088]
We release AlphaFin datasets, combining traditional research datasets, real-time financial data, and handwritten chain-of-thought (CoT) data.
We then use AlphaFin datasets to benchmark a state-of-the-art method, called Stock-Chain, for effectively tackling the financial analysis task.
arXiv Detail & Related papers (2024-03-19T09:45:33Z) - FinBen: A Holistic Financial Benchmark for Large Language Models [75.09474986283394]
FinBen is the first extensive open-source evaluation benchmark, including 36 datasets spanning 24 financial tasks.
FinBen offers several key innovations: a broader range of tasks and datasets, the first evaluation of stock trading, novel agent and Retrieval-Augmented Generation (RAG) evaluation, and three novel open-source evaluation datasets for text summarization, question answering, and stock trading.
arXiv Detail & Related papers (2024-02-20T02:16:16Z) - FinEntity: Entity-level Sentiment Classification for Financial Texts [15.467477195487763]
In the financial domain, conducting entity-level sentiment analysis is crucial for accurately assessing the sentiment directed toward a specific financial entity.
We introduce an entity-level sentiment classification dataset, called textbfFinEntity, that annotates financial entity spans and their sentiment in financial news.
arXiv Detail & Related papers (2023-10-19T01:38:40Z) - DICoE@FinSim-3: Financial Hypernym Detection using Augmented Terms and
Distance-based Features [2.6599014990168834]
We present the submission of team DICoE for FinSim-3, the 3rd Shared Task on Learning Semantic Similarities for the Financial Domain.
The task provides a set of terms in the financial domain and requires to classify them into the most relevant hypernym from a financial ontology.
Our best-performing submission ranked 4th on the task's leaderboard.
arXiv Detail & Related papers (2021-09-30T08:01:48Z) - FinQA: A Dataset of Numerical Reasoning over Financial Data [52.7249610894623]
We focus on answering deep questions over financial data, aiming to automate the analysis of a large corpus of financial documents.
We propose a new large-scale dataset, FinQA, with Question-Answering pairs over Financial reports, written by financial experts.
The results demonstrate that popular, large, pre-trained models fall far short of expert humans in acquiring finance knowledge.
arXiv Detail & Related papers (2021-09-01T00:08:14Z) - Term Expansion and FinBERT fine-tuning for Hypernym and Synonym Ranking
of Financial Terms [0.0]
We present systems that attempt to solve Hypernym and synonym matching problem.
We designed these systems to participate in the FinSim-3, a shared task of FinNLP workshop at IJCAI-2021.
Our best performing model (Accuracy: 0.917, Rank: 1.156) was developed by fine-tuning SentenceBERT [Reimers et al., 2019] over an extended labelled set created using the hierarchy of labels present in FIBO.
arXiv Detail & Related papers (2021-07-29T06:17:44Z) - JSI at the FinSim-2 task: Ontology-Augmented Financial Concept
Classification [2.2559617939136505]
Ontologies are increasingly used for machine reasoning over the last few years.
This paper presents a practical use of an ontology for a classification problem from the financial domain.
We propose a method that maps given concepts to the mentioned explanations and performs a graph search for the most relevant hypernyms.
arXiv Detail & Related papers (2021-06-17T03:56:15Z) - FinMatcher at FinSim-2: Hypernym Detection in the Financial Services
Domain using Knowledge Graphs [1.2891210250935146]
This paper presents the FinMatcher system and its results for the FinSim 2021 shared task.
The FinSim-2 shared task consists of a set of concept labels from the financial services domain.
The goal is to find the most relevant top-level concept from a given set of concepts.
arXiv Detail & Related papers (2021-03-02T08:56:28Z) - Supporting Financial Inclusion with Graph Machine Learning and Super-App
Alternative Data [63.942632088208505]
Super-Apps have changed the way we think about the interactions between users and commerce.
This paper investigates how different interactions between users within a Super-App provide a new source of information to predict borrower behavior.
arXiv Detail & Related papers (2021-02-19T15:13:06Z) - NLP in FinTech Applications: Past, Present and Future [50.27357144360525]
We focus on the researches applying natural language processing (NLP) technologies in the finance domain.
We go through the application scenarios from three aspects including Know Your Customer (KYC), Know Your Product (KYP), and Satisfy Your Customer (SYC)
arXiv Detail & Related papers (2020-05-04T08:37:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.