DSC-IITISM at FinCausal 2021: Combining POS tagging with Attention-based
Contextual Representations for Identifying Causal Relationships in Financial
Documents
- URL: http://arxiv.org/abs/2111.00490v1
- Date: Sun, 31 Oct 2021 13:09:19 GMT
- Title: DSC-IITISM at FinCausal 2021: Combining POS tagging with Attention-based
Contextual Representations for Identifying Causal Relationships in Financial
Documents
- Authors: Gunjan Haldar, Aman Mittal and Pradyumna Gupta
- Abstract summary: Causality detection has applications in information retrieval, event prediction, question answering, financial analysis, and market research.
In this study, we explore several methods to identify and extract cause-effect pairs in financial documents using transformers.
Our best methodology achieves an F1-Score of 0.9551, and an Exact Match Score of 0.8777 on the blind test.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Causality detection draws plenty of attention in the field of Natural
Language Processing and linguistics research. It has essential applications in
information retrieval, event prediction, question answering, financial
analysis, and market research. In this study, we explore several methods to
identify and extract cause-effect pairs in financial documents using
transformers. For this purpose, we propose an approach that combines POS
tagging with the BIO scheme, which can be integrated with modern transformer
models to address this challenge of identifying causality in a given text. Our
best methodology achieves an F1-Score of 0.9551, and an Exact Match Score of
0.8777 on the blind test in the FinCausal-2021 Shared Task at the FinCausal
2021 Workshop.
Related papers
- ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich
Document Images [198.35937007558078]
The competition opened on 30th December, 2022 and closed on 24th March, 2023.
There are 35 participants and 91 valid submissions received for Track 1, and 15 participants and 26 valid submissions received for Track 2.
According to the performance of the submissions, we believe there is still a large gap on the expected information extraction performance for complex and zero-shot scenarios.
arXiv Detail & Related papers (2023-06-05T22:20:52Z) - ConvFinQA: Exploring the Chain of Numerical Reasoning in Conversational
Finance Question Answering [70.6359636116848]
We propose a new large-scale dataset, ConvFinQA, to study the chain of numerical reasoning in conversational question answering.
Our dataset poses great challenge in modeling long-range, complex numerical reasoning paths in real-world conversations.
arXiv Detail & Related papers (2022-10-07T23:48:50Z) - RuArg-2022: Argument Mining Evaluation [69.87149207721035]
This paper is a report of the organizers on the first competition of argumentation analysis systems dealing with Russian language texts.
A corpus containing 9,550 sentences (comments on social media posts) on three topics related to the COVID-19 pandemic was prepared.
The system that won the first place in both tasks used the NLI (Natural Language Inference) variant of the BERT architecture.
arXiv Detail & Related papers (2022-06-18T17:13:37Z) - FinBERT-MRC: financial named entity recognition using BERT under the
machine reading comprehension paradigm [8.17576814961648]
We formulate the FinNER task as a machine reading comprehension (MRC) problem and propose a new model termed FinBERT-MRC.
This formulation introduces significant prior information by utilizing well-designed queries, and extracts start index and end index of target entities.
We conduct experiments on a publicly available Chinese financial dataset ChFinAnn and a real-word dataset AdminPunish.
arXiv Detail & Related papers (2022-05-31T00:44:57Z) - A Generative Approach for Financial Causality Extraction [25.341822852612225]
Causality represents the foremost relation between events in financial documents.
We propose a generative approach for causality extraction using the encoder-decoder framework and pointer networks.
arXiv Detail & Related papers (2022-04-12T10:05:41Z) - Exploring Multi-Modal Representations for Ambiguity Detection &
Coreference Resolution in the SIMMC 2.0 Challenge [60.616313552585645]
We present models for effective Ambiguity Detection and Coreference Resolution in Conversational AI.
Specifically, we use TOD-BERT and LXMERT based models, compare them to a number of baselines and provide ablation experiments.
Our results show that (1) language models are able to exploit correlations in the data to detect ambiguity; and (2) unimodal coreference resolution models can avoid the need for a vision component.
arXiv Detail & Related papers (2022-02-25T12:10:02Z) - Overview of ADoBo 2021: Automatic Detection of Unassimilated Borrowings
in the Spanish Press [8.950918531231158]
This paper summarizes the main findings of the ADoBo 2021 shared task, proposed in the context of IberLef 2021.
In this task, we invited participants to detect lexical borrowings (coming mostly from English) in Spanish newswire texts.
We provided participants with an annotated corpus of lexical borrowings which we split into training, development and test splits.
arXiv Detail & Related papers (2021-10-29T11:07:59Z) - FinQA: A Dataset of Numerical Reasoning over Financial Data [52.7249610894623]
We focus on answering deep questions over financial data, aiming to automate the analysis of a large corpus of financial documents.
We propose a new large-scale dataset, FinQA, with Question-Answering pairs over Financial reports, written by financial experts.
The results demonstrate that popular, large, pre-trained models fall far short of expert humans in acquiring finance knowledge.
arXiv Detail & Related papers (2021-09-01T00:08:14Z) - Financial Document Causality Detection Shared Task (FinCausal 2020) [0.0]
This workshop is associated to the Joint Workshop on Financial Narrative Processing and MultiLing Financial Summarisation (FNP-FNS 2020)
We present the FinCausal 2020 Shared Task on Causality Detection in Financial Documents and the associated FinCausal dataset.
arXiv Detail & Related papers (2020-12-04T10:17:42Z) - IIT_kgp at FinCausal 2020, Shared Task 1: Causality Detection using
Sentence Embeddings in Financial Reports [0.0]
This work is associated with the first sub-task of identifying causality in sentences.
BERT (Large) performed the best, giving a F1 score of 0.958, in the task of detecting the causality of sentences in financial texts and reports.
arXiv Detail & Related papers (2020-11-16T00:57:14Z) - Explaining Relationships Between Scientific Documents [55.23390424044378]
We address the task of explaining relationships between two scientific documents using natural language text.
In this paper we establish a dataset of 622K examples from 154K documents.
arXiv Detail & Related papers (2020-02-02T03:54:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.