Research on All-content Text Recognition Method for Financial Ticket
Image
- URL: http://arxiv.org/abs/2012.08168v1
- Date: Tue, 15 Dec 2020 09:39:32 GMT
- Title: Research on All-content Text Recognition Method for Financial Ticket
Image
- Authors: Fukang Tian, Haiyu Wu, Bo Xu
- Abstract summary: We design an accurate and efficient all contents text detection and recognition method based on deep learning.
We also propose a Financial Ticket Character Recognition Framework (FTCRF)
According to the characteristics of Chinese character recognition, this framework contains a two-step information extraction method.
The experimental results show that the average recognition accuracy of this method is 91.75% for character sequence and 87% for the whole ticket.
- Score: 5.371241477007343
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: With the development of the economy, the number of financial tickets
increases rapidly. The traditional manual invoice reimbursement and financial
accounting system bring more and more burden to financial accountants.
Therefore, based on the research and analysis of a large number of real
financial ticket data, we designed an accurate and efficient all contents text
detection and recognition method based on deep learning. This method has higher
recognition accuracy and recall rate and can meet the actual requirements of
financial accounting work. In addition, we propose a Financial Ticket Character
Recognition Framework (FTCRF). According to the characteristics of Chinese
character recognition, this framework contains a two-step information
extraction method, which can improve the speed of Chinese character
recognition. The experimental results show that the average recognition
accuracy of this method is 91.75\% for character sequence and 87\% for the
whole ticket. The availability and effectiveness of this method are verified by
a commercial application system, which significantly improves the efficiency of
the financial accounting system.
Related papers
- Money Recognition for the Visually Impaired: A Case Study on Sri Lankan Banknotes [0.0]
This research proposes a user-friendly stand-alone system for the identification of Sri Lankan currency notes.
A custom-created dataset of images of Sri Lankan currency notes was used to fine-tune an EfficientDet model.
The model achieved 0.9847 AP on the validation dataset and performs exceptionally well in real-world scenarios.
arXiv Detail & Related papers (2025-02-20T05:07:46Z) - Billet Number Recognition Based on Test-Time Adaptation [3.663302839754229]
We propose a billet number recognition method that integrates test-time adaptation with prior knowledge.
Experimental results on real datasets, including both machine-printed billet numbers and handwritten billet numbers, show significant improvements in evaluation metrics.
arXiv Detail & Related papers (2025-02-13T07:31:03Z) - Self-consistent Validation for Machine Learning Electronic Structure [81.54661501506185]
Method integrates machine learning with self-consistent field methods to achieve both low validation cost and interpret-ability.
This, in turn, enables exploration of the model's ability with active learning and instills confidence in its integration into real-world studies.
arXiv Detail & Related papers (2024-02-15T18:41:35Z) - Towards reducing hallucination in extracting information from financial
reports using Large Language Models [1.2289361708127877]
We show how Large Language Models (LLMs) can efficiently and rapidly extract information from earnings report transcripts.
We evaluate the outcomes of various LLMs with and without using our proposed approach based on various objective metrics for evaluating Q&A systems.
arXiv Detail & Related papers (2023-10-16T18:45:38Z) - Enabling and Analyzing How to Efficiently Extract Information from
Hybrid Long Documents with LLMs [48.87627426640621]
This research focuses on harnessing the potential of Large Language Models to comprehend critical information from financial reports.
We propose an Automated Financial Information Extraction framework that enhances LLMs' ability to comprehend and extract information from financial reports.
Our framework is effectively validated on GPT-3.5 and GPT-4, yielding average accuracy increases of 53.94% and 33.77%, respectively.
arXiv Detail & Related papers (2023-05-24T10:35:58Z) - Applications of Machine Learning in Detecting Afghan Fake Banknotes [0.0]
The prevalence of fake currency in Afghanistan poses significant challenges and detrimentally impacts the economy.
This paper introduces a method using image processing to identify counterfeit Afghan banknotes by analyzing specific security features.
The Random Forest algorithm achieved exceptional accuracy of 99% in detecting fake Afghan banknotes.
arXiv Detail & Related papers (2023-05-24T05:39:46Z) - Structure Diagram Recognition in Financial Announcements [7.763515888324117]
We propose a new method for recognizing structure diagrams in financial announcements.
We developed a two-stage method to efficiently generate the industry's first benchmark of structure diagrams from Chinese financial announcements.
We experimentally verified the significant performance advantage of our structure diagram recognition method over previous methods.
arXiv Detail & Related papers (2023-04-26T02:04:19Z) - An Efficient and Accurate Rough Set for Feature Selection,
Classification and Knowledge Representation [89.5951484413208]
This paper present a strong data mining method based on rough set, which can realize feature selection, classification and knowledge representation at the same time.
We first find the ineffectiveness of rough set because of overfitting, especially in processing noise attribute, and propose a robust measurement for an attribute, called relative importance.
Experimental results on public benchmark data sets show that the proposed framework achieves higher accurcy than seven popular or the state-of-the-art feature selection methods.
arXiv Detail & Related papers (2021-12-29T12:45:49Z) - Enhancing User' s Income Estimation with Super-App Alternative Data [59.60094442546867]
It compares the performance of these alternative data sources with the performance of industry-accepted bureau income estimators.
Ultimately, this paper shows the incentive for financial institutions to seek to incorporate alternative data into constructing their risk profiles.
arXiv Detail & Related papers (2021-04-12T21:34:44Z) - Research on Fast Text Recognition Method for Financial Ticket Image [5.371241477007343]
In the financial accounting field, the rapid increase in the number of financial tickets dramatically increases labor costs.
This paper first analyzes the different features of 482 kinds of financial tickets, divides all kinds of financial tickets into three categories and proposes different recognition patterns for each category.
According to the characteristics of the financial ticket text, in order to obtain higher recognition accuracy, the loss function, Region Proposal Network (RPN) and Non-Maximum Suppression (NMS) are improved.
arXiv Detail & Related papers (2021-01-05T01:42:35Z) - Automatic Counting and Identification of Train Wagons Based on Computer
Vision and Deep Learning [70.84106972725917]
The proposed solution is cost-effective and can easily replace solutions based on radiofrequency identification (RFID)
The system is able to automatically reject some of the train wagons successfully counted, as they have damaged identification codes.
arXiv Detail & Related papers (2020-10-30T14:56:54Z) - Financial ticket intelligent recognition system based on deep learning [4.606100785248409]
The system can recognize 194 kinds of financial tickets and has an automatic iterative optimization mechanism.
The average recognition accuracy of the system is 97.07%, and the average running time for a single ticket is 175.67ms.
arXiv Detail & Related papers (2020-10-29T05:07:40Z) - Cost-Sensitive Portfolio Selection via Deep Reinforcement Learning [100.73223416589596]
We propose a cost-sensitive portfolio selection method with deep reinforcement learning.
Specifically, a novel two-stream portfolio policy network is devised to extract both price series patterns and asset correlations.
A new cost-sensitive reward function is developed to maximize the accumulated return and constrain both costs via reinforcement learning.
arXiv Detail & Related papers (2020-03-06T06:28:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.