Research on Fast Text Recognition Method for Financial Ticket Image
- URL: http://arxiv.org/abs/2101.01310v1
- Date: Tue, 5 Jan 2021 01:42:35 GMT
- Title: Research on Fast Text Recognition Method for Financial Ticket Image
- Authors: Fukang Tian, Haiyu Wu, Bo Xu
- Abstract summary: In the financial accounting field, the rapid increase in the number of financial tickets dramatically increases labor costs.
This paper first analyzes the different features of 482 kinds of financial tickets, divides all kinds of financial tickets into three categories and proposes different recognition patterns for each category.
According to the characteristics of the financial ticket text, in order to obtain higher recognition accuracy, the loss function, Region Proposal Network (RPN) and Non-Maximum Suppression (NMS) are improved.
- Score: 5.371241477007343
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Currently, deep learning methods have been widely applied in and thus
promoted the development of different fields. In the financial accounting
field, the rapid increase in the number of financial tickets dramatically
increases labor costs; hence, using a deep learning method to relieve the
pressure on accounting is necessary. At present, a few works have applied deep
learning methods to financial ticket recognition. However, first, their
approaches only cover a few types of tickets. In addition, the precision and
speed of their recognition models cannot meet the requirements of practical
financial accounting systems. Moreover, none of the methods provides a detailed
analysis of both the types and content of tickets. Therefore, this paper first
analyzes the different features of 482 kinds of financial tickets, divides all
kinds of financial tickets into three categories and proposes different
recognition patterns for each category. These recognition patterns can meet
almost all types of financial ticket recognition needs. Second, regarding the
fixed format types of financial tickets (accounting for 68.27\% of the total
types of tickets), we propose a simple yet efficient network named the
Financial Ticket Faster Detection network (FTFDNet) based on a Faster RCNN.
Furthermore, according to the characteristics of the financial ticket text, in
order to obtain higher recognition accuracy, the loss function, Region Proposal
Network (RPN), and Non-Maximum Suppression (NMS) are improved to make FTFDNet
focus more on text. Finally, we perform a comparison with the best ticket
recognition model from the ICDAR2019 invoice competition. The experimental
results illustrate that FTFDNet increases the processing speed by 50\% while
maintaining similar precision.
Related papers
- FinTSB: A Comprehensive and Practical Benchmark for Financial Time Series Forecasting [58.70072722290475]
Financial time series (FinTS) record the behavior of human-brain-augmented decision-making.
FinTSB is a comprehensive and practical benchmark for financial time series forecasting.
arXiv Detail & Related papers (2025-02-26T05:19:16Z) - BD Currency Detection: A CNN Based Approach with Mobile App Integration [1.2535250082638645]
This study introduces an advanced currency recognition system utilizing Convolutional Neural Networks (CNNs)
A dataset comprising 50,334 images was collected, preprocessed, and used to train a CNN model optimized for high performance classification.
The trained model achieved an accuracy of 98.5%, surpassing conventional based currency recognition approaches.
arXiv Detail & Related papers (2025-02-25T07:13:43Z) - Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation [61.91492500828508]
Few-shot 3D point cloud segmentation (FS-PCS) aims at generalizing models to segment novel categories with minimal support samples.
We introduce a cost-free multimodal FS-PCS setup, utilizing textual labels and the potentially available 2D image modality.
We propose a simple yet effective Test-time Adaptive Cross-modal Seg (TACC) technique to mitigate training bias.
arXiv Detail & Related papers (2024-10-29T19:28:41Z) - FinEmbedDiff: A Cost-Effective Approach of Classifying Financial Documents with Vector Sampling using Multi-modal Embedding Models [0.0]
FinEmbedDiff is a cost-effective vector sampling method to classify financial documents.
It achieves competitive classification accuracy compared to state-of-the-art baselines.
It is a practical and scalable solution for real-world financial applications.
arXiv Detail & Related papers (2024-05-28T16:34:24Z) - Identifying Banking Transaction Descriptions via Support Vector Machine Short-Text Classification Based on a Specialized Labelled Corpus [7.046417074932257]
We describe a novel system that combines Natural Language Processing techniques with Machine Learning algorithms to classify banking transaction descriptions.
Motivated by existing solutions in spam detection, we also propose a short text similarity detector to reduce training set size based on the Jaccard distance.
We present a use case with a personal finance application, CoinScrap, which is available at Google Play and App Store.
arXiv Detail & Related papers (2024-03-29T13:15:46Z) - Textual Data Mining for Financial Fraud Detection: A Deep Learning
Approach [0.0]
I present a deep learning approach to conduct a natural language processing (hereafter NLP) binary classification task for analyzing financial-fraud texts.
My methodology involved different kinds of neural network models, including Multilayer Perceptrons with Embedding layers, vanilla Recurrent Neural Network (RNN), Long-Short Term Memory (LSTM), and Gated Recurrent Unit (GRU)
My results bring significant implications for financial fraud detection as this work contributes to the growing body of research at the intersection of deep learning, NLP, and finance.
arXiv Detail & Related papers (2023-08-05T15:33:10Z) - Transaction Fraud Detection via an Adaptive Graph Neural Network [64.9428588496749]
We propose an Adaptive Sampling and Aggregation-based Graph Neural Network (ASA-GNN) that learns discriminative representations to improve the performance of transaction fraud detection.
A neighbor sampling strategy is performed to filter noisy nodes and supplement information for fraudulent nodes.
Experiments on three real financial datasets demonstrate that the proposed method ASA-GNN outperforms state-of-the-art ones.
arXiv Detail & Related papers (2023-07-11T07:48:39Z) - Robust Lottery Tickets for Pre-trained Language Models [57.14316619360376]
We propose a novel method based on learning binary weight masks to identify robust tickets hidden in the original language models.
Experimental results show the significant improvement of the proposed method over previous work on adversarial robustness evaluation.
arXiv Detail & Related papers (2022-11-06T02:59:27Z) - The Elastic Lottery Ticket Hypothesis [106.79387235014379]
Lottery Ticket Hypothesis raises keen attention to identifying sparse trainableworks or winning tickets.
The most effective method to identify such winning tickets is still Iterative Magnitude-based Pruning.
We propose a variety of strategies to tweak the winning tickets found from different networks of the same model family.
arXiv Detail & Related papers (2021-03-30T17:53:45Z) - Research on All-content Text Recognition Method for Financial Ticket
Image [5.371241477007343]
We design an accurate and efficient all contents text detection and recognition method based on deep learning.
We also propose a Financial Ticket Character Recognition Framework (FTCRF)
According to the characteristics of Chinese character recognition, this framework contains a two-step information extraction method.
The experimental results show that the average recognition accuracy of this method is 91.75% for character sequence and 87% for the whole ticket.
arXiv Detail & Related papers (2020-12-15T09:39:32Z) - Financial ticket intelligent recognition system based on deep learning [4.606100785248409]
The system can recognize 194 kinds of financial tickets and has an automatic iterative optimization mechanism.
The average recognition accuracy of the system is 97.07%, and the average running time for a single ticket is 175.67ms.
arXiv Detail & Related papers (2020-10-29T05:07:40Z) - Winning Lottery Tickets in Deep Generative Models [64.79920299421255]
We show the existence of winning tickets in deep generative models such as GANs and VAEs.
We also demonstrate the transferability of winning tickets across different generative models.
arXiv Detail & Related papers (2020-10-05T21:45:39Z) - ResNeSt: Split-Attention Networks [86.25490825631763]
We present a modularized architecture, which applies the channel-wise attention on different network branches to leverage their success in capturing cross-feature interactions and learning diverse representations.
Our model, named ResNeSt, outperforms EfficientNet in accuracy and latency trade-off on image classification.
arXiv Detail & Related papers (2020-04-19T20:40:31Z) - Drawing Early-Bird Tickets: Towards More Efficient Training of Deep Networks [82.52404247479359]
Early-bird (EB) tickets can be identified at the very early training stage.
We propose a mask distance metric that can be used to identify EB tickets with low computational overhead.
arXiv Detail & Related papers (2019-09-26T07:43:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.