Is Image Encoding Beneficial for Deep Learning in Finance? An Analysis
of Image Encoding Methods for the Application of Convolutional Neural
Networks in Finance
- URL: http://arxiv.org/abs/2010.08698v1
- Date: Sat, 17 Oct 2020 02:14:39 GMT
- Title: Is Image Encoding Beneficial for Deep Learning in Finance? An Analysis
of Image Encoding Methods for the Application of Convolutional Neural
Networks in Finance
- Authors: Dan Wang, Tianrui Wang, Ionu\c{t} Florescu
- Abstract summary: SEC mandated all corporate filings for any company doing business in US be entered into the Electronic Data Gathering, Analysis, and Retrieval (EDGAR) system.
This may serve portfolio managers (pension funds, mutual funds, insurance, hedge funds) to get automated insights into companies they invest in.
- Score: 4.14084373472438
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In 2012, SEC mandated all corporate filings for any company doing business in
US be entered into the Electronic Data Gathering, Analysis, and Retrieval
(EDGAR) system. In this work we are investigating ways to analyze the data
available through EDGAR database. This may serve portfolio managers (pension
funds, mutual funds, insurance, hedge funds) to get automated insights into
companies they invest in, to better manage their portfolios. The analysis is
based on Artificial Neural Networks applied to the data.} In particular, one of
the most popular machine learning methods, the Convolutional Neural Network
(CNN) architecture, originally developed to interpret and classify images, is
now being used to interpret financial data. This work investigates the best way
to input data collected from the SEC filings into a CNN architecture. We
incorporate accounting principles and mathematical methods into the design of
three image encoding methods. Specifically, two methods are derived from
accounting principles (Sequential Arrangement, Category Chunk Arrangement) and
one is using a purely mathematical technique (Hilbert Vector Arrangement). In
this work we analyze fundamental financial data as well as financial ratio data
and study companies from the financial, healthcare and IT sectors in the United
States. We find that using imaging techniques to input data for CNN works
better for financial ratio data but is not significantly better than simply
using the 1D input directly for fundamental data. We do not find the Hilbert
Vector Arrangement technique to be significantly better than other imaging
techniques.
Related papers
- Transfer Learning and Transformer Architecture for Financial Sentiment Analysis [3.065600760950715]
Financial domain uses specialized mechanisms which makes sentiment analysis difficult.
We propose a pre-trained language model which can help to solve this problem with fewer labelled data.
arXiv Detail & Related papers (2024-04-28T17:15:07Z) - Learnable Graph Matching: A Practical Paradigm for Data Association [74.28753343714858]
We propose a general learnable graph matching method to address these issues.
Our method achieves state-of-the-art performance on several MOT datasets.
For image matching, our method outperforms state-of-the-art methods on a popular indoor dataset, ScanNet.
arXiv Detail & Related papers (2023-03-27T17:39:00Z) - Reduction Algorithms for Persistence Diagrams of Networks: CoralTDA and
PrunIT [27.411830935369498]
High computational costs remain the primary roadblock hindering the successful application of topological data analysis.
We develop two new, remarkably simple but effective algorithms to compute the exact persistence diagrams of large graphs.
Our experiments on large networks show that our novel approach can achieve computational gains up to 95%.
arXiv Detail & Related papers (2022-11-24T16:52:48Z) - Comparison Analysis of Traditional Machine Learning and Deep Learning
Techniques for Data and Image Classification [62.997667081978825]
The purpose of the study is to analyse and compare the most common machine learning and deep learning techniques used for computer vision 2D object classification tasks.
Firstly, we will present the theoretical background of the Bag of Visual words model and Deep Convolutional Neural Networks (DCNN)
Secondly, we will implement a Bag of Visual Words model, the VGG16 CNN Architecture.
arXiv Detail & Related papers (2022-04-11T11:34:43Z) - ProgFed: Effective, Communication, and Computation Efficient Federated Learning by Progressive Training [65.68511423300812]
We propose ProgFed, a progressive training framework for efficient and effective federated learning.
ProgFed inherently reduces computation and two-way communication costs while maintaining the strong performance of the final models.
Our results show that ProgFed converges at the same rate as standard training on full models.
arXiv Detail & Related papers (2021-10-11T14:45:00Z) - Fund2Vec: Mutual Funds Similarity using Graph Learning [0.966840768820136]
We propose a radically new approach to identify similar funds based on the weighted bipartite network representation of funds and their underlying assets data.
Ours is the first ever study of the weighted bipartite network representation of the funds-assets network in its original form.
arXiv Detail & Related papers (2021-06-24T17:35:00Z) - Evaluating data augmentation for financial time series classification [85.38479579398525]
We evaluate several augmentation methods applied to stocks datasets using two state-of-the-art deep learning models.
For a relatively small dataset augmentation methods achieve up to $400%$ improvement in risk adjusted return performance.
For a larger stock dataset augmentation methods achieve up to $40%$ improvement.
arXiv Detail & Related papers (2020-10-28T17:53:57Z) - A Novel Ensemble Deep Learning Model for Stock Prediction Based on Stock
Prices and News [7.578363431637128]
This paper proposes to use sentiment analysis to extract useful information from multiple textual data sources to predict future stock movement.
The blending ensemble model contains two levels. The first level contains two Recurrent Neural Networks (RNNs), one Long-Short Term Memory network (LSTM) and one Gated Recurrent Units network (GRU)
The fully connected neural network is used to ensemble several individual prediction results to further improve the prediction accuracy.
arXiv Detail & Related papers (2020-07-23T15:25:37Z) - Learning to map source code to software vulnerability using
code-as-a-graph [67.62847721118142]
We explore the applicability of Graph Neural Networks in learning the nuances of source code from a security perspective.
We show that a code-as-graph encoding is more meaningful for vulnerability detection than existing code-as-photo and linear sequence encoding approaches.
arXiv Detail & Related papers (2020-06-15T16:05:27Z) - 3D medical image segmentation with labeled and unlabeled data using
autoencoders at the example of liver segmentation in CT images [58.720142291102135]
This work investigates the potential of autoencoder-extracted features to improve segmentation with a convolutional neural network.
A convolutional autoencoder was used to extract features from unlabeled data and a multi-scale, fully convolutional CNN was used to perform the target task of 3D liver segmentation in CT images.
arXiv Detail & Related papers (2020-03-17T20:20:43Z) - Application of Deep Neural Networks to assess corporate Credit Rating [4.14084373472438]
We analyze the performance of four neural network architectures in predicting corporate credit rating as issued by Standard and Poor's.
The goal of the analysis is to improve application of machine learning algorithms to credit assessment.
arXiv Detail & Related papers (2020-03-04T21:29:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.