Related papers: Is Image Encoding Beneficial for Deep Learning in Finance? An Analysis of Image Encoding Methods for the Application of Convolutional Neural Networks in Finance

Is Image Encoding Beneficial for Deep Learning in Finance? An Analysis of Image Encoding Methods for the Application of Convolutional Neural Networks in Finance

URL: http://arxiv.org/abs/2010.08698v1
Date: Sat, 17 Oct 2020 02:14:39 GMT
Title: Is Image Encoding Beneficial for Deep Learning in Finance? An Analysis of Image Encoding Methods for the Application of Convolutional Neural Networks in Finance
Authors: Dan Wang, Tianrui Wang, Ionu\c{t} Florescu
Abstract summary: SEC mandated all corporate filings for any company doing business in US be entered into the Electronic Data Gathering, Analysis, and Retrieval (EDGAR) system. This may serve portfolio managers (pension funds, mutual funds, insurance, hedge funds) to get automated insights into companies they invest in.
Score: 4.14084373472438
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: In 2012, SEC mandated all corporate filings for any company doing business in US be entered into the Electronic Data Gathering, Analysis, and Retrieval (EDGAR) system. In this work we are investigating ways to analyze the data available through EDGAR database. This may serve portfolio managers (pension funds, mutual funds, insurance, hedge funds) to get automated insights into companies they invest in, to better manage their portfolios. The analysis is based on Artificial Neural Networks applied to the data.} In particular, one of the most popular machine learning methods, the Convolutional Neural Network (CNN) architecture, originally developed to interpret and classify images, is now being used to interpret financial data. This work investigates the best way to input data collected from the SEC filings into a CNN architecture. We incorporate accounting principles and mathematical methods into the design of three image encoding methods. Specifically, two methods are derived from accounting principles (Sequential Arrangement, Category Chunk Arrangement) and one is using a purely mathematical technique (Hilbert Vector Arrangement). In this work we analyze fundamental financial data as well as financial ratio data and study companies from the financial, healthcare and IT sectors in the United States. We find that using imaging techniques to input data for CNN works better for financial ratio data but is not significantly better than simply using the 1D input directly for fundamental data. We do not find the Hilbert Vector Arrangement technique to be significantly better than other imaging techniques.

Related papers

Exact Certification of (Graph) Neural Networks Against Label Poisoning [50.87615167799367]
We introduce an exact certification method for label flipping in Graph Neural Networks (GNNs) We apply our method to certify a broad range of GNN architectures in node classification tasks. Our work presents the first exact certificate to a poisoning attack ever derived for neural networks.
arXiv Detail & Related papers (2024-11-30T17:05:12Z)
Transfer Learning and Transformer Architecture for Financial Sentiment Analysis [3.065600760950715]
Financial domain uses specialized mechanisms which makes sentiment analysis difficult. We propose a pre-trained language model which can help to solve this problem with fewer labelled data.
arXiv Detail & Related papers (2024-04-28T17:15:07Z)
Learnable Graph Matching: A Practical Paradigm for Data Association [74.28753343714858]
We propose a general learnable graph matching method to address these issues. Our method achieves state-of-the-art performance on several MOT datasets. For image matching, our method outperforms state-of-the-art methods on a popular indoor dataset, ScanNet.
arXiv Detail & Related papers (2023-03-27T17:39:00Z)
Reduction Algorithms for Persistence Diagrams of Networks: CoralTDA and PrunIT [27.411830935369498]
High computational costs remain the primary roadblock hindering the successful application of topological data analysis. We develop two new, remarkably simple but effective algorithms to compute the exact persistence diagrams of large graphs. Our experiments on large networks show that our novel approach can achieve computational gains up to 95%.
arXiv Detail & Related papers (2022-11-24T16:52:48Z)
Comparison Analysis of Traditional Machine Learning and Deep Learning Techniques for Data and Image Classification [62.997667081978825]
The purpose of the study is to analyse and compare the most common machine learning and deep learning techniques used for computer vision 2D object classification tasks. Firstly, we will present the theoretical background of the Bag of Visual words model and Deep Convolutional Neural Networks (DCNN) Secondly, we will implement a Bag of Visual Words model, the VGG16 CNN Architecture.
arXiv Detail & Related papers (2022-04-11T11:34:43Z)
ProgFed: Effective, Communication, and Computation Efficient Federated Learning by Progressive Training [65.68511423300812]
We propose ProgFed, a progressive training framework for efficient and effective federated learning. ProgFed inherently reduces computation and two-way communication costs while maintaining the strong performance of the final models. Our results show that ProgFed converges at the same rate as standard training on full models.
arXiv Detail & Related papers (2021-10-11T14:45:00Z)
Fund2Vec: Mutual Funds Similarity using Graph Learning [0.966840768820136]
We propose a radically new approach to identify similar funds based on the weighted bipartite network representation of funds and their underlying assets data. Ours is the first ever study of the weighted bipartite network representation of the funds-assets network in its original form.
arXiv Detail & Related papers (2021-06-24T17:35:00Z)
Evaluating data augmentation for financial time series classification [85.38479579398525]
We evaluate several augmentation methods applied to stocks datasets using two state-of-the-art deep learning models. For a relatively small dataset augmentation methods achieve up to $400%$ improvement in risk adjusted return performance. For a larger stock dataset augmentation methods achieve up to $40%$ improvement.
arXiv Detail & Related papers (2020-10-28T17:53:57Z)
A Novel Ensemble Deep Learning Model for Stock Prediction Based on Stock Prices and News [7.578363431637128]
This paper proposes to use sentiment analysis to extract useful information from multiple textual data sources to predict future stock movement. The blending ensemble model contains two levels. The first level contains two Recurrent Neural Networks (RNNs), one Long-Short Term Memory network (LSTM) and one Gated Recurrent Units network (GRU) The fully connected neural network is used to ensemble several individual prediction results to further improve the prediction accuracy.
arXiv Detail & Related papers (2020-07-23T15:25:37Z)
Learning to map source code to software vulnerability using code-as-a-graph [67.62847721118142]
We explore the applicability of Graph Neural Networks in learning the nuances of source code from a security perspective. We show that a code-as-graph encoding is more meaningful for vulnerability detection than existing code-as-photo and linear sequence encoding approaches.
arXiv Detail & Related papers (2020-06-15T16:05:27Z)
3D medical image segmentation with labeled and unlabeled data using autoencoders at the example of liver segmentation in CT images [58.720142291102135]
This work investigates the potential of autoencoder-extracted features to improve segmentation with a convolutional neural network. A convolutional autoencoder was used to extract features from unlabeled data and a multi-scale, fully convolutional CNN was used to perform the target task of 3D liver segmentation in CT images.
arXiv Detail & Related papers (2020-03-17T20:20:43Z)
Application of Deep Neural Networks to assess corporate Credit Rating [4.14084373472438]
We analyze the performance of four neural network architectures in predicting corporate credit rating as issued by Standard and Poor's. The goal of the analysis is to improve application of machine learning algorithms to credit assessment.
arXiv Detail & Related papers (2020-03-04T21:29:22Z)

This list is automatically generated from the titles and abstracts of the papers in this site.