Heterogeneous Graph Neural Networks for Software Effort Estimation
- URL: http://arxiv.org/abs/2206.11023v1
- Date: Wed, 22 Jun 2022 12:46:02 GMT
- Title: Heterogeneous Graph Neural Networks for Software Effort Estimation
- Authors: Hung Phan and Ali Jannesari
- Abstract summary: Current approaches for automatically estimating story points focus on applying pre-trained embedding models and deep learning for text regression.
We propose HeteroSP, a tool for estimating story points from textual input of Agile software project issues.
- Score: 2.652428960991066
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Software effort can be measured by story point [35]. Current approaches for
automatically estimating story points focus on applying pre-trained embedding
models and deep learning for text regression to solve this problem which
required expensive embedding models. We propose HeteroSP, a tool for estimating
story points from textual input of Agile software project issues. We select
GPT2SP [12] and Deep-SE [8] as the baselines for comparison. First, from the
analysis of the story point dataset [8], we conclude that software issues are
actually a mixture of natural language sentences with quoted code snippets and
have problems related to large-size vocabulary. Second, we provide a module to
normalize the input text including words and code tokens of the software
issues. Third, we design an algorithm to convert an input software issue to a
graph with different types of nodes and edges. Fourth, we construct a
heterogeneous graph neural networks model with the support of fastText [6] for
constructing initial node embedding to learn and predict the story points of
new issues. We did the comparison over three scenarios of estimation, including
within project, cross-project within the repository, and cross-project cross
repository with our baseline approaches. We achieve the average Mean Absolute
Error (MAE) as 2.38, 2.61, and 2.63 for three scenarios. We outperform GPT2SP
in 2/3 of the scenarios while outperforming Deep-SE in the most challenging
scenario with significantly less amount of running time. We also compare our
approaches with different homogeneous graph neural network models and the
results show that the heterogeneous graph neural networks model outperforms the
homogeneous models in story point estimation. For time performance, we achieve
about 570 seconds as the time performance in both three processes: node
embedding initialization, model construction, and story point estimation.
Related papers
- Promises and Pitfalls of Generative Masked Language Modeling: Theoretical Framework and Practical Guidelines [74.42485647685272]
We focus on Generative Masked Language Models (GMLMs)
We train a model to fit conditional probabilities of the data distribution via masking, which are subsequently used as inputs to a Markov Chain to draw samples from the model.
We adapt the T5 model for iteratively-refined parallel decoding, achieving 2-3x speedup in machine translation with minimal sacrifice in quality.
arXiv Detail & Related papers (2024-07-22T18:00:00Z) - Clustering based Point Cloud Representation Learning for 3D Analysis [80.88995099442374]
We propose a clustering based supervised learning scheme for point cloud analysis.
Unlike current de-facto, scene-wise training paradigm, our algorithm conducts within-class clustering on the point embedding space.
Our algorithm shows notable improvements on famous point cloud segmentation datasets.
arXiv Detail & Related papers (2023-07-27T03:42:12Z) - Networked Time Series Imputation via Position-aware Graph Enhanced
Variational Autoencoders [31.953958053709805]
We design a new model named PoGeVon which leverages variational autoencoder (VAE) to predict missing values over both node time series features and graph structures.
Experiment results demonstrate the effectiveness of our model over baselines.
arXiv Detail & Related papers (2023-05-29T21:11:34Z) - Learnable Graph Matching: A Practical Paradigm for Data Association [74.28753343714858]
We propose a general learnable graph matching method to address these issues.
Our method achieves state-of-the-art performance on several MOT datasets.
For image matching, our method outperforms state-of-the-art methods on a popular indoor dataset, ScanNet.
arXiv Detail & Related papers (2023-03-27T17:39:00Z) - Revisit the Algorithm Selection Problem for TSP with Spatial Information
Enhanced Graph Neural Networks [4.084365114504618]
This paper revisits the algorithm selection problem for Euclidean Traveling Salesman Problem (TSP)
We propose a novel Graph Neural Network (GNN), called GINES.
GINES takes the coordinates of cities and distances between cities as input.
It is better than the traditional handcrafted feature-based approach on one dataset.
arXiv Detail & Related papers (2023-02-08T13:14:20Z) - SCOOP: Self-Supervised Correspondence and Optimization-Based Scene Flow [25.577386156273256]
Scene flow estimation is a long-standing problem in computer vision, where the goal is to find the 3D motion of a scene from its consecutive observations.
We introduce SCOOP, a new method for scene flow estimation that can be learned on a small amount of data without employing ground-truth flow supervision.
arXiv Detail & Related papers (2022-11-25T10:52:02Z) - Towards Better Out-of-Distribution Generalization of Neural Algorithmic
Reasoning Tasks [51.8723187709964]
We study the OOD generalization of neural algorithmic reasoning tasks.
The goal is to learn an algorithm from input-output pairs using deep neural networks.
arXiv Detail & Related papers (2022-11-01T18:33:20Z) - Story Point Effort Estimation by Text Level Graph Neural Network [2.652428960991066]
Graph Neural Network is a new approach that has been applied in Natural Language Processing for text classification.
We show the potential and possible challenges of Graph Neural Network text classification in story point level estimation.
arXiv Detail & Related papers (2022-03-06T22:15:03Z) - A Novel Hand Gesture Detection and Recognition system based on
ensemble-based Convolutional Neural Network [3.5665681694253903]
Detection of hand portion has become a challenging task in computer vision and pattern recognition communities.
Deep learning algorithm like convolutional neural network (CNN) architecture has become a very popular choice for classification tasks.
In this paper, an ensemble of CNN-based approaches is presented to overcome some problems like high variance during prediction, overfitting problem and also prediction errors.
arXiv Detail & Related papers (2022-02-25T06:46:58Z) - Learnable Graph Matching: Incorporating Graph Partitioning with Deep
Feature Learning for Multiple Object Tracking [58.30147362745852]
Data association across frames is at the core of Multiple Object Tracking (MOT) task.
Existing methods mostly ignore the context information among tracklets and intra-frame detections.
We propose a novel learnable graph matching method to address these issues.
arXiv Detail & Related papers (2021-03-30T08:58:45Z) - Learning multiview 3D point cloud registration [74.39499501822682]
We present a novel, end-to-end learnable, multiview 3D point cloud registration algorithm.
Our approach outperforms the state-of-the-art by a significant margin, while being end-to-end trainable and computationally less costly.
arXiv Detail & Related papers (2020-01-15T03:42:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.