Using Changeset Descriptions as a Data Source to Assist Feature Location
- URL: http://arxiv.org/abs/2402.05711v1
- Date: Thu, 8 Feb 2024 14:38:29 GMT
- Title: Using Changeset Descriptions as a Data Source to Assist Feature Location
- Authors: Muslim Chochlov, Michael English, Jim Buckley
- Abstract summary: We implement a technique utilizing changeset descriptions and conduct an empirical study to observe this technique's overall performance.
Preliminary study with Rhino and Mylyn.Tasks systems suggest that the approach could lead to a potentially efficient feature location technique.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Feature location attempts to assist developers in discovering functionality
in source code. Many textual feature location techniques utilize information
retrieval and rely on comments and identifiers of source code to describe
software entities. An interesting alternative would be to employ the changeset
descriptions of the code altered in that changeset as a data source to describe
such software entities. To investigate this we implement a technique utilizing
changeset descriptions and conduct an empirical study to observe this
technique's overall performance. Moreover, we study how the granularity (i.e.
file or method level of software entities) and changeset range inclusion (i.e.
most recent or all historical changesets) affect such an approach. The results
of a preliminary study with Rhino and Mylyn.Tasks systems suggest that the
approach could lead to a potentially efficient feature location technique. They
also suggest that it is advantageous in terms of the effort to configure the
technique at method level granularity and that older changesets from older
systems may reduce the effectiveness of the technique.
Related papers
- ChangeGuard: Validating Code Changes via Pairwise Learning-Guided Execution [16.130469984234956]
ChangeGuard is an approach that uses learning-guided execution to compare the runtime behavior of a modified function.
Our results show that the approach identifies semantics-changing code changes with a precision of 77.1% and a recall of 69.5%.
arXiv Detail & Related papers (2024-10-21T15:13:32Z) - Toward Interactive Optimization of Source Code Differences: An Empirical Study of Its Performance [1.313675711285772]
We propose an interactive approach to optimize source code differences (diffs)
Users can provide feedback for the points of a diff that should not be matched but are or parts that should be matched but are not.
The results of 23 GitHub projects confirm that 92% of nonoptimal diffs can be addressed with less than four feedback actions in the ideal case.
arXiv Detail & Related papers (2024-09-20T15:43:55Z) - Towards Understanding the Impact of Code Modifications on Software Quality Metrics [1.2277343096128712]
This study aims to assess and interpret the impact of code modifications on software quality metrics.
The underlying hypothesis posits that code modifications inducing similar changes in software quality metrics can be grouped into distinct clusters.
The results reveal distinct clusters of code modifications, each accompanied by a concise description, revealing their collective impact on software quality metrics.
arXiv Detail & Related papers (2024-04-05T08:41:18Z) - Editing Large Language Models: Problems, Methods, and Opportunities [51.903537096207]
This paper embarks on a deep exploration of the problems, methods, and opportunities related to model editing for LLMs.
We provide an exhaustive overview of the task definition and challenges associated with model editing, along with an in-depth empirical analysis of the most progressive methods currently at our disposal.
Our objective is to provide valuable insights into the effectiveness and feasibility of each editing technique, thereby assisting the community in making informed decisions on the selection of the most appropriate method for a specific task or context.
arXiv Detail & Related papers (2023-05-22T16:00:00Z) - Leveraging Data Mining Algorithms to Recommend Source Code Changes [7.959841510571622]
This paper proposes an automatic method for recommending source code changes using four data mining algorithms.
We compare the algorithms in terms of performance (Precision, Recall and F-measure) and execution time.
Apriori seems appropriate for large-scale projects, whereas Eclat appears to be suitable for small-scale projects.
arXiv Detail & Related papers (2023-04-29T18:38:23Z) - Leveraging Wikidata's edit history in knowledge graph refinement tasks [77.34726150561087]
edit history represents the process in which the community reaches some kind of fuzzy and distributed consensus.
We build a dataset containing the edit history of every instance from the 100 most important classes in Wikidata.
We propose and evaluate two new methods to leverage this edit history information in knowledge graph embedding models for type prediction tasks.
arXiv Detail & Related papers (2022-10-27T14:32:45Z) - Incorporating Task-specific Concept Knowledge into Script Learning [68.95195207989605]
We present Tetris, a new task of Goal-Oriented Script Completion.
It considers a more realistic and general setting, where the input includes not only the goal but also additional user context.
We propose a novel approach, which uses two techniques to improve performance.
arXiv Detail & Related papers (2022-08-31T18:55:22Z) - Human-in-the-Loop Disinformation Detection: Stance, Sentiment, or
Something Else? [93.91375268580806]
Both politics and pandemics have recently provided ample motivation for the development of machine learning-enabled disinformation (a.k.a. fake news) detection algorithms.
Existing literature has focused primarily on the fully-automated case, but the resulting techniques cannot reliably detect disinformation on the varied topics, sources, and time scales required for military applications.
By leveraging an already-available analyst as a human-in-the-loop, canonical machine learning techniques of sentiment analysis, aspect-based sentiment analysis, and stance detection become plausible methods to use for a partially-automated disinformation detection system.
arXiv Detail & Related papers (2021-11-09T13:30:34Z) - Modurec: Recommender Systems with Feature and Time Modulation [50.51144496609274]
We propose Modurec: an autoencoder-based method that combines all available information using the feature-wise modulation mechanism.
We show on Movielens datasets that these modifications produce state-of-the-art results in most evaluated settings.
arXiv Detail & Related papers (2020-10-13T09:18:33Z) - Discovering Reinforcement Learning Algorithms [53.72358280495428]
Reinforcement learning algorithms update an agent's parameters according to one of several possible rules.
This paper introduces a new meta-learning approach that discovers an entire update rule.
It includes both 'what to predict' (e.g. value functions) and 'how to learn from it' by interacting with a set of environments.
arXiv Detail & Related papers (2020-07-17T07:38:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.