Learning to Describe Solutions for Bug Reports Based on Developer
Discussions
- URL: http://arxiv.org/abs/2110.04353v1
- Date: Fri, 8 Oct 2021 19:39:55 GMT
- Title: Learning to Describe Solutions for Bug Reports Based on Developer
Discussions
- Authors: Sheena Panthaplackel, Junyi Jessy Li, Milos Gligoric, Raymond J.
Mooney
- Abstract summary: We propose generating a concise natural language description of the solution by synthesizing relevant content within the discussion.
To support generating an informative description during an ongoing discussion, we propose a secondary task of determining when sufficient context about the solution emerges in real-time.
We construct a dataset for these tasks with a novel technique for obtaining noisy supervision from repository changes linked to bug reports.
- Score: 43.427873307255425
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: When a software bug is reported, developers engage in a discussion to
collaboratively resolve it. While the solution is likely formulated within the
discussion, it is often buried in a large amount of text, making it difficult
to comprehend, which delays its implementation. To expedite bug resolution, we
propose generating a concise natural language description of the solution by
synthesizing relevant content within the discussion, which encompasses both
natural language and source code. Furthermore, to support generating an
informative description during an ongoing discussion, we propose a secondary
task of determining when sufficient context about the solution emerges in
real-time. We construct a dataset for these tasks with a novel technique for
obtaining noisy supervision from repository changes linked to bug reports. We
establish baselines for generating solution descriptions, and develop a
classifier which makes a prediction following each new utterance on whether or
not the necessary context for performing generation is available. Through
automated and human evaluation, we find these tasks to form an ideal testbed
for complex reasoning in long, bimodal dialogue context.
Related papers
- On The Importance of Reasoning for Context Retrieval in Repository-Level Code Editing [82.96523584351314]
We decouple the task of context retrieval from the other components of the repository-level code editing pipelines.
We conclude that while the reasoning helps to improve the precision of the gathered context, it still lacks the ability to identify its sufficiency.
arXiv Detail & Related papers (2024-06-06T19:44:17Z) - Exploring Interaction Patterns for Debugging: Enhancing Conversational
Capabilities of AI-assistants [18.53732314023887]
Large Language Models (LLMs) enable programmers to obtain natural language explanations for various software development tasks.
LLMs often leap to action without sufficient context, giving rise to implicit assumptions and inaccurate responses.
In this paper, we draw inspiration from interaction patterns and conversation analysis -- to design Robin, an enhanced conversational AI-assistant for debug.
arXiv Detail & Related papers (2024-02-09T07:44:27Z) - Context-dependent Instruction Tuning for Dialogue Response Generation [61.21790201307179]
Recent language models have achieved impressive performance in natural language computation tasks by incorporating instructions with task input during fine-tuning.
We introduce a context-based instruction fine-tuning framework for each multi-turn dialogue.
During the evaluation, the model generates instructions based on the previous context to self-guide the response.
arXiv Detail & Related papers (2023-11-13T01:25:30Z) - Python Code Generation by Asking Clarification Questions [57.63906360576212]
In this work, we introduce a novel and more realistic setup for this task.
We hypothesize that the under-specification of a natural language description can be resolved by asking clarification questions.
We collect and introduce a new dataset named CodeClarQA containing pairs of natural language descriptions and code with created synthetic clarification questions and answers.
arXiv Detail & Related papers (2022-12-19T22:08:36Z) - Using Developer Discussions to Guide Fixing Bugs in Software [51.00904399653609]
We propose using bug report discussions, which are available before the task is performed and are also naturally occurring, avoiding the need for additional information from developers.
We demonstrate that various forms of natural language context derived from such discussions can aid bug-fixing, even leading to improved performance over using commit messages corresponding to the oracle bug-fixing commits.
arXiv Detail & Related papers (2022-11-11T16:37:33Z) - Leveraging Natural Supervision for Language Representation Learning and
Generation [8.083109555490475]
We describe three lines of work that seek to improve the training and evaluation of neural models using naturally-occurring supervision.
We first investigate self-supervised training losses to help enhance the performance of pretrained language models for various NLP tasks.
We propose a framework that uses paraphrase pairs to disentangle semantics and syntax in sentence representations.
arXiv Detail & Related papers (2022-07-21T17:26:03Z) - Reasoning in Dialog: Improving Response Generation by Context Reading
Comprehension [49.92173751203827]
In multi-turn dialog, utterances do not always take the full form of sentences.
We propose to improve the response generation performance by examining the model's ability to answer a reading comprehension question.
arXiv Detail & Related papers (2020-12-14T10:58:01Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.