How Far Are We? The Triumphs and Trials of Generative AI in Learning
Software Engineering
- URL: http://arxiv.org/abs/2312.11719v1
- Date: Mon, 18 Dec 2023 21:38:00 GMT
- Title: How Far Are We? The Triumphs and Trials of Generative AI in Learning
Software Engineering
- Authors: Rudrajit Choudhuri, Dylan Liu, Igor Steinmacher, Marco Gerosa, Anita
Sarma
- Abstract summary: We evaluate the effectiveness of ChatGPT, a convo-genAI platform, in assisting students in Software Engineering tasks.
Our study did not find statistical differences in participants' productivity or self-efficacy when using ChatGPT as compared to traditional resources.
Our study also revealed 5 distinct faults arising from violations of Human-AI interaction guidelines, which led to 7 different (negative) consequences on participants.
- Score: 16.5141990552784
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Conversational Generative AI (convo-genAI) is revolutionizing Software
Engineering (SE) as engineers and academics embrace this technology in their
work. However, there is a gap in understanding the current potential and
pitfalls of this technology, specifically in supporting students in SE tasks.
In this work, we evaluate through a between-subjects study (N=22) the
effectiveness of ChatGPT, a convo-genAI platform, in assisting students in SE
tasks. Our study did not find statistical differences in participants'
productivity or self-efficacy when using ChatGPT as compared to traditional
resources, but we found significantly increased frustration levels. Our study
also revealed 5 distinct faults arising from violations of Human-AI interaction
guidelines, which led to 7 different (negative) consequences on participants.
Related papers
- Raising the Stakes: Performance Pressure Improves AI-Assisted Decision Making [57.53469908423318]
We show the effects of performance pressure on AI advice reliance when laypeople complete a common AI-assisted task.
We find that when the stakes are high, people use AI advice more appropriately than when stakes are lower, regardless of the presence of an AI explanation.
arXiv Detail & Related papers (2024-10-21T22:39:52Z) - Could ChatGPT get an Engineering Degree? Evaluating Higher Education Vulnerability to AI Assistants [175.9723801486487]
We evaluate whether two AI assistants, GPT-3.5 and GPT-4, can adequately answer assessment questions.
GPT-4 answers an average of 65.8% of questions correctly, and can even produce the correct answer across at least one prompting strategy for 85.1% of questions.
Our results call for revising program-level assessment design in higher education in light of advances in generative AI.
arXiv Detail & Related papers (2024-08-07T12:11:49Z) - Effects of a Prompt Engineering Intervention on Undergraduate Students' AI Self-Efficacy, AI Knowledge and Prompt Engineering Ability: A Mixed Methods Study [36.48421439947282]
This study designed and implemented a prompt engineering intervention at a university in Hong Kong.
It examined students' AI self-efficacy, AI knowledge, and proficiency in creating effective prompts.
arXiv Detail & Related papers (2024-07-30T15:05:24Z) - AI-Tutoring in Software Engineering Education [0.7631288333466648]
We conducted an exploratory case study by integrating the GPT-3.5-Turbo model as an AI-Tutor within the APAS Artemis.
The findings highlight advantages, such as timely feedback and scalability.
However, challenges like generic responses and students' concerns about a learning progress inhibition when using the AI-Tutor were also evident.
arXiv Detail & Related papers (2024-04-03T08:15:08Z) - I would love this to be like an assistant, not the teacher: a voice of the customer perspective of what distance learning students want from an Artificial Intelligence Digital Assistant [0.0]
This study examined the perceptions of ten online and distance learning students regarding the design of a hypothetical AI Digital Assistant (AIDA)
All participants agreed on the usefulness of such an AI tool while studying and reported benefits from using it for real-time assistance and query resolution, support for academic tasks, personalisation and accessibility, together with emotional and social support.
Students' concerns related to the ethical and social implications of implementing AIDA, data privacy and data use, operational challenges, academic integrity and misuse, and the future of education.
arXiv Detail & Related papers (2024-02-16T08:10:41Z) - ChatGPT as a Software Development Bot: A Project-based Study [5.518217604591736]
This study examines the impact of generative AI tools, specifically ChatGPT, on the software development experiences of undergraduate students.
Results showed that ChatGPT significantly addresses skill gaps in software development education, enhancing efficiency, accuracy, and collaboration.
arXiv Detail & Related papers (2023-10-20T16:48:19Z) - Identifying and Mitigating the Security Risks of Generative AI [179.2384121957896]
This paper reports the findings of a workshop held at Google on the dual-use dilemma posed by GenAI.
GenAI can be used just as well by attackers to generate new attacks and increase the velocity and efficacy of existing attacks.
We discuss short-term and long-term goals for the community on this topic.
arXiv Detail & Related papers (2023-08-28T18:51:09Z) - Perception, performance, and detectability of conversational artificial
intelligence across 32 university courses [15.642614735026106]
We compare the performance of ChatGPT against students on 32 university-level courses.
We find that ChatGPT's performance is comparable, if not superior, to that of students in many courses.
We find an emerging consensus among students to use the tool, and among educators to treat this as plagiarism.
arXiv Detail & Related papers (2023-05-07T10:37:51Z) - Human Decision Makings on Curriculum Reinforcement Learning with
Difficulty Adjustment [52.07473934146584]
We guide the curriculum reinforcement learning results towards a preferred performance level that is neither too hard nor too easy via learning from the human decision process.
Our system is highly parallelizable, making it possible for a human to train large-scale reinforcement learning applications.
It shows reinforcement learning performance can successfully adjust in sync with the human desired difficulty level.
arXiv Detail & Related papers (2022-08-04T23:53:51Z) - The MineRL BASALT Competition on Learning from Human Feedback [58.17897225617566]
The MineRL BASALT competition aims to spur forward research on this important class of techniques.
We design a suite of four tasks in Minecraft for which we expect it will be hard to write down hardcoded reward functions.
We provide a dataset of human demonstrations on each of the four tasks, as well as an imitation learning baseline.
arXiv Detail & Related papers (2021-07-05T12:18:17Z) - Building Bridges: Generative Artworks to Explore AI Ethics [56.058588908294446]
In recent years, there has been an increased emphasis on understanding and mitigating adverse impacts of artificial intelligence (AI) technologies on society.
A significant challenge in the design of ethical AI systems is that there are multiple stakeholders in the AI pipeline, each with their own set of constraints and interests.
This position paper outlines some potential ways in which generative artworks can play this role by serving as accessible and powerful educational tools.
arXiv Detail & Related papers (2021-06-25T22:31:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.