Early Success Prediction in Student Codes in Computer Science Education


Presenter
Hayun Jung
Group Members
Piyush Maheshwari, Saksham Bansal
Campus
UMass Amherst
Sponsor
Andrew Lan, Department of Computer Science, UMass Amherst
Schedule
Session 2, 11:30 AM - 12:15 PM [Schedule by Time][Poster Grid for Time/Location]
Location
Poster Board A20, Campus Center Auditorium, Row 1 (A1-A20) [Poster Location Map]
Abstract
Predicting the success rate of student-written code submissions for a given problem without executing the code against test cases is a challenging task, but one that is beneficial for both instructors and students. In this study, we explore various methods to achieve this, including BERT encoding, BERT fine-tuning, and prompting approaches like Few-shot and Chain-of-Thought (CoT). Our experiments on the CSEDM Challenge dataset, consisting of 50 unique problems and 69627 student code submissions, reveal that BERT encoding has the lowest performance, while Code-BERT encodings show promise. BERT fine-tuning, particularly BERT-base-uncased, performs slightly better, which we attribute to the diverse language used in the prompt and technical constraints in the runtime environment. Among the prompting approaches, Few-shot demonstrates the lowest error, indicating the effectiveness of constructing in-context examples. However, CoT does not perform as well, partly due to the assumption that every code contains only a single bug. Overall, we find that prompting approaches show the best performance, followed by fine-tuning and then encoding approaches. Our findings provide insights into the efficacy of different methods for predicting code success rates and can inform future research in this area. This study was funded by the Undergraduate Research Volunteers program during winter break. 
Keywords
Natural Language Processing, Computer Science in Education, Machine Learning
Research Area
Computer Science

SIMILAR ABSTRACTS (BY KEYWORD)

Research Area Presenter Title Keywords
Probability, Statistics, and Machine Learning Waghe, Shreyas Fair Machine Learning (0.896552), Machine Learning (1.0)
Computer Science Kakkar, Ishita Natural Language Processing
Computer Science Sejdi, Anxhela Machine Learning
Computer Science Baron, Cameron Shea Machine Learning
Artificial Intelligence Landaverde, Yeilin M. Machine Learning