Repository logo
 

Risk-Based Test Case Prioritization Using Large Language Models in Regression Testing

dc.contributor.advisorMadhusudan Srinivasan
dc.contributor.authorGuzman-Sanchez, Jose
dc.contributor.committeeMemberNic Herndon
dc.contributor.committeeMemberNasseh Nassehzadeh-Tabrizi
dc.contributor.departmentComputer Science
dc.date.accessioned2025-06-05T17:32:06Z
dc.date.available2025-06-05T17:32:06Z
dc.date.created2025-05
dc.date.issuedMay 2025
dc.date.submittedMay 2025
dc.date.updated2025-05-22T21:15:15Z
dc.degree.collegeCollege of Engineering and Technology
dc.degree.grantorEast Carolina University
dc.degree.majorMS-Software Engineering
dc.degree.nameM.S.
dc.degree.programMS-Software Engineering
dc.description.abstractRegression testing is critical to ensuring software quality after performing code modifications. However, complete test execution on complex and robust test suites can be infeasible due to time and resource constraints. Therefore, test case prioritization (TCP) strategies aim to organize test cases to increase fault detection rates early during test execution. This study proposes a risk-based test case prioritization approach that leverages large language models (LLMs) to estimate the fault-proneness of individual methods to guide the prior- itization process. An LLM is fine-tuned to predict the risk score of each function based on several software metrics, which is used to perform static analysis of test cases to determine an overall risk ranking. The prioritized test suites are evaluated using established metrics, including Fault Detection Rate (FDR) and Average Percentage of Faults Detected (APFD). The evaluation of this approach is compared against baseline techniques such as coverage-based and randomized prioritization. The results of this experiment, conducted on open-source Java projects, determined that the risk-based LLM prioritization approach outperforms traditional TCP methods in early fault detection, highlighting the potential of including LLMs in regression testing workflows.
dc.format.mimetypeapplication/pdf
dc.identifier.urihttp://hdl.handle.net/10342/14048
dc.language.isoEnglish
dc.publisherEast Carolina University
dc.subjectComputer Science
dc.titleRisk-Based Test Case Prioritization Using Large Language Models in Regression Testing
dc.typeMaster's Thesis
dc.type.materialtext

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
GUZMAN-SANCHEZ-PRIMARY-2025.pdf
Size:
793.72 KB
Format:
Adobe Portable Document Format