Patch Based Analysis With Machine Learning to Aid Breast Cancer Recurrence Prediction

dc.contributor.advisorHerndon, Nic
dc.contributor.authorRose, Madison
dc.contributor.committeeMemberDavid Hart
dc.contributor.committeeMemberRui Wu
dc.contributor.departmentComputer Science
dc.date.accessioned2024-07-19T14:00:01Z
dc.date.available2024-07-19T14:00:01Z
dc.date.created2024-05
dc.date.issuedMay 2024
dc.date.submittedMay 2024
dc.date.updated2024-07-16T20:36:01Z
dc.degree.collegeCollege of Engineering and Technology
dc.degree.departmentComputer Science
dc.degree.grantorEast Carolina University
dc.degree.majorMS-Data Science
dc.degree.nameM.S.
dc.description.abstractSince the introduction of whole slide scanners, machine learning research has become a popular area of interest in digital pathology. Many studies have attempted to use machine learning to aid pathology tasks such as breast cancer diagnosis and metastasis detection. However, one area that has less available research is in applying machine learning to predict patient recurrence risk categories. Since H&E-stained images are routinely collected for diagnostic purposes, creating an image-based recurrence prediction method could help increase accessibility and lower cost for recurrence risk category assessment for breast cancer patients. In this study, patches were extracted from a dataset of 102 whole slide images to train a machine learning model to predict slide level breast cancer Oncotype DX risk category using only H&E-stained images with no additional clinical data or region of interest annotations. Multiple patch size and patch quantity combinations were tested. Patches were extracted from each whole slide image and feature extraction was performed before the features were aggregated together to create a bag of features for each case. These bags were then used to train a logistic regression model. The best scoring model utilized 2,000 patches of size 256 x 256 pixels. This model scored 0.628 ± 0.044 accuracy on 5-fold cross validation across the entire dataset.
dc.etdauthor.orcid0009-0005-1283-1228
dc.format.mimetypeapplication/pdf
dc.identifier.urihttp://hdl.handle.net/10342/13392
dc.publisherEast Carolina University
dc.subject.lcshBreast--Cancer--Imaging
dc.subject.lcshBreast--Cancer--Prognosis
dc.subject.lcshMachine learning
dc.subject.lcshImage processing
dc.titlePatch Based Analysis With Machine Learning to Aid Breast Cancer Recurrence Prediction
dc.typeMaster's Thesis
dc.type.materialtext
local.embargo.lift2025-05-01
local.embargo.terms2025-05-01

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
1330140189\1713982275718-ROSE-PRIMARY-2024.pdf
Size:
1.34 MB
Format:
Adobe Portable Document Format

Collections