Graduation Year
2024
Document Type
Thesis
Degree
M.S.Cp.
Degree Name
MS in Computer Engineering (M.S.C.P.)
Degree Granting Department
Computer Science and Engineering
Major Professor
Lawrence O. Hall, Ph.D.
Committee Member
Dmitry Goldgof, Ph.D.
Committee Member
Shaun Canavan, Ph.D.
Keywords
Deep Learning, Domain Generalization, Machine Learning, Snapshot Ensembles, SWAD
Abstract
Deep learning models, typically, take significant time to train. Classifier ensembles are areliable way to increase classifier accuracy and perhaps generalizability to unseen sources of data. These classifiers can be combined with a simple voting scheme. The problem is that having multiple models can very heavily increase training time. Snapshot ensembles have been shown to provide a boost in performance by creating an ensemble of classifiers with different weights during the training of a single deep learned model. This can somewhat solve the problem of the increased training time as you do not have to train separate models. As Machine Learning becomes a more common tool in the medical field, the domain general- ization problem becomes more important. Domain generalization is a deep-learned model’s ability to generalize to unseen sources. That is, to perform well on sources that were not seen during training. In image recognition and classification, especially in a medical setting, this is important because domain shifts can cause accuracy of medical image classification to drop significantly. Recent work in 2021 proposed a method named Stochastic Weight Av- eraging Densely (SWAD) that, through finding a flatter minima on the loss function during training, decreases the generalization gap between seen and unseen data. In this thesis, we show that the SWAD learning algorithm can increase the accuracy on out-of-domain chest X-ray data by 4.23% (P-value < 0.05) on average. In addition to this finding, an alteration to the SWAD algorithm is introduced that allows an increase in accuracy over the original algorithm on the chest X-ray dataset by 2.27% on average (2 tests P-value < 0.05, 2 tests P-value > 0.05).
Scholar Commons Citation
Weinhofer, Brandon M., "Exploring the Use of Enhanced SWAD Towards Building Learned Models that Generalize Better to Unseen Sources" (2024). USF Tampa Graduate Theses and Dissertations.
https://digitalcommons.usf.edu/etd/10261