USF Tampa Graduate Theses and Dissertations

Exploring the Use of Enhanced SWAD Towards Building Learned Models that Generalize Better to Unseen Sources

Brandon M. Weinhofer, University of South Florida

Graduation Year

2024

Document Type

Thesis

Degree

M.S.Cp.

Degree Name

MS in Computer Engineering (M.S.C.P.)

Degree Granting Department

Computer Science and Engineering

Major Professor

Lawrence O. Hall, Ph.D.

Committee Member

Dmitry Goldgof, Ph.D.

Committee Member

Shaun Canavan, Ph.D.

Keywords

Deep Learning, Domain Generalization, Machine Learning, Snapshot Ensembles, SWAD

Abstract

Deep learning models, typically, take significant time to train. Classifier ensembles are areliable way to increase classifier accuracy and perhaps generalizability to unseen sources of data. These classifiers can be combined with a simple voting scheme. The problem is that having multiple models can very heavily increase training time. Snapshot ensembles have been shown to provide a boost in performance by creating an ensemble of classifiers with different weights during the training of a single deep learned model. This can somewhat solve the problem of the increased training time as you do not have to train separate models. As Machine Learning becomes a more common tool in the medical field, the domain general- ization problem becomes more important. Domain generalization is a deep-learned model’s ability to generalize to unseen sources. That is, to perform well on sources that were not seen during training. In image recognition and classification, especially in a medical setting, this is important because domain shifts can cause accuracy of medical image classification to drop significantly. Recent work in 2021 proposed a method named Stochastic Weight Av- eraging Densely (SWAD) that, through finding a flatter minima on the loss function during training, decreases the generalization gap between seen and unseen data. In this thesis, we show that the SWAD learning algorithm can increase the accuracy on out-of-domain chest X-ray data by 4.23% (P-value < 0.05) on average. In addition to this finding, an alteration to the SWAD algorithm is introduced that allows an increase in accuracy over the original algorithm on the chest X-ray dataset by 2.27% on average (2 tests P-value < 0.05, 2 tests P-value > 0.05).

Scholar Commons Citation

Weinhofer, Brandon M., "Exploring the Use of Enhanced SWAD Towards Building Learned Models that Generalize Better to Unseen Sources" (2024). USF Tampa Graduate Theses and Dissertations.
https://digitalcommons.usf.edu/etd/10261

Download

Included in

Computer Engineering Commons

COinS

USF Tampa Graduate Theses and Dissertations

Exploring the Use of Enhanced SWAD Towards Building Learned Models that Generalize Better to Unseen Sources

Graduation Year

Document Type

Degree

Degree Name

Degree Granting Department

Major Professor

Committee Member

Committee Member

Keywords

Abstract

Scholar Commons Citation

Included in

Search

Browse By

Useful Links

USF Tampa Graduate Theses and Dissertations

Exploring the Use of Enhanced SWAD Towards Building Learned Models that Generalize Better to Unseen Sources

Author

Graduation Year

Document Type

Degree

Degree Name

Degree Granting Department

Major Professor

Committee Member

Committee Member

Keywords

Abstract

Scholar Commons Citation

Included in

Share

Search

Browse By

Useful Links