Graduation Year

2020

Document Type

Thesis

Degree

M.S.C.S.

Degree Name

MS in Computer Science (M.S.C.S.)

Degree Granting Department

Engineering Computer Science

Major Professor

Dmitry Goldgof, Ph.D.

Committee Member

Yu Sun, Ph.D.

Committee Member

Shaun Canavan, Ph.D.

Keywords

bias, dropbox, mish, mosaic, YOLOv3

Abstract

There are many face detection classification models available for download and use in the modern technological world. Based in the field of deep neural networks, these off-the-shelf solutions are generally inadequate to solve real world challenges. This work presents how current approaches biased towards detecting adult human faces must be modified in order to better accommodate face detection of the neonate in a NICU setting.

YOLO is a powerful object detection algorithm. Due to optimizations such as Cross mini-batch Normalization, Modified Spatial Attention Modules, Modified Path Aggregation Networks, Self-Adversarial Training, Mosaic Data Augmentation, DropBox Regularization, Multi-Input Weighted Residual Connections and Mish-activation functions, in its most recent form, YOLOv5 will be lightweight enough to be integrated into the clinical setting. As YOLO has always been a front runner in terms of computation speed, with the optimizations above YOLOv5 is able to make real-time detection. With appropriate training, YOLOv5 will be able to accurately detect the face of the neonate and overcome bias, occlusion and extremely difficult background context that normally cause inaccurate results with other object detection methods.

Share

COinS