Piecewise position encoding in convoutional neural network for cough-based COVID-19 detection
Jiakun Shen (Institute of Acoustics, Chinese Academy of Sciences); XueShuai Zhang (University of Chinese Academy of Sciences); pengyuan zhang ( Institute of Acoustics, Chinese Academy of Sciences); Yonghong Yan ( Institute of Acoustics, Chinese Academy of Sciences); Shaoxing Zhang (Peking University Third Hospital); Zhihua Huang (Xinjiang University); Yanfen Tang (Beijing Ditan Hospital Capital Medical University); Yu Wang (Beijing Ditan Hospital Capital Medical University); Fujie Zhang (Beijing Ditan Hospital Capital Medical University); Aijun Sun (Dalian Public Health Clinical Center)
-
SPS
IEEE Members: $11.00
Non-members: $15.00
A fast and efficient COVID-19 detection method is of vital importance to control the spread of the epidemic. Many studies have achieved good performance on cough-based COVID-19 detection in the past two years. However, the effect of position information in time-frequency features of cough audio has been less considered in previous studies. Even the convolutional neural networks that are capable to learn position information may be affected by small transformations of input features. Therefore, we propose piecewise position encoding added to time-frequency features to provide supplementary position information explicitly. Considering the differences in recording devices among different people, we use modified instance normalization to achieve better generalization. The proposed methods are validated on three open-sourced datasets and achieve significant improvements in AUC and UAR. The proposed model also shows competitive results in detecting asymptomatic patients.