Transfer Learning From Youtube Soundtracks To Tag Arctic Ecoacoustic Recordings
Enis Berk Ãoban, Dara Pir, Richard So, Michael I Mandel
-
SPS
IEEE Members: $11.00
Non-members: $15.00Length: 13:24
Sound provides a valuable tool for long-term monitoring of sensitive animal habitats at a spatial scale larger than camera traps or field observations, while also providing more details than satellite imagery. Currently, the ability to collect such recordings outstrips the ability to analyze them manually, necessitating the development of automatic analysis methods. While several datasets and models of large corpora of video soundtracks have recently been released, it is not clear to what extent these models will generalize to environmental recordings and the scientific questions of interest in analyzing them. This paper investigates this generalization in several ways and finds that models themselves display limited performance, however, their intermediate representations can be used to train successful models on small sets of labeled data.