University of Tasmania

File(s) under permanent embargo

Automatic rain and cicada chorus filtering of bird acoustic data

journal contribution
posted on 2023-05-20, 03:44 authored by Alexander Brown, Saurabh Garg, Erin Montgomery

Recording and analysing environmental audio has become a common approach for monitoring the environment. This has several advantages over other approaches, such as reducing costs by avoiding the need for experts to be present in the area of interest. A current problem with analysing environmental recordings is interference from noise that can mask vocalisations of interest. This makes detecting these vocalisations more difficult and can require additional resources. While some work has been done to remove stationary noise from environmental recordings, there has been little effort to remove noise from non-stationary sources, such as rain, wind, engines, and animal vocalisations that are not of interest. This work addresses the challenge of filtering noise from rain and cicada choruses out of recordings containing bird sound. The use of acoustic indices and Mel Frequency Cepstral Coefficients (MFCCs) with machine learning classifiers is investigated to find the most effective filters. Hyperparameters for several classification approaches are investigated to fine-tune models and achieve the best results. The approach enables users to set thresholds that increase or decrease the sensitivity of classification, based on the prediction probability output by the classifiers. A novel approach to removing cicada choruses using bandpass filters is also proposed. A threshold-based approach (Multi-Layer Perceptron with Acoustic Indices and MFCCs) for rain detection is derived which achieves an AUC of 0.9911 and is more accurate than existing approaches when set to the same sensitivities. Using a Support Vector Machine (SVM) classifier with MFCCs, cicada choruses in the training set are classified with 100% accuracy under 10-fold cross-validation.
The cicada filtering approach increased the median signal-to-noise ratio of affected recordings from 0.53 for unfiltered audio to 1.86 for audio filtered by both the cicada filter and a common stationary noise filter.
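
The thresholding idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `flag_segments`, the example probabilities, and the threshold values are all hypothetical, and the sketch assumes only that some classifier has already produced a rain probability for each audio segment.

```python
from typing import List, Sequence


def flag_segments(probabilities: Sequence[float], threshold: float = 0.5) -> List[bool]:
    """Flag each segment whose predicted probability meets the threshold.

    Hypothetical helper for illustration; a lower threshold flags more
    segments (higher sensitivity), a higher threshold flags fewer.
    """
    if not 0.0 <= threshold <= 1.0:
        raise ValueError("threshold must be in [0, 1]")
    return [p >= threshold for p in probabilities]


# Made-up per-segment rain probabilities, standing in for classifier output.
rain_probabilities = [0.05, 0.40, 0.62, 0.91]

strict = flag_segments(rain_probabilities, threshold=0.8)   # fewer segments flagged
lenient = flag_segments(rain_probabilities, threshold=0.3)  # more segments flagged
```

With these made-up values, the strict setting flags only the last segment, while the lenient setting flags the last three. Which threshold is appropriate would depend on how costly missed rain is relative to discarding usable audio.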

History

Publication title

Applied Soft Computing

Volume

81

Article number

105501

Number

105501

Pagination

1-15

ISSN

1568-4946

Department/School

School of Information and Communication Technology

Publisher

Elsevier BV

Place of publication

Netherlands

Rights statement

Copyright 2019 Published by Elsevier B.V.

Repository Status

  • Restricted

Socio-economic Objectives

Information systems, technologies and services not elsewhere classified
