| ||||
| ||||
![]() Title:Towards Classifying Bird Sounds Using a Deep Transfer Learning Model Authors:Saptarshi Dey, Soumi Ghosh, Soumapriyo Mondal, Akash Harh, Spandan Bandhu, Bidhan Barai and Pawan Kumar Singh Conference:ICDMAI2025 Tags:Bird Clef 2022 Dataset, Bird Species Classification, Cornell Birdcall Identification Dataset, Feature Extraction, Mel-Spectrogram, Short-Time Fourier Transform and VGG-16 Abstract: The conservation of bird biodiversity relies on accurately identifying and classifying species, which is often time-consuming and requires specialized knowledge. Recent advances in deep learning, particularly in convolutional neu- ral networks (CNNs), have made it possible to detect species passively from acoustic signals, even in challenging environments. This paper presents a high- performance deep convolutional neural network (CNN) model using the VGG- 16 architecture for the passive classification of bird sounds, using a remarkably accurate model of Short-Time Fourier Transform (STFT) that accounts for 97.31% of the BirdCLEF 2022 data set and 98.41% for the Cornell Birdcall Iden- tification dataset. The model discriminates between species, even in complex soundscapes with overlapping records. The framework also uses a tool-based consensus framework to enhance the focus on relevant features, improving clas- sification accuracy for rare and endangered species. This method is highly effec- tive in various phonological and language processing tasks and enhances the model's robustness, making it suitable for real-world applications. Towards Classifying Bird Sounds Using a Deep Transfer Learning Model ![]() Towards Classifying Bird Sounds Using a Deep Transfer Learning Model | ||||
Copyright © 2002 – 2025 EasyChair |