Fundamental frequency extraction by utilizing modified BaNa in noisy speech

Arpita Saha, Nargis Parvin, Dr. Md. Saifur Rahman, Moinur Rahman, Any Chowdhury

Abstract


The pitch of a sound is perceived largely through its fundamental frequency (F0). Many algorithms have been developed for F0 extraction, and the choice among them depends on the noise and the characteristics of the signal. Noise robustness is therefore crucial for accurate F0 estimation. Even so, many state-of-the-art algorithms fail to produce acceptable results on noisy speech recordings with low signal-to-noise ratios (SNRs). In this paper, we focus on the harmonic selection step of the BaNa method, which is vital to the accuracy of F0 extraction in noisy conditions. The BaNa algorithm uses five harmonics for both male and female speakers. However, our observations reveal that relying on five harmonics is inadequate for male speakers in noisy conditions. We therefore propose a modified BaNa approach that uses three harmonics for male speakers and five harmonics for female speakers to achieve accurate pitch extraction in noisy environments. The results demonstrate that the proposed approach attains the lowest gross pitch error (GPE) rate across various noise types and SNR levels.
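As a rough illustration of the two ideas in the abstract, the sketch below shows a speaker-dependent harmonic count and a GPE computation. It is not the authors' implementation. The function names, the use of 0 to mark unvoiced frames, and the 10% tolerance are assumptions made for this example; the abstract does not state the threshold used in the paper.

```python
# Minimal sketch, not the paper's code. Names and defaults are hypothetical.

# Harmonic counts stated in the abstract: 3 for male, 5 for female speakers.
HARMONICS_BY_SPEAKER = {"male": 3, "female": 5}


def harmonics_for_speaker(speaker):
    """Return how many harmonics to keep for the given speaker group."""
    return HARMONICS_BY_SPEAKER[speaker]


def gross_pitch_error(f0_ref, f0_est, tolerance=0.10):
    """Fraction of voiced frames whose estimate deviates from the
    reference F0 by more than `tolerance` (relative).

    Assumptions: frames are aligned, F0 is in Hz, and a reference
    value of 0 marks an unvoiced frame, which is skipped.
    """
    voiced = [(r, e) for r, e in zip(f0_ref, f0_est) if r > 0]
    if not voiced:
        raise ValueError("no voiced frames in the reference track")
    gross = sum(1 for r, e in voiced if abs(e - r) > tolerance * r)
    return gross / len(voiced)


if __name__ == "__main__":
    reference = [100.0, 200.0, 0.0, 150.0]   # third frame is unvoiced
    estimate = [105.0, 260.0, 120.0, 150.0]  # second frame is a gross error
    # One of the three voiced frames is off by more than 10%.
    print(gross_pitch_error(reference, estimate))
    print(harmonics_for_speaker("male"))
```

A lower GPE means fewer frames with large pitch errors, which is the sense in which the abstract compares methods across noise types and SNR levels.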



DOI: http://doi.org/10.11591/ijaas.v13.i3.pp515-529



International Journal of Advances in Applied Sciences (IJAAS)
p-ISSN 2252-8814, e-ISSN 2722-2594
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

