Pitch extraction using discrete cosine transform based power spectrum method in noisy speech
Abstract
The pitch period is a key component of many speech analysis research projects. In real-world applications, voice data is frequently gathered in noisy surround- ings, therefore algorithms must be able to manage background noise well in order to estimate pitch accurately. Despite advancements, many state-of–the-art algorithms struggle to deliver adequate results when faced with low signal-to- noise ratios (SNRs) in processing noisy speech signals. This research proposes an effective concept specifically designed for speech processing applications, particularly in noisy conditions. To achieve this goal, we introduce a fundamen- tal frequency extraction algorithm designed to tolerate non-stationary changes in the amplitude and frequency of the input signal. In order to improve the extrac- tion accuracy, we also use a cumulative power spectrum (CPS) based on discrete cosine transform (DCT) rather than conventional power spectrum. We enhance extraction accuracy of our method by utilizing shorter sub-frames of the input signal to mitigate the noise characteristics present in speech signals. According to the experimental results, our proposed technique demonstrates superior per- formance in noisy conditions compared to other existing state-of-the-art meth- ods without utilizing any kind of post-processing techniques.
Full Text:
PDFDOI: http://doi.org/10.11591/ijaas.v14.i3.pp955-965
Refbacks
- There are currently no refbacks.
Copyright (c) 2025 Humaira Sunzida, Nargis Parvin, Jafrin Akter Jeba, Sulin Chi, Md. Shiplu Ali, Moinur Rahman, Md. Saifur Rahman
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
View the IJAAS Visitor Statistics
International Journal of Advances in Applied Sciences (IJAAS)
p-ISSN 2252-8814, e-ISSN 2722-2594
This journal is published by Intelektual Pustaka Media Utama (IPMU) in collaboration with the Institute of Advanced Engineering and Science (IAES).