Speaker Recognition System using Wavelet Transform under Stress Condition

Kshirabdhi Tanaya Biswal, Janmejaya Rout

Abstract


in this paper, we introduced a text-depend speaker recognition by using wavelet transform under stressed conditions. Here we compare different feature such as ARC, LAR, LPCC, MFCC, CEP and after comparison we found that LPCC provides best feature. For decompose signal at two levels Discrete Wavelet Transform is used here. Discrete Wavelet Transform (DWT) based Linear Predictive Cepstral Coefficients (LPCC) used as a feature for recognized the speaker system. For classification Vector Quantization method is used. Four different stressed data has selected for (SUSAS) i.e. stress speech data base for speaker recognition. Improvement is achieved 93% and 94% in case of Lombard and Neutral case.


References


J.P. Campbell, “Speaker Recognition: A Tutorial,”

Proc. IEEE., Vol. 85, no. 9, Sep 1997.

Herman J.M.Steeneken and John.H.L.hasen ‘‘Speech

under stress condition.”IEEE “Robust Speech

process”1999.

Sanjay. Patil“ Speech under stress: Analysis, Modeling

And Recognition,” January 2007.

W.Alkhadi, W.Fakhar and N.Handy ‘‘Automatic

Speech/speaker recognition in noisy environments using

WT,”senior member, IEEE2002.

Shailaja S Yadav, D.G. Bhalke , ‘‘ Speaker

Identification System using Wavelet Transform and

VQ modeling Technique,’’International Journal Of

Computer Applications Volume 112 – No. 9, February

Mahmoud I. Abdalla, Haitham M. Abobakr, Tamer S.

Gaafar ‘‘DWT AndMfccs Based Feature Extraction

Methods For Isolated Word Recognition,’’Volume

– No.20, May 2013 21.

TaabishGulzar, Anand Singh, Sandeep Sharma,

“Comparative Analysis of LPCC, MFCC and BFCC

For the Recognition using Artifical Neural Network,”

International Journal of Computer Application Volume 101-No.12, September 2014.

J.H.L Hasen , Mark A .Clements ‘‘Stresscompensation

noise reduction algorithm for robust speech recognition”

IEEE.

Levent M. Arslan and John H.L Hasen ‘‘Speech

enhancement for cross talk interference” IEEE signal

processing letter, VOL.4, no.4, APRIL 1997.

Murray , I. R, Baber. c and south. A .j “towards

defination and working model of stress and its effects

on Speech.” Speech enhancement communication

vol.20,nov. 1-2,1996.

H.Hermansky, S. Tibrewala, and M. Pavel,“Towards

ASR on Partially Corrupted Speech,” in Proceedings

of ZCSLP, 1996.

J.N.Gowdy, Z.Tufekci ‘‘Mel-scale Discrete Wavelet

Coefficiant for Speech recognition”2000.

Evan Ruzanski1, John H.L. ‘‘ Stress Level

Classification of Speech Using Euclidean Distance

Metricsin a Novel Hybrid Multi-Dimensional Feature

SpaceHansen”2006.

Alm,C.O., Roth, D., Sproat, R.: Emotions from Text:

‘‘Machine Learning for Text based Emotion Prediction. In: Proceedings of HLT/EMNLP 05, Vancouver’’ (2005).

A. Mantilla-Caeiros, M. Nakano-Miyatake, H. Perez-

Meana, “A New Wavelet Function for Audio and

Speech Processing”, 50th MWSCAS, pp. 101-

(2007).

HynekBoˇril, Member, IEEE, and John H. L. Hansen,

Fellow, IEEE ‘‘Unsupervised Equalization of

Lombard Effect for Speech Recognition in Noisy

Adverse Environments” Transactions On Audio,

Speech, And Language Processing, Vol. 18, No. 6,

August 2010.

RuhiSarikaya and John N. Gowdy‘‘ Wavelet Based

Analysis of Speech Under Stress” Digital Speech and


Full Text: PDF [Full Text]

Refbacks

  • There are currently no refbacks.


Copyright © 2013, All rights reserved.| ijseat.com

Creative Commons License
International Journal of Science Engineering and Advance Technology is licensed under a Creative Commons Attribution 3.0 Unported License.Based on a work at IJSEat , Permissions beyond the scope of this license may be available at http://creativecommons.org/licenses/by/3.0/deed.en_GB.