A New Human Voice Recognition System
DOI:
https://doi.org/10.51983/ajsat-2016.5.2.931Keywords:
Human Voice, Decimated wavelet, LPC, RASTA and Euclidean DistanceAbstract
In an effort to provide a more efficient representation of the speech signal, the application of the wavelet analysis is considered. This research presents an effective and robust method for extracting features for speech processing. Here, we proposed a new human voice recognition system using the combination of decimated wavelet (DW) and Relative Spectra Algorithm with Linear Predictive coding. First, we will apply the proposed techniques to the training speech signals and then form a train feature vector which contains the low level features extracted, wavelet and linear predictive coefficients. Afterwards, the same process will be applied to the testing speech signals and will form a test feature vector. Now, we will compare the two feature vectors by calculating the Euclidean distance between the vectors to identify the speech and speaker. If the distance between two vectors is near to zero then the tested speech/speaker will be matched with the trained speech/speaker. Simulation results have been compared with LPC scheme, and shown that the proposed scheme has performed superior to the existing technique by using the fifty preloaded voice signals from six individuals, the verification tests have been carried and an accuracy rate of approximately 90 % has been achieved.
References
S. Oraintara et al., "IEEE Transactions on Signal Processing," IFFT, vol. 50, no. 3, March 2002.
K. Wong, "Journal of Undergraduate Research," The Role of the Fourier Transform in Time-Scale Modification, University of Florida, vol. 2, issue 11, August 2011.
B. Liu and S. Riemenschneider, "An Adaptive Time Frequency Representation and Its Fast Implementation," Department of Mathematics, West Virginia University.
V. Ganapathy et al., "Signal Separation using Time Frequency Representation," Honeywell Technology Solutions Laboratory.
A. Graps, "An Introduction to Wavelets," Istituto di Fisica dello Spazio Interplanetario, CNR-ARTOV.
B. Vidakovic and P. Mueller, "Wavelets For Kids – A Tutorial Introduction," Duke University.
O. Farooq and S. Datta, "A Novel Wavelet Based Pre Processing For Robust Features In ASR."
G. Antoniol et al., "LPC & Cepstrum coefficients for Mining Time Variant Information from Software Repositories," IEEE Transactions on Software Engineering, University Of Sannio, Italy.
M. Unser and T. Blu, "Wavelet Theory Demystified," IEEE Transactions on Signal Processing, vol. 51, no. 2, Feb’13.
C. Valens, "A Really Friendly Guide to Wavelets," IEEE, vol. 86, no. 11, Nov 2012.
J. M. Lewis and C. S. Burrus, "Approximate CWT with An Application To Noise Reduction," Rice University, Houston.
T. Painter and A. Spanias, "Perceptual Coding of Digital Audio," IE EE, ASU.
D. P. W. Ellis, "PLP,RASTA, MFCC & inversion Matlab," 2005.
R. Singh, "Spectral Subtraction Speech Enhancement with RASTA Filtering," Proceedings of the NCC, IIT-B, 2012.
N. Sawhney, "Situational Awareness from Environmental Sounds," SIG, MIT Media Lab, June 13, 2013.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2016 The Research Publication
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.