INCORPORATING PHONETIC KNOWLEDGE INTO AN EVOLUTIONARY SUBSPACE APPROACH FOR ROBUST SPEECH RECOGNITION

doi:10.2316/Journal.202.2007.2.202-1885


S.A. Selouani, D. O’Shaughnessy, and J. Caelen

References

[1] Y. Gong, Speech recognition in noisy environments: A survey, Speech Communication, 16, 1995, 261–291. doi:10.1016/0167-6393(94)00059-J
[2] S.F. Boll, Suppression of acoustic noise in speech using spectral subtraction, IEEE Trans. on Acoustics, Speech and Signal Processing, 29, 1979, 113–120. doi:10.1109/TASSP.1979.1163209
[3] D. Mansour & B.H. Juang, A family of distortion measures based upon projection operation for robust speech recognition, IEEE Trans. on Acoustics, Speech and Signal Processing, 37, 1989, 1659–1671. doi:10.1109/29.46548
[4] S. Davis & P. Mermelstein, Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences, IEEE Trans. on Acoustics, Speech and Signal Processing, 28(4), 1980, 357–366. doi:10.1109/TASSP.1980.1163420
[5] H. Hermansky, Perceptual linear predictive (PLP) analysis of speech, Journal of the Acoustical Society of America, 87(4), 1990, 1738–1752. doi:10.1121/1.399423
[6] D.G. Stork & M.E. Hennecke, Speechreading by man and machine: Models, systems and applications (New York: NATO ASI Series, Springer, 1996).
[7] S.-A. Selouani & D. O’Shaughnessy, On the use of evolutionary algorithms to improve the robustness of continuous speech recognition systems in adverse conditions, EURASIP Journal on Applied Signal Processing, 8, 2003, 814–823. doi:10.1155/S1110865703302070
[8] D. O’Shaughnessy, Speech communication: Human and machine (USA: Wiley-IEEE Press, 2001).
[9] J. Picone, Signal modeling techniques in speech recognition, Proc. IEEE, 81(9), 1993, 1215–1247. doi:10.1109/5.237532
[10] H. Hermansky & N. Morgan, RASTA processing of speech, IEEE Trans. on Speech and Audio Processing, 2(4), 1994, 578–589. doi:10.1109/89.326616
[11] J. Caelen, Space/time data-information in the ARIAL project ear model, Speech Communication, 4(1), 1985.
[12] N. Chomsky & M. Halle, Sound pattern of English (New York: Harper and Row, 1968).
[13] R. Jakobson, G. Fant, & M. Halle, Preliminaries to speech analysis: The distinctive features and their correlates (Cambridge: MIT Press, 1963).
[14] H. Tolba, S.-A. Selouani, & D. O’Shaughnessy, Auditory-based acoustic distinctive features and spectral cues for automatic speech recognition using a multi-stream paradigm, IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP’2002), Orlando, FL, 2002, 837–840.
[15] Y. Ephraim & H.L. Van Trees, A signal subspace approach for speech enhancement, IEEE Trans. on Speech and Audio Processing, 3(4), 1995, 251–266. doi:10.1109/89.397090
[16] A. Spalanzani, S.A. Selouani, & H. Kabre, Evolutionary algorithms for optimizing speech data projection, Genetic and Evolutionary Computation Conf., Orlando, FL, 1999, 1799.
[17] T. Rudolph, Discriminative codebook design for critical word recognition using evolution strategies, Proc. 3rd IEEE International Conf. on Evolutionary Computation, 1996, 67–70. doi:10.1109/ICEC.1996.542335
[18] Y. Sato, Interactive evolution of adaptive parameter for speaker verification systems, Proc. Genetic and Evolutionary Computation Conf., Morgan Kaufmann Publishers, Las Vegas, Nevada, 2000, 742–749.
[19] Z. Michalewicz, Genetic algorithms + data structures = evolution programs, AI series (New York: Springer-Verlag, 1996).
[20] L. Davis, The genetic algorithm handbook, Chapter 17 (New York: Van Nostrand Reinhold, 1991).
[21] D.E. Goldberg, Genetic algorithms in search, optimization and machine learning (Boston, USA: Addison-Wesley Publishing, 1989).
[22] C.R. Houck, J.A. Joines, & M.G. Kay, A genetic algorithm for function optimization: A MATLAB implementation, Technical Report, North Carolina State University, NCSU-IE, 1995.
[23] J. Hernando & C. Nadeu, A comparative study of parameters and distances for noisy speech recognition, Proc. Eurospeech, Genoa, Italy, 1991, 91–94.
[24] W. Fisher, G. Doddington, & K. Goudie-Marshall, The DARPA speech recognition research database: Specification and status, DARPA Workshop on Speech Recognition, 1986.
[25] Cambridge University Speech Group, The HTK book (Version 2.1.1) (Cambridge, England: Cambridge University Group, 1997).

From Journal (202) International Journal of Computers and Applications - 2007
