INCORPORATING PHONETIC KNOWLEDGE INTO AN EVOLUTIONARY SUBSPACE APPROACH FOR ROBUST SPEECH RECOGNITION

S.A. Selouani, D. O’Shaughnessy, and J. Caelen

References

  [1] Y. Gong, Speech recognition in noisy environments: A survey, Speech Communication, 16, 1995, 261–291. doi:10.1016/0167-6393(94)00059-J
  [2] S.F. Boll, Suppression of acoustic noise in speech using spectral subtraction, IEEE Trans. on Acoustics, Speech and Signal Processing, 27(2), 1979, 113–120. doi:10.1109/TASSP.1979.1163209
  [3] D. Mansour & B.H. Juang, A family of distortion measures based upon projection operation for robust speech recognition, IEEE Trans. on Acoustics, Speech and Signal Processing, 37, 1989, 1659–1671. doi:10.1109/29.46548
  [4] S. Davis & P. Mermelstein, Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences, IEEE Trans. on Acoustics, Speech and Signal Processing, 28(4), 1980, 357–366. doi:10.1109/TASSP.1980.1163420
  [5] H. Hermansky, Perceptual linear predictive (PLP) analysis of speech, Journal of the Acoustical Society of America, 87(4), 1990, 1738–1752. doi:10.1121/1.399423
  [6] D.G. Stork & M.E. Hennecke, Speechreading by man and machine: Models, systems and applications (New York: NATO, ASI Series, Springer, 1996).
  [7] S.-A. Selouani & D. O’Shaughnessy, On the use of evolutionary algorithms to improve the robustness of continuous speech recognition systems in adverse conditions, EURASIP Journal on Applied Signal Processing, 8, 2003, 814–823. doi:10.1155/S1110865703302070
  [8] D. O’Shaughnessy, Speech communication: Human and machine (USA: Wiley-IEEE Press, 2001).
  [9] J. Picone, Signal modeling techniques in speech recognition, Proc. IEEE, 81(9), 1993, 1215–1247. doi:10.1109/5.237532
  [10] H. Hermansky & N. Morgan, RASTA processing of speech, IEEE Trans. on Speech and Audio Processing, 2(4), 1994, 578–589. doi:10.1109/89.326616
  [11] J. Caelen, Space/time data-information in the ARIAL project ear model, Speech Communication, 4(1), 1985.
  [12] N. Chomsky & M. Halle, Sound pattern of English (New York: Harper and Row, 1968).
  [13] R. Jakobson, G. Fant, & M. Halle, Preliminaries to speech analysis: The distinctive features and their correlates (Cambridge: MIT Press, 1963).
  [14] H. Tolba, S.-A. Selouani, & D. O’Shaughnessy, Auditory-based acoustic distinctive features and spectral cues for automatic speech recognition using a multi-stream paradigm, IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP’2002), Orlando, FL, 2002, 837–840.
  [15] Y. Ephraim & H.L. Van Trees, A signal subspace approach for speech enhancement, IEEE Trans. on Speech and Audio Processing, 3(4), 1995, 251–266. doi:10.1109/89.397090
  [16] A. Spalanzani, S.A. Selouani, & H. Kabre, Evolutionary algorithms for optimizing speech data projection, Genetic and Evolutionary Computation Conf., Orlando, FL, 1999, 1799.
  [17] T. Rudolph, Discriminative codebook design for critical word recognition using evolution strategies, Proc. 3rd IEEE International Conf. on Evolutionary Computation, 1996, 67–70. doi:10.1109/ICEC.1996.542335
  [18] Y. Sato, Interactive evolution of adaptive parameter for speaker verification systems, Proc. Genetic and Evolutionary Computation Conf., Morgan Kaufmann Publishers, Las Vegas, Nevada, 2000, 742–749.
  [19] Z. Michalewicz, Genetic algorithms + data structures = evolution programs, AI series (New York: Springer-Verlag, 1996).
  [20] L. Davis, Handbook of genetic algorithms, Chapter 17 (New York: Van Nostrand Reinhold, 1991).
  [21] D.E. Goldberg, Genetic algorithms in search, optimization and machine learning (Boston, USA: Addison-Wesley Publishing, 1989).
  [22] C.R. Houck, J.A. Joines, & M.G. Kay, A genetic algorithm for function optimization: A MATLAB implementation, Technical Report, North Carolina State University, NCSU-IE, 1995.
  [23] J. Hernando & C. Nadeu, A comparative study of parameters and distances for noisy speech recognition, Proc. Eurospeech, Genoa, Italy, 1991, 91–94.
  [24] W. Fisher, G. Doddington, & K. Goudie-Marshall, The DARPA speech recognition research database: Specification and status, DARPA Workshop on Speech Recognition, 1986.
  [25] Cambridge University Speech Group, The HTK book (Version 2.1.1) (Cambridge, England: Cambridge University Group, 1997).
