Development of TTS Engine for Indian Accent using Modified HMM Algorithm

Sasanko Sekhar Gantayat

doi:10.30630/joiv.2.2.112

Sasanko Gantayat - GMR Institute of Technology, Rajam, Andhra Pradesh, India

Abstract

A text-to-speech (TTS) system converts normal language text into speech. An intelligent text-to-speech program allows people with visual impairments or reading disabilities to listen to written works on a home computer. Many computer operating systems and day-to-day software applications, such as Adobe Reader, include text-to-speech systems. This paper shows how a hidden Markov model (HMM) can be used as a tool to convert text to speech.

Keywords

K-means; Text-to-speech; Speech synthesis; HMM Algorithm
