Development of TTS Engine for Indian Accent using Modified HMM Algorithm

Sasanko Gantayat - GMR Institute of Technology, Rajam, Andhra Pradesh, India


DOI: http://dx.doi.org/10.30630/joiv.2.2.112

Abstract


A text-to-speech (TTS) system converts normal language text into speech. An intelligent text-to-speech program allows people with visual impairments or reading disabilities to listen to written works on a home computer. Many computer operating systems and everyday software applications, such as Adobe Reader, include text-to-speech systems. This paper shows how a hidden Markov model (HMM) can be used as a tool to convert text to speech.

Keywords


K-means; Text-to-speech; Speech synthesis; HMM algorithm

