Development of TTS Engine for Indian Accent using Modified HMM Algorithm

Sasanko Sekhar Gantayat


A text-to-speech (TTS) system converts normal language text into speech. An intelligent text-to-speech program allows people with visual impairments or reading disabilities to listen to written works on a home computer. Many computer operating systems and everyday software applications, such as Adobe Reader, include text-to-speech systems. This paper shows how a hidden Markov model (HMM) can be used as a tool to convert text to speech.


K-means; Text-to-speech; Speech synthesis; HMM algorithm






This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

JOIV : International Journal on Informatics Visualization
Published by Information Technology Department
Politeknik Negeri Padang, Indonesia

© JOIV - ISSN : 2549-9610 | e-ISSN : 2549-9904 

