Abstract: In this paper, we developed two
sets of phonetically balanced word (PBW) lists in
Filipino and two sets of phoneme-level Hidden Markov
Models (HMMs). The PBW lists were based on textbooks used in public schools in the Philippines and were used to build
a speech corpus recorded from fifty
speakers (25 male and 25 female). For the 2-syllable
word list (PBW2), the recognizer achieved an average accuracy rate of 88.95% in the speaker-dependent test and
82.57% in the speaker-independent test. For the 3-syllable word list
(PBW3), the recognizer achieved an accuracy rate of 90.28% in the
speaker-dependent test and 83.30% in the speaker-independent test.
Keywords: Filipino phonetically balanced words, Filipino word corpus, Hidden Markov Model, Automatic Speech Recognition