Citation: XU Ji, PAN Jielin, YAN Yonghong, “Agglutinative Language Speech Recognition Using Automatic Allophone Deriving,” Chinese Journal of Electronics, vol. 25, no. 2, pp. 328-333, 2016, doi: 10.1049/cje.2016.03.020
D. Kiecza, T. Schultz and A. Waibel, "Data-driven determination of appropriate dictionary units for Korean speech recognition", Proc. of ICSP, Seoul, Korea, pp.323-327, 1999.
K. Carki, P. Geutner and T. Schultz, "Turkish LVCSR: Towards better speech recognition for agglutinative languages", Proc. of ICASSP, Istanbul, Turkey, pp.1563-1566, 2000.
T. Hirsimaki, M. Creutz, V. Siivola, et al., "Unlimited vocabulary speech recognition with morph language models applied to Finnish", Computer Speech and Language, pp.515-541, 2006.
H. Hong, S. Kim and M. Chung, "Effects of Allophones on the Performance of Korean Speech Recognition", Proc. of Interspeech, Brisbane, Australia, pp.2410-2413, 2008.
J. Xu, Y. Si, J. Pan, et al., "Automatic allophone deriving for Korean speech recognition", Proc. of IEEE 9th International Conference on Computational Intelligence and Security (CIS), Emei Mountain, China, pp.776-779, 2013.
H. Hermansky and S. Sharma, "Temporal patterns (TRAPs) in ASR of noisy speech", Proc. of ICASSP, Phoenix, USA, Vol.1, pp.289-292, 1999.
I. Taylor, "The Korean writing system: An alphabet? A syllabary? A logography?", Visible Language, pp.67-82, 1980.
O.W. Kwon and J. Park, "Korean large vocabulary continuous speech recognition with morpheme-based recognition units", Speech Communication, Vol.39, No.3-4, pp.287-300, 2003.
Sakriani Sakti, Andrew Finch, Ryosuke Isotani, et al., "Unsupervised determination of efficient Korean LVCSR units using a Bayesian Dirichlet process model", Proc. of ICASSP, Prague, Czech Republic, pp.4664-4667, 2011.
Sakriani Sakti, Andrew Finch, Chiori Hori, et al., "Conditional random fields for modeling Korean pronunciation variation", Proc. of the Paralinguistic Information and its Integration in Spoken Dialogue Systems Workshop Part 2, Granada, Spain, pp.49-55, 2011.
M. Kim, Y.R. Oh and H.K. Kim, "Non-native pronunciation variation modeling using an indirect data driven method", Proc. of ASRU, Kyoto, Japan, pp.231-236, 2007.
Huajian Xue, "Applying morphological rules to Uyghur continuous speech recognition", Ph.D. Thesis, The Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi, China, pp.13-15, 2012.
Gulilaadongbieke, "The research of proofreading for the Uighur character", Proc. of IEEE International Conference on Systems, Man and Cybernetics, Tucson, U.S.A, Vol.2, pp.874-876, 2001.
X. Li, S. Cai, J. Pan, et al., "Large vocabulary Uyghur continuous speech recognition based on stems and suffixes", Proc. of IEEE 7th International Symposium on Chinese Spoken Language Processing (ISCSLP), Tainan, China, pp.220-223, 2010.
Shao Jian, "Chinese spoken term detection towards large-scale telephone conversational speech", Ph.D. Thesis, Chinese Academy of Sciences, Beijing, China, pp.41-42, 2008.
X. Huang, A. Acero and H.W. Hon, Spoken Language Processing: A Guide to Theory, Algorithm, and System Development, Prentice Hall, Englewood Cliffs, U.S.A, pp.177-181, 2001.
K. Beulen and H. Ney, "Automatic question generation for decision tree based state tying", Proc. of ICASSP, Seattle, U.S.A, Vol.2, pp.805-808, 1998.
P. Schwarz, "Phoneme recognition based on long temporal context", Ph.D. Thesis, Brno University of Technology, Brno, Czech Republic, pp.11-12, 2008.