
Hidden Markov model and Persian speech recognition | ||
International Journal of Nonlinear Analysis and Applications | ||
Article 242, Volume 14, Issue 1, Farvardin 2023, Pages 3111-3119 Full article (393.6 K) | ||
Article type: Research Paper | ||
Digital Object Identifier (DOI): 10.22075/ijnaa.2022.27851.3735 | ||
Author | ||
Masoume Shafieian* | ||
Assistant Professor, Department of Technology and Media Engineering, IRIBU University, Tehran, Iran. | ||
Received: 26 Khordad 1401; Revised: 28 Tir 1401; Accepted: 18 Mordad 1401 | ||
Abstract | ||
Speech recognition, the process of converting an audio signal into its equivalent text, has become one of the most important research topics. Although many studies have addressed speech recognition for many of the world's languages, comparatively little work has been done on Persian, so further study is needed. Persian is a rich language that can form many new words by adding a suffix or prefix to a root; it can therefore be said that the success rate of speech recognition programs for this language has risen as the number of phonemes has increased, and that significant further improvement is possible. This study therefore takes a practical approach to Persian speech recognition based on syllables, a unit between phonemes and words, implemented with a hidden Markov model. After the syllable utterances are obtained, multiple coefficients are calculated for every syllable. Suitable models were then created, and the success rate was calculated by testing the systems. System performance was measured with the error rate criterion. The results show that the word error rate for the hidden Markov model was 18.3%, and that post-processing improved system performance by approximately 16%. | ||
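The abstract reports a word error rate but does not say how it was computed. The sketch below assumes the conventional definition: the minimum number of word substitutions, deletions and insertions needed to turn the recognized text into the reference, divided by the number of reference words. The function name `word_error_rate` and the sample strings are illustrative only and do not come from the paper.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Assumes whitespace tokenization; the paper's own tokenization and
    scoring procedure are not described in the abstract.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution and one deletion against four reference words.
    print(word_error_rate("a b c d", "a x c"))
```

Under this definition a score can exceed 1.0 when the hypothesis contains many inserted words, so it is a rate relative to the reference length rather than a bounded percentage.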
Keywords | ||
Hidden Markov Model; Persian language; Speech Recognition; Syllable; Syllable Based Speech Recognition | ||