How does language model size effects speech recognition accuracy for the Turkish language?

Asefisaray, Behnam; Meng��o�lu, Erhan; Hac��mero�lu, Murat; Sever, Hayri

Volume : 30 Issue : 2 Year : 2024

30/2Current Issue Ahead of Print Archive Most Accessed Articles Manuscript Submission

How does language model size effects speech recognition accuracy for the Turkish language? [Pamukkale Univ Muh Bilim Derg]

Pamukkale Univ Muh Bilim Derg. 2016; 22(2): 100-105 | DOI: 10.5505/pajes.2015.03371

How does language model size effects speech recognition accuracy for the Turkish language?

Behnam Asefisaray¹, Erhan Meng��o�lu², Murat Hac��mero�lu³, Hayri Sever¹
¹Computer Engineering, Hacettepe University, Ankara, Turkey
²Computer Engineering, Ted University, Ankara, Turkey
³Computer Engineering, Gazi University, Ankara, Turkey

In this paper we aimed at investigating the effect of Language Model (LM) size on Speech Recognition (SR) accuracy. We also provided details of our approach for obtaining the LM for Turkish. Since LM is obtained by statistical processing of raw text, we expect that by increasing the size of available data for training the LM, SR accuracy will improve. Since this study is based on recognition of Turkish, which is a highly agglutinative language, it is important to find out the appropriate size for the training data. The minimum required data size is expected to be much higher than the data needed to train a language model for a language with low level of agglutination such as English. In the experiments we also tried to adjust the Language Model Weight (LMW) and Active Token Count (ATC) parameters of LM as these are expected to be different for a highly agglutinative language. We showed that by increasing the training data size to an appropriate level, the recognition accuracy improved on the other hand changes on LMW and ATC did not have a positive effect on Turkish speech recognition accuracy.

Keywords: Language model, Speech recognition systems, Language model weight, Active token count

T�rk�e ses tan�ma sistemlerinde dil modeli boyutunun do�ruluk oran�na etkisi

Behnam Asefisaray¹, Erhan Meng��o�lu², Murat Hac��mero�lu³, Hayri Sever¹
¹Bilgisayar M�hendisli�i B�l�m�, Hacettepe �niversitesi, Ankara, T�rkiye.
²Bilgisayar M�hendisli�i B�l�m�, Ted �niversitesi, Ankara, T�rkiye.
³Bilgisayar M�hendisli�i B�l�m�, Gazi �niversitesi, Ankara, T�rkiye.

Bu �al��man�n hedefi, Dil Modeli (DM) �retmek i�in kullan�lan metin derlem b�y�kl��n�n, Ses Tan�ma Sistemleri (STS) �zerindeki etkisini ara�t�rmakt�r. �al��mada ayr�ca DM elde etmek i�in yap�lmas� gereken i�ler detayl� olarak anlat�lmaktad�r. DM istatistiksel olarak olu�turuldu�undan, e�itim verisinde bulunan veri miktar� artt�k�a STS do�rulu�unun artmas� beklenmektedir. Fakat T�rk�e gibi sondan eklemeli dillerde, kullan�lan derlemin b�y�kl��n�n hangi noktaya kadar sistemin do�ruluk oran� �zerinde etkin olaca�� nem ta��maktad�r. Bu �al��mada, toplanan farkl� b�y�kl�kteki metin derlemleri ile konu�ma tan�ma sisteminde Dil Model A��rl�� (DMA) ve Aktif Token Say�s� (ATS) parametrelerini de�i�tirerek yap�lan deneyler yer almaktad�r. Bu �al��ma DM boyutu b�y�d�k�e T�rk�e konu�ma tan�ma ba�ar�m�n�n y�kseldi�ini g�stermektedir. Ancak, DMA ve ATS de�erlerinde yap�lan ayarlamalar�n tan�ma ba�ar�m�na olumlu bir etki yapt�� g�zlemlenememi�tir.

Anahtar Kelimeler: Dil modeli, Ses tan�ma sistemleri, Dil modeli a��rl��, Aktif token say�s�

Behnam Asefisaray, Erhan Meng��o�lu, Murat Hac��mero�lu, Hayri Sever. How does language model size effects speech recognition accuracy for the Turkish language?. Pamukkale Univ Muh Bilim Derg. 2016; 22(2): 100-105

Corresponding Author: Erhan Meng��o�lu, T�rkiye
Manuscript Language: Turkish

TOOLS Full Text PDF Print Download citation RIS EndNote BibTex Medlars Procite Reference Manager Share with email Share Send email to author Similar articles Google Scholar