TRMOR: a finite-state-based morphological analyzer for Turkish
Authors (1)
Asst. Prof. Dr. Ayla KAYABAŞ, Kırşehir Ahi Evran University, Türkiye
Article Type Open Access Original Article (full article published in SSCI, AHCI, SCI, SCI-Exp journals)
Journal Name Turkish Journal of Electrical Engineering and Computer Sciences (Q4)
Journal ISSN 1300-0632 WoS Journal Scopus Journal
Journal Indexed In SCI-Expanded
Article Language Turkish Publication Date 01-2022
Volume / Issue / Pages 27 / 5 / 3837–3851 DOI 10.3906/elk-1902-125
Article Link https://journals.tubitak.gov.tr/elektrik/vol27/iss5/41/
Abstract
Morphological analysis is an important component of natural language processing systems such as spelling correction tools, parsers, machine translation systems, and dictionary tools. In this paper, we present TRMOR, a morphological analyzer for Turkish built with the SFST (Stuttgart Finite-State Transducer) tools. TRMOR can be freely used for academic research (see http://www.cis.uni-muenchen.de/~schmid/tools/SFST/). It covers a large part of Turkish morphology, including inflection, derivation, and some compounding, and it uses morphotactic and morphophonological rules together with a stem lexicon. We describe the morphological structure of Turkish, explain the phonological and morphological rules implemented in TRMOR, evaluate the system, and test it on special cases. The evaluation used 1000 words randomly selected from Wikipedia word lists. We prepared gold-standard morphological analyses for these words because, to our knowledge, no gold-standard segmentation for Turkish morphological analyzers is available for noncommercial purposes. TRMOR achieves 94.12% precision on these 1000 words.
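To make the interplay of a stem lexicon, a morphotactic rule, and a morphophonological rule concrete, the following is a minimal Python sketch of one well-known Turkish pattern: the plural suffix -lAr surfaces as -lar after a back vowel and as -ler after a front vowel. It is not TRMOR's implementation, which is written for SFST. The four-word lexicon, the function names, and the `<N>` and `<pl>` tags are assumptions made for illustration only, and the sketch ignores loanword exceptions to vowel harmony.

```python
# Toy illustration only: NOT TRMOR's rules, lexicon, or tag set.
BACK_VOWELS = set("aıou")
FRONT_VOWELS = set("eiöü")

# Hypothetical four-entry stem lexicon (all nouns, lowercase).
STEM_LEXICON = {"ev", "kitap", "göz", "okul"}


def plural_suffix(stem: str) -> str:
    """Morphophonology: resolve A in -lAr from the stem's last vowel."""
    for ch in reversed(stem):
        if ch in BACK_VOWELS:
            return "lar"
        if ch in FRONT_VOWELS:
            return "ler"
    raise ValueError(f"no vowel found in stem: {stem!r}")


def generate_plural(stem: str) -> str:
    """Morphotactics: a noun stem may be followed by the plural suffix."""
    return stem + plural_suffix(stem)


def analyze(word: str) -> list[str]:
    """Return every analysis of `word` licensed by the toy lexicon."""
    analyses = []
    for stem in sorted(STEM_LEXICON):
        if word == stem:
            analyses.append(f"{stem}<N>")
        elif word == generate_plural(stem):
            analyses.append(f"{stem}<N><pl>")
    return analyses


print(generate_plural("kitap"))  # kitaplar
print(analyze("evler"))          # ['ev<N><pl>']
print(analyze("evlar"))          # [] (violates vowel harmony)
```

A finite-state analyzer compiles rules of this kind into a single transducer that maps surface forms to analyses and back, rather than looping over the lexicon as this sketch does.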
Keywords
Finite-state morphology | Gold standard | Turkish morphology
Citation Counts
Google Scholar 22
Scopus 9
Web of Science 8