Exploit relations between the word letters and their placement in the word for Arabic root extraction


Title: Exploit relations between the word letters and their placement in the word for Arabic root extraction
Authors: Hawas, F. A.
Links: https://bibliotekanauki.pl/articles/305644.pdf
Publication date: 2013
Publisher: Akademia Górniczo-Hutnicza im. Stanisława Staszica w Krakowie. Wydawnictwo AGH
Subjects: rule-based stemmer; word root; suffixes; prefixes; words patterns
Source: Computer Science; 2013, 14 (2); 327-341
1508-2806
2300-7036
Language: English
Rights: CC BY: Creative Commons Attribution 3.0 PL
Content provider: Biblioteka Nauki
: Article


This paper presents a new root-extraction approach for Arabic words. The approach tries to assign a unique root to each Arabic word without relying on a database of word roots, a list of word patterns, or a list of all the prefixes and suffixes of Arabic words. Unlike most Arabic rule-based stemmers, it tries to predict the positions of the root letters one by one, based on rules and relations among the letters of the word and their placement in the word. This paper focuses on two parts of the approach. The first introduces rules to distinguish between the Arabic definite article and the permanent component that may be found in any Arabic word. The second classifies Arabic letters into groups according to their positions in the word. The proposed approach is a system composed of several modules that together extract the word root. The approach has been evaluated on the words of the Holy Quran. The evaluation results suggest that the root-extraction algorithm is promising.
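The abstract does not state the rules themselves, so the following is only a rough sketch of what a rule-based check for the definite article could look like. The function name `split_definite_article`, the constant `MIN_REMAINDER`, and the length rule are assumptions made for illustration; they are not taken from the paper.

```python
# Illustrative sketch only: the length rule below is an assumption,
# not one of the rules proposed in the paper.
DEFINITE_ARTICLE = "ال"  # alef + lam ("al-")
MIN_REMAINDER = 3        # assumed minimum number of letters left after stripping


def split_definite_article(word: str) -> tuple[str, str]:
    """Return (article, remainder).

    The article is "" when the leading alef-lam is treated as a
    permanent part of the word rather than as the definite article.
    """
    if word.startswith(DEFINITE_ARTICLE):
        remainder = word[len(DEFINITE_ARTICLE):]
        # Strip the article only if enough letters remain to hold a root.
        if len(remainder) >= MIN_REMAINDER:
            return DEFINITE_ARTICLE, remainder
    return "", word


if __name__ == "__main__":
    print(split_definite_article("الكتاب"))  # article stripped, "كتاب" remains
    print(split_definite_article("الم"))     # too short, word kept whole
```

A length check alone is not sufficient: a verb such as التقى begins with alef-lam that belongs to the word itself, yet this sketch would still strip it. Cases like this are presumably what the paper's rules for separating the definite article from permanent letters are meant to handle.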
