Design and implementation of an effective web-based hybrid stemmer for Odia language

Gouranga Charan Jena, Siddharth Swarup Rautaray

Abstract


A stemmer reduces an inflected or derived word to its stem by removing the suffixes or prefixes attached to the word. In an information retrieval system, stemming decreases the number of distinct terms and can improve the overall performance of the retrieval process. Stemming is not equivalent to morphological analysis; it only finds the stem of a word. Various stemming techniques exist. This paper proposes a new web-based stemmer for the Odia language, named “Mula”. It uses a hybrid approach, that is, a combination of the brute-force and suffix-removal approaches. The new stemmer is both computationally fast and domain independent. The results are favourable and indicate that the proposed stemmer can be used effectively in Odia information retrieval systems. The stemmer also handles the problems of over-stemming and under-stemming to some extent.
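To illustrate how a hybrid of brute-force and suffix-removal stemming can work in general, the following minimal Python sketch first looks a word up in a table of known forms and only then falls back to stripping a suffix. It is not the Mula implementation: the names `LOOKUP`, `SUFFIXES`, `MIN_STEM_LEN`, and `stem` are hypothetical, the romanized entries are illustrative assumptions rather than data from the paper, and treating "brute force" as a table lookup is an assumption about the usual meaning of the term.

```python
# Illustrative data only. These romanized entries are assumptions made for
# this sketch; they are not the lexicon or suffix list used by the paper.
LOOKUP = {"pilamane": "pila"}      # brute-force table: inflected form -> stem
SUFFIXES = ["mane", "ku", "re"]    # candidate suffixes for the fallback step
MIN_STEM_LEN = 3                   # guard that limits over-stemming


def stem(word: str) -> str:
    """Return a stem using table lookup first, then suffix removal."""
    word = word.strip()

    # Step 1 (brute force): exact lookup of a known inflected form.
    if word in LOOKUP:
        return LOOKUP[word]

    # Step 2 (suffix removal): try the longest suffix first and keep the
    # result only if the remaining stem is not too short.
    for suffix in sorted(SUFFIXES, key=len, reverse=True):
        if word.endswith(suffix) and len(word) - len(suffix) >= MIN_STEM_LEN:
            return word[: -len(suffix)]

    # No rule applied: return the word unchanged.
    return word
```

In this sketch the lookup step handles forms that rules would get wrong, while the minimum stem length is one simple way to reduce over-stemming of short words; a word that matches neither step is returned unchanged, which corresponds to under-stemming when the word was in fact inflected.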




DOI: http://doi.org/10.11591/ijaas.v9.i1.pp12-19



International Journal of Advances in Applied Sciences (IJAAS)
p-ISSN 2252-8814, e-ISSN 2722-2594
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

