Survey of part-of-speech tagger for mixed-code Indian and foreign language used in social media

Bhushan Ashokrao Nikam

Abstract


Part-Of-Speech Tagger (POS Tagger) is a tool that scans the text in specific language and allocates chunks of speech to individual word (and another token), such as verb, adjective, nown etc., as more fine-grained POS tags are used in computational applications like 'noun-plural'. Basically, the goal of a POS tagger is to allocate linguistic (mostly grammatical) information to sub-sentential units, called tokens as well as to words and symbols (e.g. punctuation). This paper presents a survey of POS  Tagger used for code-Mixed Indian and Foreign languages. Various methods, procedures, and features required to device POS Tagger for code-mixed foreign languages especially for Indian are studied and observations related to it are reported.


Full Text:

PDF


DOI: http://doi.org/10.11591/ijaas.v8.i4.pp264-268

Refbacks

  • There are currently no refbacks.


International Journal of Advances in Applied Sciences (IJAAS)
p-ISSN 2252-8814, e-ISSN 2722-2594
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.


Web Analytics View IJAAS Stats