SciELO - Scientific Electronic Library Online

 
 número38Natural Language Syntax Description using Generative Dependency GrammarModeling a Quite Different Machine Translation using Lexical Conceptual Structure índice de autoresíndice de assuntospesquisa de artigos
Home Pagelista alfabética de periódicos  

Serviços Personalizados

Journal

Artigo

Indicadores

Links relacionados

  • Não possue artigos similaresSimilares em SciELO

Compartilhar


Polibits

versão On-line ISSN 1870-9044

Resumo

LAKSHMANA PANDIAN, S.  e  GEETHA, T. V.. Morpheme based Language Model for Tamil Part-of-Speech Tagging. Polibits [online]. 2008, n.38, pp.19-25. ISSN 1870-9044.

The paper describes a Tamil Part of Speech (POS) tagging using a corpus-based approach by formulating a Language Model using morpheme components of words. Rule based tagging, Markov model taggers, Hidden Markov Model taggers and transformation-based learning tagger are some of the methods available for part of speech tagging. In this paper, we present a language model based on the information of the stem type, last morpheme, and previous to the last morpheme part of the word for categorizing its part of speech. For estimating the contribution factors of the model, we follow generalized iterative scaling technique. Presented model has the overall F-measure of 96%.

Palavras-chave : Bayesian learning; language model; morpheme components; generalized iterative scaling.

        · texto em Inglês

 

Creative Commons License Todo o conteúdo deste periódico, exceto onde está identificado, está licenciado sob uma Licença Creative Commons