SciELO - Scientific Electronic Library Online

 
 número47N-gram Parsing for Jointly Training a Discriminative Constituency ParserExploration on Effectiveness and Efficiency of Similar Sentence Matching índice de autoresíndice de materiabúsqueda de artículos
Home Pagelista alfabética de revistas  

Servicios Personalizados

Revista

Articulo

Indicadores

Links relacionados

  • No hay artículos similaresSimilares en SciELO

Compartir


Polibits

versión On-line ISSN 1870-9044

Resumen

FADAEE, Marzieh; GHADER, Hamidreza; FAILI, Heshaam  y  SHAKERY, Azadeh. Automatic WordNet Construction Using Markov Chain Monte Carlo. Polibits [online]. 2013, n.47, pp.13-22. ISSN 1870-9044.

WordNet is used extensively as a major lexical resource in information retrieval tasks. However, the qualities of existing Persian WordNets are far from perfect. They are either constructed manually which limits the coverage of Persian words, or automatically which results in unsatisfactory precision. This paper presents a fully-automated approach for constructing a Persian WordNet: A Bayesian Model with Markov chain Monte Carlo (MCMC) estimation. We model the problem of constructing a Persian WordNet by estimating the probability of assigning senses (synsets) to Persian words. By applying MCMC techniques in estimating these probabilities, we integrate prior knowledge in the estimation and use the expected value of generated samples to give the final estimates. This ensures great performance improvement comparing with Maximum-Likelihood and Expectation-Maximization methods. Our acquired WordNet has a precision of 90.46% which is a considerable improvement in comparison with automatically-built WordNets in Persian.

Palabras llave : Semantic network; WordNet; ontology; Bayesian inference; Markov chain Monte Carlo; Persian.

        · texto en Inglés

 

Creative Commons License Todo el contenido de esta revista, excepto dónde está identificado, está bajo una Licencia Creative Commons