SciELO - Scientific Electronic Library Online

 
vol.20 número3Using a Heterogeneous Linguistic Network for Word Sense Induction and DisambiguationVisualization of Similarity Measures for Binary Data and 2 x 2 Tables índice de autoresíndice de assuntospesquisa de artigos
Home Pagelista alfabética de periódicos  

Serviços Personalizados

Journal

Artigo

Indicadores

Links relacionados

  • Não possue artigos similaresSimilares em SciELO

Compartilhar


Computación y Sistemas

versão On-line ISSN 2007-9737versão impressa ISSN 1405-5546

Resumo

KOLESNIKOVA, Olga. Survey of Word Co-occurrence Measures for Collocation Detection. Comp. y Sist. [online]. 2016, vol.20, n.3, pp.327-344. ISSN 2007-9737.  https://doi.org/10.13053/cys-20-3-2456.

This paper presents a detailed survey of word co-occurrence measures used in natural language processing. Word co-occurrence information is vital for accurate computational text treatment, it is important to distinguish words which can combine freely with other words from other words whose preferences to generate phrases are restricted. The latter words together with their typical co-occurring companions are called collocations. To detect collocations, many word co-occurrence measures, also called association measures, are used to determine a high degree of cohesion between words in collocations as opposed to a low degree of cohesion in free word combinations. We describe such association measures grouping them in classes depending on approaches and mathematical models used to formalize word co-occurrence.

Palavras-chave : Word co-occurrence measure; association measure; collocation; statistical language model; rule-based language model; hybrid approach to model word co-occurrence.

        · texto em Inglês     · Inglês ( pdf )