SciELO - Scientific Electronic Library Online

 


Computación y Sistemas

Online version ISSN 2007-9737; print version ISSN 1405-5546

Abstract

DA CUNHA, Iria; VIVALDI, Jorge; TORRES-MORENO, Juan-Manuel and SIERRA, Gerardo. SIMTEX: An Approach for Detecting and Measuring Textual Similarity based on Discourse and Semantics. Comp. y Sist. [online]. 2014, vol.18, n.3, pp.505-516. ISSN 2007-9737. https://doi.org/10.13053/CyS-18-3-2033.

Automatic systems for detecting and measuring textual similarity are being developed for a range of tasks in Natural Language Processing (NLP). Most current systems rely on surface linguistic features or statistical information; few use deep linguistic information. In this work, we present an algorithm for detecting and measuring textual similarity that takes into account the discourse relations of Rhetorical Structure Theory (RST) and the lexical-semantic relations included in EuroWordNet. We apply the algorithm, called SIMTEX, to texts written in Spanish, but the methodology is potentially language-independent.

Keywords: Textual similarity; discourse; semantics; paraphrase.


 

All content of this journal, except where otherwise noted, is licensed under a Creative Commons License.