Services on Demand
Journal
Article
Indicators
Cited by SciELO
Access statistics
Related links
Similars in SciELO
Share
Computación y Sistemas
On-line version ISSN 2007-9737Print version ISSN 1405-5546
Abstract
DA CUNHA, Iria; VIVALDI, Jorge; TORRES-MORENO, Juan-Manuel and SIERRA, Gerardo. SIMTEX: An Approach for Detecting and Measuring Textual Similarity based on Discourse and Semantics. Comp. y Sist. [online]. 2014, vol.18, n.3, pp.505-516. ISSN 2007-9737. https://doi.org/10.13053/CyS-18-3-2033.
Nowadays automatic systems for detecting and measuring textual similarity are being developed, in order to apply them to different tasks in the field of Natural Language Processing (NLP). Currently, these systems use surface linguistic features or statistical information. Nowadays, few researchers use deep linguistic information. In this work, we present an algorithm for detecting and measuring textual similarity that takes into account information offered by discourse relations of Rhetorical Structure Theory (RST), and lexical-semantic relations included in EuroWordNet. We apply the algorithm, called SIMTEX, to texts written in Spanish, but the methodology is potentially language-independent.
Keywords : Textual similarity; discourse; semantics; paraphrase.