SciELO - Scientific Electronic Library Online

 
vol.17 issue2Detecting Salient Events in Large Corpora by a Combination of NLP and Data Mining TechniquesCorpus-based Sentence Deletion and Split Decisions for Spanish Text Simplification author indexsubject indexsearch form
Home Pagealphabetic serial listing  

Services on Demand

Journal

Article

Indicators

Related links

  • Have no similar articlesSimilars in SciELO

Share


Computación y Sistemas

On-line version ISSN 2007-9737Print version ISSN 1405-5546

Abstract

QUINIOU, Solen; CELLIER, Peggy; CHARNOIS, Thierry  and  LEGALLOIS, Dominique. Graph Mining under Linguistic Constraints for Exploring Large Texts. Comp. y Sist. [online]. 2013, vol.17, n.2, pp.239-250. ISSN 2007-9737.

In this paper, we propose an approach to explore large texts by highlighting coherent sub-parts. The exploration method relies on a graph representation of the text according to Hoey's linguistic model which allows the selection and the binding of adjacent and non-adjacent sentences. The main contribution of our work consists in proposing a method based on both Hoey's linguistic model and a special graph mining technique, called CoHoP mining, to extract coherent sub-parts of the graph representation of the text. We have conducted some experiments on several English texts showing the interest of the proposed approach.

Keywords : Text coherence; graph representation; graph mining; Hoey's linguistic model.

        · abstract in Spanish     · text in English     · English ( pdf )

 

Creative Commons License All the contents of this journal, except where otherwise noted, is licensed under a Creative Commons Attribution License