SciELO - Scientific Electronic Library Online

 
 issue49Distance Measurement System using Images to Determine the Position of a Sphere using the XBOX Kinect SensorComputing Polynomial Segmentation through Radial Surface Representation author indexsubject indexsearch form
Home Pagealphabetic serial listing  

Services on Demand

Journal

Article

Indicators

Related links

  • Have no similar articlesSimilars in SciELO

Share


Polibits

On-line version ISSN 1870-9044

Abstract

ALONSO-RORIS, Víctor M. et al. Information Extraction in Semantic, Highly-Structured, and Semi-Structured Web Sources. Polibits [online]. 2014, n.49, pp.69-76. ISSN 1870-9044.

The evolution of the Web from the original proposal made in 1989 can be considered one of the most revolutionary technological changes in centuries. During the past 25 years the Web has evolved from a static version to a fully dynamic and interoperable intelligent ecosystem. The amount of data produced during these few decades is enormous. New applications, developed by individual developers or small companies, can take advantage of both services and data already present on the Web. Data, produced by humans and machines, may be available in different formats and through different access interfaces. This paper analyses three different types of data available on the Web and presents mechanisms for accessing and extracting this information. The authors show several applications that leverage extracted information in two areas of research: recommendations of educational resources beyond content and interactive digital TV applications.

Keywords : Information extraction; web data processing; semantic enrichment; data mining; web scraping.

        · text in English

 

Creative Commons License All the contents of this journal, except where otherwise noted, is licensed under a Creative Commons Attribution License