SciELO - Scientific Electronic Library Online

 
 issue49MultiSearchBP: Environment for Search and Clustering of Business Process ModelsA Proposal to Incorporate More Semantics from Models into Generated Code author indexsubject indexsearch form
Home Pagealphabetic serial listing  

Services on Demand

Journal

Article

Indicators

Related links

  • Have no similar articlesSimilars in SciELO

Share


Polibits

On-line version ISSN 1870-9044

Abstract

SCHNITZER, Steffen; SCHMIDT, Sebastian; RENSING, Christoph  and  HARRIEHAUSEN-MIIHLBAUER, Bettina. Combining Active and Ensemble Learning for Efficient Classification of Web Documents. Polibits [online]. 2014, n.49, pp.39-46. ISSN 1870-9044.

Classification of text remains a challenge. Most machine learning based approaches require many manually annotated training instances for a reasonable accuracy. In this article we present an approach that minimizes the human annotation effort by interactively incorporating human annotators into the training process via active learning of an ensemble learner. By passing only ambiguous instances to the human annotators the effort is reduced while maintaining a very good accuracy. Since the feedback is only used to train an additional classifier and not for re-training the whole ensemble, the computational complexity is kept relatively low.

Keywords : Text classification; active learning; user feedback; ensemble learning.

        · text in English

 

Creative Commons License All the contents of this journal, except where otherwise noted, is licensed under a Creative Commons Attribution License