SciELO - Scientific Electronic Library Online

 
vol.15 issue2Evaluating n-gram Models for a Bilingual Word Sense Disambiguation TaskPattern Recognition for the Identification of Learning Styles on Educational Mobile and Social Network Tools author indexsubject indexsearch form
Home Pagealphabetic serial listing  

Services on Demand

Journal

Article

Indicators

Related links

  • Have no similar articlesSimilars in SciELO

Share


Computación y Sistemas

On-line version ISSN 2007-9737Print version ISSN 1405-5546

Abstract

DAS, Dipankar  and  BANDYOPADHYAY, Sivaji. Document Level Emotion Tagging: Machine Learning and Resource Based Approach. Comp. y Sist. [online]. 2011, vol.15, n.2, pp.221-234. ISSN 2007-9737.

The present task involves the identification of emotions from Bengali blog documents using two separate approaches. The first one is a machine learning approach that accumulates document level information from sentences obtained from word level granular detail whereas the second one is a resource based approach that considers the Bengali WordNet Affect, the word level Bengali affective lexical resource. In the first approach, the Support Vector Machine (SVM) classifier is employed to perform the word level classification. Sense weight based average scoring technique determines the sentential emotion scores based on the word level emotion tagged constituents. The cumulative summation of sentential emotion scores is assigned to each document considering the combinations of various heuristic features. The second one implements a majority based approach to classify a given document considering the Bengali WordNet Affect lists. Instead of assigning a single emotion tag to a document, in both approaches, the best two emotion tags are assigned to each document according to the ordered emotion scores obtained. By applying the best feature combination acquired from the development set, the evaluation of 110 test documents yields the average F-Scores of 59.50% and 51.07% for the two approaches respectively with respect to all emotion classes.

Keywords : Natural language processing; computational linguistics; text; blog; document; WordNet Affect; sense weight score; CRF; SVM; emotion tagging; heuristic features.

        · abstract in Spanish     · text in English

 

Creative Commons License All the contents of this journal, except where otherwise noted, is licensed under a Creative Commons Attribution License