SciELO - Scientific Electronic Library Online

 
vol.22 número4Feature Extraction for Token Based Word Alignment for Question Answering SystemsThe Forest Lion and the Bull: Morphosyntactic Annotation of the Panchatantra índice de autoresíndice de materiabúsqueda de artículos
Home Pagelista alfabética de revistas  

Servicios Personalizados

Revista

Articulo

Indicadores

Links relacionados

  • No hay artículos similaresSimilares en SciELO

Compartir


Computación y Sistemas

versión On-line ISSN 2007-9737versión impresa ISSN 1405-5546

Resumen

RIF’AT, Muhammad; MAHENDRA, Rahmad; BUDI, Indra  y  WIBOWO, Haryo Akbarianto. Towards Product Attributes Extraction in Indonesian e-Commerce Platform. Comp. y Sist. [online]. 2018, vol.22, n.4, pp.1367-1375.  Epub 10-Feb-2021. ISSN 2007-9737.  https://doi.org/10.13053/cys-22-4-3073.

Product attribute extraction is an important task in e-commerce domain. Extracting pairs of attribute label and value from free-text product descriptions can be useful for many tasks, such as product matching, product categorization, faceted product search, and product recommendation. In this paper, we present a study of attribute extraction from Indonesian e-commerce product titles. We annotate 1,721 product titles with 16 attribute labels. We apply supervised learning technique using CRF algorithm. We propose combination of lexical, word embedding, and dictionary features to learn the attribute using joint extraction model. Our model achieves F1-measure 47.30% and 68.49% respectively for full match and partial match evaluation. Based on the experiment, we find that doing attributes extraction on more various number and diverse attributes simultaneously does not necessarily give worse result compared to extraction on less number of attributes.

Palabras llave : Attributes extraction; e-commerce; product title; named entity recognition; Indonesian language.

        · texto en Inglés     · Inglés ( pdf )