SciELO - Scientific Electronic Library Online

 
Polibits

On-line version ISSN 1870-9044

Abstract

DAOUD, Maher and BOITET, Christian. Methods for Handling Spontaneous E-commerce Arabic SMS: CATS, an Operational Proof of Concept. Polibits [online]. 2008, n.37, pp.31-41. ISSN 1870-9044.

The purpose of this paper is to show that it is both necessary and possible to build (multilingual) natural-language-based e-commerce systems that combine sublanguage analysis with content-oriented methods. We argue that analyzing the sublanguage and integrating content-oriented methods increases the accuracy and robustness of processing. To verify this assumption, we built an experimental system, CATS, as a proof of concept. CATS is an SMS-based platform for buying and selling through classified ads. To analyze the sublanguage, we first used a web-based corpus to build the basic system. A content representation language is defined to capture the meaning of a classified ad post. The semantic grammars for content extraction are written in EnCo. Response generation relies on semantic matching between "looking for" and "sell" posts, together with reasoning, and can handle "no answer" situations. CATS is currently deployed in Jordan by Fastlink (the largest mobile operator). Testing the content extraction component on real, noisy free text shows a 90% F-measure.
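The matching step described in the abstract can be illustrated with a small sketch. This is not the CATS implementation: the `Post` structure, its field names, and the fallback strategy for "no answer" situations are all assumptions made for illustration, and the paper's own content representation language and EnCo grammars are not reproduced here.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Post:
    """Hypothetical structured form of a classified ad after content extraction."""
    kind: str                                       # "sell" or "looking_for"
    category: str                                   # e.g. "car", "apartment"
    attributes: dict = field(default_factory=dict)  # e.g. {"make": "toyota"}
    price: Optional[int] = None                     # asking price, or buyer's maximum


def is_match(request: Post, offer: Post) -> bool:
    """A 'looking for' post matches a 'sell' post when category, every
    requested attribute, and the price limit (if both are given) agree."""
    if request.kind != "looking_for" or offer.kind != "sell":
        return False
    if request.category != offer.category:
        return False
    for key, wanted in request.attributes.items():
        if offer.attributes.get(key) != wanted:
            return False
    if request.price is not None and offer.price is not None:
        return offer.price <= request.price
    return True


def respond(request: Post, offers: list) -> tuple:
    """Return exact matches if any; otherwise relax to same-category offers
    (an assumed fallback); otherwise report that there is no answer."""
    exact = [o for o in offers if is_match(request, o)]
    if exact:
        return "match", exact
    relaxed = [o for o in offers
               if o.kind == "sell" and o.category == request.category]
    if relaxed:
        return "similar", relaxed
    return "no_answer", []
```

Under these assumptions, a request for a Toyota priced up to 10000 would return the matching Toyota offers, a request for a make nobody is selling would fall back to other cars, and a request in a category with no offers would yield `"no_answer"`.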

Keywords: Spontaneous NL interface; SMS services; sublanguages; content extraction; classified ads; Arabic processing.


 

All the contents of this journal, except where otherwise noted, are licensed under a Creative Commons Attribution License.