SciELO - Scientific Electronic Library Online

 
vol.22 número4Psychological Attachment Style Determination Using a Word Space ModelBuilding Resources For Vietnamese Clinical Text Processing índice de autoresíndice de materiabúsqueda de artículos
Home Pagelista alfabética de revistas  

Servicios Personalizados

Revista

Articulo

Indicadores

Links relacionados

  • No hay artículos similaresSimilares en SciELO

Compartir


Computación y Sistemas

versión On-line ISSN 2007-9737versión impresa ISSN 1405-5546

Resumen

SHARMA, Vijay Kumar  y  MITTAL, Namita. An Improvement in Statistical Machine Translation in Perspective of Hindi-English Cross-Lingual Information Retrieval. Comp. y Sist. [online]. 2018, vol.22, n.4, pp.1277-1285.  Epub 10-Feb-2021. ISSN 2007-9737.  https://doi.org/10.13053/cys-22-4-3069.

Cross-Lingual Information Retrieval (CLIR) enables a user to query to the different language target documents. CLIR incorporates a Machine Translation (MT) technique which is in growing state for Indian languages due to the unavailability of enough resources. In this paper, a Statistical Machine Translation (SMT) system is trained on two parallel corpora separately. A large English language corpus is used for language modeling in SMT. Experiments are evaluated by using BLEU score, further, these experimental setups are used to translate the Hindi language queries for the experimental analysis of Hindi-English CLIR. Since SMT does not deal with morphological variants while the proposed Translation Induction Algorithm (TIA) deals with that, therefore, TIA outperforms the SMT systems in perspective of CLIR.

Palabras llave : Cross-lingual information retrieval; parallel corpus; statistical machine translation; morphological variants.

        · texto en Inglés     · Inglés ( pdf )