SciELO - Scientific Electronic Library Online

 



Computación y Sistemas

On-line version ISSN 2007-9737; print version ISSN 1405-5546

Abstract

PATHAK, Amarnath; PAKRAY, Partha and GELBUKH, Alexander. A Formula Embedding Approach to Math Information Retrieval. Comp. y Sist. [online]. 2018, vol.22, n.3, pp.819-833. ISSN 2007-9737. https://doi.org/10.13053/cys-22-3-3015.

Intricate math formulae, which make up much of the content of scientific documents, add to the complexity of scientific document retrieval. Although modifications to conventional indexing and search mechanisms have eased this complexity and shown notable performance, a formula embedding approach to scientific document retrieval appears equally promising. The Formula Embedding Module of the proposed system uses a Bit Position Information Table to transform the math formulae contained in scientific documents into binary formula vectors. Each set bit of a formula vector designates the presence of a specific mathematical entity. A mathematical user query is transformed into a query vector in the same fashion, and the corresponding relevant documents are retrieved. The relevance of a search result is characterized by the extent of similarity between the indexed formula vector and the query vector. Promising performance under moderately constrained conditions supports the viability of the proposed approach.

Keywords: Math information retrieval; formula embedding; math formula search; scientific document retrieval; precision.
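
The embedding idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the contents of `BIT_POSITIONS` (standing in for the Bit Position Information Table), the function names, the sample documents, and the use of Jaccard similarity over set bits are all assumptions made for illustration, since the abstract specifies neither the table contents nor the similarity measure.

```python
# Hypothetical Bit Position Information Table: each mathematical
# entity is assigned one bit position (entries are illustrative only).
BIT_POSITIONS = {"+": 0, "-": 1, "=": 2, "^": 3,
                 "frac": 4, "sqrt": 5, "sum": 6, "int": 7}


def embed(entities):
    """Turn a list of entities found in a formula into a binary vector.

    The vector is stored as a Python int; bit i is set when the entity
    mapped to position i occurs in the formula. Unknown entities are skipped.
    """
    vector = 0
    for entity in entities:
        position = BIT_POSITIONS.get(entity)
        if position is not None:
            vector |= 1 << position
    return vector


def similarity(a, b):
    """Jaccard similarity of the set bits (an assumed measure)."""
    union = a | b
    if union == 0:
        return 0.0
    return bin(a & b).count("1") / bin(union).count("1")


def search(query_entities, index):
    """Rank indexed formula vectors by similarity to the query vector."""
    query = embed(query_entities)
    scored = [(similarity(query, vec), doc) for doc, vec in index.items()]
    # Highest similarity first; ties broken by document id.
    return sorted(scored, key=lambda pair: (-pair[0], pair[1]))


# Illustrative index of two made-up documents.
index = {
    "doc_a": embed(["frac", "+", "="]),
    "doc_b": embed(["sum", "^", "="]),
}
results = search(["frac", "="], index)
```

Here the query shares two of three distinct entities with `doc_a` and one of four with `doc_b`, so `doc_a` ranks first. Storing each vector as an integer keeps the comparison to a few bitwise operations per indexed formula.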
