A New Kernel to use with Discretized Temporal Series

González Abril, Luis; Velasco Morente, Francisco; Ortega Ramírez, Juan Antonio; Cuberos García Vaquero, Francisco Javier


Computación y Sistemas

Online version ISSN 2007-9737; print version ISSN 1405-5546

Comp. y Sist. vol. 11 no. 1, Mexico City, Jul./Sep. 2007

Articles


Un Nuevo Kernel para usar con Series Temporales Discretizadas

Luis González Abril¹, Francisco Velasco Morente¹, Juan Antonio Ortega Ramírez² and Francisco Javier Cuberos García Vaquero³

¹ Department of Applied Economics I, University of Seville (Spain)
e-mail: luisgon@us.es, velasco@us.es

² Department of Computer Science, University of Seville (Spain)
e-mail: ortega@lsi.us.es

³ Planning Department, Radio Televisión de Andalucía, Seville (Spain)
e-mail: fjcuberos@rtea.es

Article received on December 14, 2005; accepted on September 25, 2007

Abstract

This paper proposes a new kernel, in the sense of statistical learning theory, for working with chains of symbols (words) obtained by discretizing continuous features. The exact definition of the discretization is not strictly necessary, but there must always exist either a distance measure or a similarity measure between the symbols of a given alphabet (a set of symbols). The kernel is applied to a set of television audience shares from the seven main television stations in Andalusia (Spain). A comparative study for classification purposes is carried out, and the associated parameter selection is studied. Finally, the kernel has certain implications for the type of similarity considered, which will be studied in future research. The small influence of the λ parameter in identification tasks also remains to be analyzed.

Keywords: Kernels, Discretization, Interval Distance.
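
The abstract only states that the kernel acts on words over an alphabet that carries a distance or similarity between symbols, and that it depends on a parameter λ. The paper's actual kernel is defined in the full text and is not reproduced here. As a rough illustration of the general idea only, the following Python sketch assumes that symbols are integer indices of ordered discretization intervals, that the symbol distance is the absolute index difference, and that two equal-length words are compared position by position. The names `symbol_similarity` and `word_kernel`, and the exponential form, are hypothetical choices made for this example.

```python
import math
from typing import Sequence


def symbol_similarity(a: int, b: int, lam: float) -> float:
    """Similarity between two symbols, assumed to be ordered interval indices.

    Assumption: distance is |a - b| and similarity decays as exp(-lam * distance).
    This is an illustrative choice, not the definition used in the paper.
    """
    return math.exp(-lam * abs(a - b))


def word_kernel(u: Sequence[int], v: Sequence[int], lam: float = 1.0) -> float:
    """Average symbol similarity over aligned positions of two equal-length words."""
    if len(u) != len(v):
        raise ValueError("words must have the same length")
    if len(u) == 0:
        raise ValueError("words must be non-empty")
    total = sum(symbol_similarity(a, b, lam) for a, b in zip(u, v))
    return total / len(u)


# Two hypothetical discretized series over the alphabet {0, 1, 2, 3}.
word_a = [0, 1, 2, 3, 2]
word_b = [0, 2, 2, 3, 1]
value = word_kernel(word_a, word_b, lam=0.5)
```

Under these assumptions a word compared with itself scores 1, and larger values of `lam` penalize mismatched symbols more strongly. A sum over positions of exponential (Laplacian) similarities is positive semidefinite, so a function of this form can be used as a kernel in a support vector machine.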

Resumen

En este artículo, un nuevo kernel (núcleo), procedente de la Teoría del aprendizaje Estadístico, es propuesto para trabajar con cadenas de símbolos obtenidos a través de un proceso de discretización de una variable continua. Aunque para la exacta definición de discretización no es estrictamente necesaria, siempre debe existir una medida de distancia o una medida de similitud entre símbolos en un determinado alfabeto (conjunto de símbolos). Este kernel es aplicado sobre un conjunto de repartos de audiencias en la televisión obtenido de las siete principales cadenas de televisión en Andalucía (España). Una comparativa con objeto de llevar a cabo una clasificación es realizada y la selección de parámetros es estudiada. Finalmente, mencionar que este kernel tiene ciertas implicaciones en el tipo de similaridad considerada las cuales serán estudiadas en futuras investigaciones. La poca influencia del parámetro λ en las tareas de identificación también debe ser analizada.

Palabras clave: Kernels, Discretización, Distancia Intervalar.


Acknowledgements

This work was partly supported by grants PAI–2006/0619 and PAI–2006/0513 awarded by the Junta de Andalucía.
