Optimal and Non-Optimal Parallel Implementations of the Sequentiall Minimal Optimization Algorithm for Support Vector Machine Training

Benjamin Castaneda, Juan C. Cockburn, Muhammad Shaaban

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

Resumen

Support Vector Machines (SVMs) are supervised learning systems that have gained wide acceptance among the pattern recognition community. Learning is based on structural risk minimization over a training set and leads to a quadratic programming problem. Due to the sample size these optimization problems are very large and training remains one of the most computationally expensive stages in Support Vector Machine design. This paper addresses this problem by exploring different approaches to parallel training. Several algorithms are developed and evaluated including a non-optimal approach for parallel training based on the unbiased version of Piatt's Sequential Minimal Optimization (SMO) algorithm, an improvement to a previous biased non-optimal parallel SMO, and an optimal solution combining SMO with the Chunking approach. Experimental results show that non-optimal solutions can achieve a speed-up of 0(N2), according to the number of processors used, with a compromise in the increment of the number of Support Vectors and a decrement in accuracy. The SMO - Chunking optimal solution presents a much lesser speedup, which depends on the number of support vectors vs. total number of samples ratio.

Idioma originalInglés
Título de la publicación alojada17th ISCA International Conference on Parallel and Distributed Computing Systems 2004, PDCS 2004
EditoresDavid A. Bader, Ashfaq A. Khokhar
EditorialInternational Society for Computers and Their Applications (ISCA)
Páginas21-26
Número de páginas6
ISBN (versión digital)9781618398185
EstadoPublicada - 2004
Publicado de forma externa
Evento17th International Conference on Parallel and Distributed Computing Systems, PDCS 2004 - San Francisco, Estados Unidos
Duración: 15 set. 200417 set. 2004

Serie de la publicación

Nombre17th ISCA International Conference on Parallel and Distributed Computing Systems 2004, PDCS 2004

Conferencia

Conferencia17th International Conference on Parallel and Distributed Computing Systems, PDCS 2004
País/TerritorioEstados Unidos
CiudadSan Francisco
Período15/09/0417/09/04

Huella

Profundice en los temas de investigación de 'Optimal and Non-Optimal Parallel Implementations of the Sequentiall Minimal Optimization Algorithm for Support Vector Machine Training'. En conjunto forman una huella única.

Citar esto