Statistical properties and adaptive tuning of support vector machines

Yi Lin, Grace Wahba, Hao Zhang, Yoonkyung Lee

Research output: Contribution to journal › Article

18 Scopus citations

Abstract

In this paper we consider the statistical aspects of support vector machines (SVMs) in the classification context and describe an approach to adaptively tuning the smoothing parameter(s) in the SVMs. The relation between the Bayes rule of classification and the SVMs is discussed, shedding light on why the SVMs work well. This relation also reveals that the misclassification rate of the SVMs is closely related to the generalized comparative Kullback-Leibler distance (GCKL) proposed in Wahba (1999; in Schölkopf, Burges, & Smola (Eds.), Advances in Kernel Methods: Support Vector Learning, Cambridge, MA: MIT Press). The adaptive tuning is based on the generalized approximate cross validation (GACV), which is an easily computable proxy of the GCKL. The results are generalized to the unbalanced case, where the class proportions in the training set differ from those in the general population and the costs of the two kinds of misclassification error differ. The main results in this paper have been obtained in several places elsewhere; here we take the opportunity to organize them in one place and note how they fit together and reinforce one another. The work reviewed is mostly that of the authors.
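
The link between the Bayes rule and the SVM mentioned in the abstract can be illustrated numerically. The sketch below is not taken from the paper. It assumes the standard hinge loss (1 - yf)_+ with labels coded as +1 and -1, and the function names, grid range, and grid size are illustrative choices. At a point where P(Y = +1) = p, it grid-searches for the value f that minimizes the expected hinge loss and compares it with the Bayes rule sign(p - 1/2).

```python
def expected_hinge_loss(f, p):
    """Expected hinge loss E[(1 - Y f)_+] at a point where P(Y = +1) = p."""
    return p * max(0.0, 1.0 - f) + (1.0 - p) * max(0.0, 1.0 + f)


def hinge_minimizer(p, lo=-2.0, hi=2.0, grid_size=401):
    """Grid-search f in [lo, hi] for the minimizer of the expected hinge loss.

    The grid range and resolution are illustrative assumptions.
    """
    candidates = [lo + (hi - lo) * i / (grid_size - 1) for i in range(grid_size)]
    return min(candidates, key=lambda f: expected_hinge_loss(f, p))


def bayes_rule(p):
    """Bayes classification rule under equal misclassification costs (p != 1/2)."""
    return 1 if p > 0.5 else -1


if __name__ == "__main__":
    for p in (0.1, 0.3, 0.7, 0.9):
        print(p, hinge_minimizer(p), bayes_rule(p))
```

For each p other than 1/2, the minimizing value lands on the Bayes label itself rather than on an estimate of p, which is one way to read the abstract's remark about why SVMs work well for classification.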

Original language: English (US)
Pages (from-to): 115-136
Number of pages: 22
Journal: Machine Learning
Volume: 48
Issue number: 1-3
DOIs
State: Published - Jan 1 2002
Externally published: Yes

Keywords

  • Bayes rule
  • Classification
  • GACV
  • GCKL
  • Support vector machine

ASJC Scopus subject areas

  • Software
  • Artificial Intelligence
