TY - GEN

T1 - An integer support vector machine

AU - Domm, Maryanne

AU - Engel, Andrew

AU - Pierre-Louis, Péguy

AU - Goldberg, Jeff

PY - 2005/12/1

Y1 - 2005/12/1

AB - Data mining is a technique for discovering patterns and trends in data, and it can be used to build models that predict those patterns and trends. This is particularly useful for data sets that are not amenable to traditional statistical analysis. One particular data mining task is classification: predicting a quantity that can take on only a finite number of values. Support Vector Machines (SVMs) are an important class of binary classifiers. Traditional SVMs use constrained optimization to find a separating hyperplane, and a new data point is classified according to the side of the hyperplane on which it falls. All SVMs try to minimize the number of potential errors the classifier will make by minimizing a sum of distances from the hyperplane. However, the actual task of classification does not place any importance on distance. To model the task more closely, we propose the Integer Support Vector Machine Classifier (ISVM). ISVM uses binary indicator error variables to directly minimize the number of potential errors the classifier can make.

UR - http://www.scopus.com/inward/record.url?scp=33749417877&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33749417877&partnerID=8YFLogxK

U2 - 10.1109/SNPD-SAWN.2005.16

DO - 10.1109/SNPD-SAWN.2005.16

M3 - Conference contribution

AN - SCOPUS:33749417877

SN - 0769522947

SN - 9780769522944

T3 - Proceedings - Sixth Int. Conf. on Softw. Eng., Artificial Intelligence, Netw. and Parallel/Distributed Computing and First ACIS Int. Workshop on Self-Assembling Wireless Netw., SNPD/SAWN 2005

SP - 144

EP - 149

BT - Proceedings - Sixth Int. Conf. on Softw. Eng., Artif. Intelligence, Networking and Parallel/Distributed Computing and First ACIS Int. Workshop on Self-Assembling Wireless Networks, SNPD/SAWN 2005

T2 - 6th International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing and 1st ACIS International Workshop on Self-Assembling Wireless Networks, SNPD/SAWN 2005

Y2 - 23 May 2005 through 25 May 2005

ER -