Binary Separation and Training Support Vector Machines

R. Fletcher; ZANGHIRATI, Gaetano

doi:10.1017/S0962492910000024

We introduce basic ideas of binary separation by a linear hyperplane, which is a technique exploited in the Support Vector Machine (SVM) concept. This is a decision making tool for pattern recognition and related problems. We describe a fundamental Standard Problem (SP) and show how this is used in most existing research to develop a dual-based algorithm for its solution. This algorithm is shown to be deficient in certain aspects, and we develop a new primal-based SQP-like algorithm, which has some interesting features. Most practical SVM problems are not adequately handled by a linear hyperplane. We describe the technique of Nonlinear SVM which enables a nonlinear separating surface to be computed and we propose a new primal algorithm based on the use of low-rank Choleski factors. It may be however that exact separation is not desirable due to the presence of uncertain or mislabelled data. Dealing with this situation is the main challenge in developing suitable algorithms. Existing dual-based algorithms use the idea of L1 penalties, which has merit. We suggest how penalties can be incorporated into a primal-based algorithm. Another aspect of practical SVM problems is often the huge size of the data set, which poses severe challenges both for software package development and for control of ill-conditioning. We illustrate some of these issues with numerical experiments on a range of problems.