1999 MakingLargeScaleSVMLearningPractical
- (Joachims, 1999a) ⇒ Thorsten Joachims. (1999). “Making Large-Scale SVM Learning Practical.” In: (Schölkopf et al., 1999).
Subject Headings: SVMlight, SVM Learning Algorithm.
Notes
- Website: http://svmlight.joachims.org/
Cited By
2004
- (Hastie et al., 2004) ⇒ Trevor Hastie, Saharon Rosset, Robert Tibshirani, and Ji Zhu. (2004). “The Entire Regularization Path for the Support Vector Machine.” In: The Journal of Machine Learning Research, 5.
- QUOTE: Software packages, such as the widely used SVMlight (Joachims, 1999), provide default settings for C, which are then used without much further exploration. A recent introductory document (Hsu et al., 2003) supporting the LIBSVM package does encourage grid search for C.
2003
- (Blei, Ng & Jordan, 2003) ⇒ David M. Blei, Andrew Y. Ng , and Michael I. Jordan. (2003). “Latent Dirichlet Allocation.” In: The Journal of Machine Learning Research, 3.
2001
- (Chang & Lin, 2001) ⇒ Chih-Chung Chang, and Chih-Jen Lin. (2001). “LIBSVM: a library for support vector machines."
Quotes
Abstract
Training a support vector machine (SVM) leads to a quadratic optimization problem with bound constraints and one linear equality constraint. Despite the fact that this type of problem is well understood, there are many issues to be considered in designing an SVM learner. In particular, for large learning tasks with many training examples, o -the-shelf optimization techniques for general quadratic programs quickly become intractable in their memory and time requirements. SVMlight is an implementation of an SVM learner which addresses the problem of large tasks. This chapter presents algorithmic and computational results developed for SVMlightV2.0, which make large-scale SVM training more practical. The results give guidelines for the application of SVMs to large domains.
Table of Contents
11.1 Introduction 11.2 General Decomposition Algorithm 11.3 Selecting a Good Working Set 11.3.1 Convergence 11.3.2 How to Solve OP3 11.4 Shrinking: Reducing the Size of OP1 11.5 Efficient Implementation 11.5.1 Termination Criteria 11.5.2 Computing the Gradient and the Termination Criteria Effici ently 11.5.3 Computational Resources Needed in Each Iteration 11.5.4 Caching Kernel Evaluations 11.5.5 How to Solve OP2 (QP Subproblems) 11.6 Related Work 11.7 Experiments 11.7.1 How Does Training Time Scale with the Number of Training Examples? 11.7.1.1 Income Prediction 11.7.1.2 Classifying Web Pages 11.7.1.3 Ohsumed Data Set 11.7.1.4 Dectecting Faces in Images 11.7.2 What Is the Influence of the Working Set Selection Strateg y? 11.7.3 What Is the Influence of Caching? 11.7.4 What Is the Influence of Shrinking? 11.8 Conclusions
References
- Bernhard E. Boser, Isabelle M. Guyon, and Vladimir N. Vapnik. A training algorithm for optimal margin classifiers. In D. Haussler, editor, Proceedings of the 5th Annual ACM Workshop on Computational Learning Theory, pages 144{152, Pittsburgh, PA, July (1992). ACM Press.
- P. E. Gill, W. Murray, and M. H. Wright. Practical Optimization. Academic Press, 1981.
- Thorsten Joachims. Text categorization with support vector machines. In European Conference on Machine Learning (ECML), 1998.
- E. Osuna, R. Freund, and F. Girosi. Support vector machines: Training and applications. A.I. Memo (in press), MIT A. I. Lab., 1996.
- E. Osuna, R. Freund, and F. Girosi. An improved training algorithm for support vector machines. In J. Principe, L. Gile, N. Morgan, and E. Wilson, editors, Neural Networks for Signal Processing VII | Proceedings of the 1997 IEEE Workshop, pages 276 { 285, New York, 1997a. IEEE.
- E. Osuna, R. Freund, and F. Girosi. Training support vector machines: An application to face detection. In, editor, Proceedings CVPR'97,, 1997b. .
- J. Platt. Sequential minimal optimization: A fast algorithm for training support vector machines. Technical Report MSR-TR-98-14, Microsoft Research, 1998.
- R. Vanderbei. Loqo: An interior point code for quadratic programming. Technical Report SOR 94-15, Princeton University, 1994.
- Vladimir N. Vapnik. The Nature of Statistical Learning Theory. Springer Verlag, New York, 1995.
- J. Werner. Optimization - Theory and Applications. Vieweg, 1984.
- G. Zoutendijk. Methods of Feasible Directions: a Study in Linear and Non-linear Programming. Elsevier, 1970.
,
Author | volume | Date Value | title | type | journal | titleUrl | doi | note | year | |
---|---|---|---|---|---|---|---|---|---|---|
1999 MakingLargeScaleSVMLearningPractical | Thorsten Joachims | Making Large-Scale SVM Learning Practical | http://www.joachims.org/publications/joachims 99a.pdf | 1999 |