TerseSVM: A scalable approach for learning compact models in large-scale classification

Rohit Babbar, Krikamol Maundet, Bernhard Schölkopf

Research output: Chapter or section in a book/report/conference proceedingChapter in a published conference proceeding

4 Citations (SciVal)

Abstract

For large-scale multi-class classification problems, consisting of tens of thousand target categories, recent works have emphasized the need to store billions of parameters. For instance, the classical /2-norm regularization employed by a state-of-the-art method results in the model size of 17GB for a training set whose size is only 129MB. To the contrary, by using a mixed-norm regularization approach, we show that around 99.5% of the stored parameters is dispensable noise. Using this strategy, we can extract the information relevant for classification, which is constituted in remaining 0.5% of the parameters, and hence demonstrate drastic reduction in model sizes. Furthermore, the proposed method leads to improvement in generalization performance compared to state-of-the-art methods, especially for under-represented categories. Lastly, our method enjoys easy parallelization, and scales well to tens of thousand target categories.

Original languageEnglish
Title of host publication16th SIAM International Conference on Data Mining 2016, SDM 2016
EditorsSanjay Chawla Venkatasubramanian, Wagner Meira
PublisherSociety for Industrial and Applied Mathematics Publications
Pages234-242
Number of pages9
ISBN (Electronic)9781611974348
DOIs
Publication statusE-pub ahead of print - 11 Aug 2016
Event16th SIAM International Conference on Data Mining 2016, SDM 2016 - Miami, USA United States
Duration: 5 May 20167 May 2016

Publication series

Name16th SIAM International Conference on Data Mining 2016, SDM 2016

Conference

Conference16th SIAM International Conference on Data Mining 2016, SDM 2016
Country/TerritoryUSA United States
CityMiami
Period5/05/167/05/16

Bibliographical note

Publisher Copyright:
Copyright © by SIAM.

ASJC Scopus subject areas

  • Computer Science Applications
  • Software

Fingerprint

Dive into the research topics of 'TerseSVM: A scalable approach for learning compact models in large-scale classification'. Together they form a unique fingerprint.

Cite this