DiSMEC - Distributed sparse machines for extreme multi-label classification

Rohit Babbar, Bernhard Schölkopf

Research output: Chapter or section in a book/report/conference proceedingChapter in a published conference proceeding

190 Citations (SciVal)

Abstract

Extreme multi-label classification refers to supervised multi-label learning involving hundreds of thousands or even millions of labels. Datasets in extreme classification exhibit fit to power-law distribution, i.e. a large fraction of labels have very few positive instances in the data distribution. Most state-of-the-art approaches for extreme multi-label classification attempt to capture correlation among labels by embedding the label matrix to a lowdimensional linear sub-space. However, in the presence of powerlaw distributed extremely large and diverse label spaces, structural assumptions such as low rank can be easily violated. In this work, we present DiSMEC, which is a large-scale distributed framework for learning one-versus-rest linear classifiers coupled with explicit capacity control to control model size. Unlike most state-of-the-art methods, DiSMEC does not make any low rank assumptions on the label matrix. Using double layer of parallelization, DiSMEC can learn classifiers for datasets consisting hundreds of thousands labels within few hours. The explicit capacity control mechanism filters out spurious parameters which keep the model compact in size, without losing prediction accuracy. We conduct extensive empirical evaluation on publicly available real-world datasets consisting upto 670,000 labels. We compare DiSMEC with recent state-of-the-art approaches, including - SLEEC which is a leading approach for learning sparse local embeddings, and FastXML which is a tree-based approach optimizing ranking based loss function. On some of the datasets, DiSMEC can significantly boost prediction accuracies - 10% better compared to SLECC and 15% better compared to FastXML, in absolute terms.

Original languageEnglish
Title of host publicationWSDM 2017 - Proceedings of the 10th ACM International Conference on Web Search and Data Mining
PublisherAssociation for Computing Machinery
Pages721-729
Number of pages9
ISBN (Electronic)9781450346757
DOIs
Publication statusPublished - 2 Feb 2017
Event10th ACM International Conference on Web Search and Data Mining, WSDM 2017 - Cambridge, UK United Kingdom
Duration: 6 Feb 201710 Feb 2017

Publication series

NameWSDM 2017 - Proceedings of the 10th ACM International Conference on Web Search and Data Mining

Conference

Conference10th ACM International Conference on Web Search and Data Mining, WSDM 2017
Country/TerritoryUK United Kingdom
CityCambridge
Period6/02/1710/02/17

Bibliographical note

Publisher Copyright:
© 2017 ACM.

ASJC Scopus subject areas

  • Computer Science Applications
  • Information Systems
  • Computer Networks and Communications
  • Software

Fingerprint

Dive into the research topics of 'DiSMEC - Distributed sparse machines for extreme multi-label classification'. Together they form a unique fingerprint.

Cite this