A simple and effective scheme for data pre-processing in extreme classification

Sujay Khandagale, Rohit Babbar

Research output: Chapter or section in a book/report/conference proceedingChapter in a published conference proceeding

1 Citation (SciVal)

Abstract

Extreme multi-label classification (XMC) refers to supervised multi-label learning involving hundreds of thousand or even millions of labels. It has been shown to be an effective framework for addressing crucial tasks such as recommendation, ranking and web-advertising. In this paper, we propose a method for effective and well-motivated data pre-processing scheme in XMC. We show that our proposed algorithm, PrunEX, can remove upto 90% data in the input which is redundant from a classification view-point. Our scheme is universal in the sense it is applicable to all known public datasets in the domain of XMC.

Original languageEnglish
Title of host publicationESANN 2019 - Proceedings, 27th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning
PublisherESANN (i6doc.com)
Pages67-72
Number of pages6
ISBN (Electronic)9782875870650
Publication statusPublished - 26 Apr 2019
Event27th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning, ESANN 2019 - Bruges, Belgium
Duration: 24 Apr 201926 Apr 2019

Publication series

NameESANN 2019 - Proceedings, 27th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning

Conference

Conference27th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning, ESANN 2019
Country/TerritoryBelgium
CityBruges
Period24/04/1926/04/19

Bibliographical note

Funding Information:
This work was done when Sujay was a student at Aalto University, Finland We appreciate the computing resources provided by the Aalto Science-IT project.

ASJC Scopus subject areas

  • Artificial Intelligence
  • Information Systems

Fingerprint

Dive into the research topics of 'A simple and effective scheme for data pre-processing in extreme classification'. Together they form a unique fingerprint.

Cite this