TY - GEN
T1 - Why state-of-the-art deep learning barely works as good as a linear classifier in extreme multi-label text classification
AU - Qaraei, Mohammadreza
AU - Khandagale, Sujay
AU - Babbar, Rohit
N1 - Publisher Copyright:
© ESANN 2020 - Proceedings, 28th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning.
PY - 2020/10/2
Y1 - 2020/10/2
N2 - Extreme Multi-label Text Classification (XMTC) refers to supervised learning of a classifier which can predict a small subset of relevant labels for a document from an extremely large set. Even though deep learning algorithms have surpassed linear and kernel methods for most natural language processing tasks over the last decade; recent works show that state-of-the-art deep learning methods can only barely manage to work as well as a linear classifier for the XMTC task. The goal of this work is twofold: (i) to investigate the reasons for the comparable performance of these two strands of methods for XMTC, and (ii) to document this observation explicitly, as the efficacy of linear classifiers in this regime, has been ignored in many relevant recent works.
AB - Extreme Multi-label Text Classification (XMTC) refers to supervised learning of a classifier which can predict a small subset of relevant labels for a document from an extremely large set. Even though deep learning algorithms have surpassed linear and kernel methods for most natural language processing tasks over the last decade; recent works show that state-of-the-art deep learning methods can only barely manage to work as well as a linear classifier for the XMTC task. The goal of this work is twofold: (i) to investigate the reasons for the comparable performance of these two strands of methods for XMTC, and (ii) to document this observation explicitly, as the efficacy of linear classifiers in this regime, has been ignored in many relevant recent works.
UR - http://www.scopus.com/inward/record.url?scp=85098959798&partnerID=8YFLogxK
M3 - Chapter in a published conference proceeding
AN - SCOPUS:85098959798
T3 - ESANN 2020 - Proceedings, 28th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning
SP - 223
EP - 228
BT - ESANN 2020 - Proceedings, 28th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning
PB - ESANN (i6doc.com)
T2 - 28th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning, ESANN 2020
Y2 - 2 October 2020 through 4 October 2020
ER -