QBERT: Generalist Model for Processing Questions

Zhaozhen Xu, Nello Cristianini

Research output: Chapter or section in a book/report/conference proceedingChapter in a published conference proceeding

2 Citations (SciVal)

Abstract

Using a single model across various tasks is beneficial for training and applying deep neural sequence models. We address the problem of developing generalist representations of text that can be used to perform a range of different tasks rather than being specialised to a single application. We focus on processing short questions and developing an embedding for these questions that is useful on a diverse set of problems, such as question topic classification, equivalent question recognition, and question answering. This paper introduces QBERT, a generalist model for processing questions. With QBERT, we demonstrate how we can train a multi-task network that performs all question-related tasks and has achieved similar performance compared to its corresponding single-task models.

Original languageEnglish
Title of host publicationAdvances in Intelligent Data Analysis XXI - 21st International Symposium on Intelligent Data Analysis, IDA 2023, Proceedings
EditorsBruno Crémilleux, Sibylle Hess, Siegfried Nijssen
Place of PublicationCham, Switzerland
PublisherSpringer
Pages472-483
Number of pages12
ISBN (Print)9783031300462
DOIs
Publication statusPublished - 1 Apr 2023
Event21st International Symposium on Intelligent Data Analysis, IDA 2022 - Louvain-la-Neuve, Belgium
Duration: 12 Apr 202314 Apr 2023

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13876 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference21st International Symposium on Intelligent Data Analysis, IDA 2022
Country/TerritoryBelgium
CityLouvain-la-Neuve
Period12/04/2314/04/23

Keywords

  • Deep Learning
  • Multi-task Learning
  • Text Processing

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'QBERT: Generalist Model for Processing Questions'. Together they form a unique fingerprint.

Cite this