Complex Word Identification as a Sequence Labelling Task

Sian Gooding, Ekaterina Kochmar

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Complex Word Identification (CWI) is concerned with detection of words in need of simplification and is a crucial first step in a simplification pipeline. It has been shown that reliable CWI systems considerably improve text simplification. However, most CWI systems to date address the task on a word-by-word basis, not taking the context into account. In this paper, we present a novel approach to CWI based on sequence modelling. Our system is capable of performing CWI in context, does not require extensive feature engineering and outperforms state-of-the-art systems on this task.
Original languageEnglish
Title of host publicationProceedings of the 57th Annual Meeting of the Association for Computational Linguistics: ACL 2019
PublisherACLWEB.ORG
Pages1148–1153
Number of pages5
Publication statusPublished - 28 Jul 2019
EventProceedings of the 57th Annual Meeting of the Association for Computational Linguistics: ACL 2019 - Florence, Florence, Italy
Duration: 28 Jul 20192 Aug 2019
https://acl2019.org/EN/index.xhtml.html

Conference

ConferenceProceedings of the 57th Annual Meeting of the Association for Computational Linguistics
Abbreviated titleACL 2019
Country/TerritoryItaly
CityFlorence
Period28/07/192/08/19
Internet address

Cite this