Lexical complexity is a highly subjective notion, yet this factor is often neglected in lexical simplification and readability systems which use a "one-size-fits-all" approach. In this paper, we investigate which aspects contribute to the notion of lexical complexity in various groups of readers, focusing on native and non-native speakers of English, and how the notion of complexity changes depending on the proficiency level of a non-native reader. To facilitate reproducibility of our approach and foster further research into these aspects, we release a dataset of complex words annotated by readers with different backgrounds.
|Publication status||Acceptance date - 10 Mar 2021|
|Event||2021 Annual Conference of the North American Chapter of the Association for Computational Linguistics - Virtual|
|Duration||6 Jun 2021 → 11 Jun 2021|
|Conference||2021 Annual Conference of the North American Chapter of the Association for Computational Linguistics|
|Abbreviated title||NAACL-HLT 2021|
|Period||6 Jun 2021 → 11 Jun 2021|