Adjudicating LLMs as PropBank Annotators

Julia Bonn, Harish Tayyar Madabushi, Jena D. Hwang, Claire Bonial

Research output: Chapter or section in a book/report/conference proceedingChapter in a published conference proceeding

2 Citations (SciVal)
62 Downloads (Pure)

Abstract

We evaluate the ability of large language models (LLMs) to provide PropBank semantic role label annotations across different realizations of the same verbs in transitive, intransitive, and middle voice constructions. In order to assess the meta-linguistic capabilities of LLMs as well as their ability to glean such capabilities through in-context learning, we evaluate the models in a zero-shot setting, in a setting where it is given three examples of another verb used in transitive, intransitive, and middle voice constructions, and finally in a setting where it is given the examples as well as the correct sense and roleset information. We find that zero-shot knowledge of PropBank annotation is almost nonexistent. The largest model evaluated, GPT-4, achieves the best performance in the setting where it is given both examples and the correct roleset in the prompt, demonstrating that larger models can ascertain some meta-linguistic capabilities through in-context learning. However, even in this setting, which is simpler than the task of a human in PropBank annotation, the model achieves only 48% accuracy in marking numbered arguments correctly. To ensure transparency and reproducibility, we publicly release our dataset and model responses.

Original languageEnglish
Title of host publication5th International Workshop on Designing Meaning Representation, DMR 2024 at LREC-COLING 2024 - Workshop Proceedings
EditorsClaire Bonial, Julia Bonn, Jena D. Hwang
PublisherEuropean Language Resources Association (ELRA)
Pages112-123
Number of pages12
ISBN (Electronic)9782493814395
Publication statusPublished - 21 May 2024
Event5th International Workshop on Designing Meaning Representation, DMR 2024 - Torino, Italy
Duration: 21 May 2024 → …

Publication series

Name5th International Workshop on Designing Meaning Representation, DMR 2024 at LREC-COLING 2024 - Workshop Proceedings

Conference

Conference5th International Workshop on Designing Meaning Representation, DMR 2024
Country/TerritoryItaly
CityTorino
Period21/05/24 → …

Keywords

  • LLM Evaluation
  • PropBank
  • Semantic Role Labeling

ASJC Scopus subject areas

  • Library and Information Sciences
  • Linguistics and Language
  • Language and Linguistics
  • Education

Fingerprint

Dive into the research topics of 'Adjudicating LLMs as PropBank Annotators'. Together they form a unique fingerprint.

Cite this