Alignment-based extraction of multiword expressions

H D Caseli, C Ramisch, M D V Nunes, A Villavicencio

Research output: Contribution to journalArticlepeer-review

21 Citations (Scopus)

Abstract

Due to idiosyncrasies in their syntax, semantics or frequency, Multiword Expressions (MWEs) have received special attention from the NLP community, as the methods and techniques developed for the treatment of simplex words are not necessarily suitable for them. This is certainly the case for the automatic acquisition of MWEs from corpora. A lot of effort has been directed to the task of automatically identifying them, with considerable success. In this paper, we propose an approach for the identification of MWEs in a multilingual context, as a by-product of a word alignment process, that not only deals with the identification of possible MWE candidates, but also associates some multiword expressions with semantics. The results obtained indicate the feasibility and low costs in terms of tools and resources demanded by this approach, which could, for example, facilitate and speed up lexicographic work.
Original languageEnglish
Pages (from-to)59-77
Number of pages19
JournalLanguage Resources and Evaluation
Volume44
Issue number1-2
DOIs
Publication statusPublished - Apr 2010

Keywords

  • terminology
  • statistical methods
  • automatic identification
  • lexical acquisition
  • machine translation
  • multiword expressions
  • word alignment

Fingerprint Dive into the research topics of 'Alignment-based extraction of multiword expressions'. Together they form a unique fingerprint.

Cite this