The correct interpretation of Multiword Units (MWUs) is crucial to many applications in Natural Language Processing but is a challenging and complex task. In recent years, the computational treatment of MWUs has received considerable attention but there is much more to be done before we can claim that NLP and Machine Translation (MT) systems process MWUs successfully. This volume provides a general overview of the field with particular reference to Machine Translation and Translation Technology and focuses on languages such as English, Basque, French Romanian, German, Dutch and Croatian among others. The chapters of the volume illustrate a variety of topics that address this challenge, such as the use of rule-based approaches, compound splitting techniques, MWU identification methodologies in multilingual applications, and MWU alignment issues.

Multi-word Units in Machine Translation and Translation Technology

Johanna Monti
Writing – Review & Editing
;
2018-01-01

Abstract

The correct interpretation of Multiword Units (MWUs) is crucial to many applications in Natural Language Processing but is a challenging and complex task. In recent years, the computational treatment of MWUs has received considerable attention but there is much more to be done before we can claim that NLP and Machine Translation (MT) systems process MWUs successfully. This volume provides a general overview of the field with particular reference to Machine Translation and Translation Technology and focuses on languages such as English, Basque, French Romanian, German, Dutch and Croatian among others. The chapters of the volume illustrate a variety of topics that address this challenge, such as the use of rule-based approaches, compound splitting techniques, MWU identification methodologies in multilingual applications, and MWU alignment issues.
2018
Ruslan Mitkov, Johanna Monti, Gloria Corpas Pastor, Violeta Seretan
Ruslan Mitkov, Johanna Monti, Gloria Corpas Pastor, Violeta Seretan, Uxoa Iñurrieta, Itziar Aduriz, Arantza Díaz de Ilarraza, Gorka Labaka and Kepa Sarasola, Joke Daems, Michael Carl, Sonia Vandepitte, Robert J. Hartsuiker and Lieve Macken, Amalia Todiraşcu and Mirabela Navlea, Gregor Thurmair, Simon Clematide, Stéphanie Lehner, Johannes Graën and Martin Volk, Lieve Macken and Arda Tezcan, Oscar Mendoza Rivera, Lukasz Grabowski, Kristina Kocijan and Sara Librenjak, Goranka Blagus Bartolec and Ivana Matas Ivanković,Eric Wehrli and Luka Nerima
Inglese
256 + index
9789027200600
https://benjamins.com/#catalog/books/cilt.341/main
John Benjamins
Amsterdam
PAESI BASSI
multiword units, natural language processing, multiword expressions, translation technology, machine translation
Comitato scientifico
4
7 Curatele::7.1 Curatela
284
info:eu-repo/semantics/other
restricted
Mitkov, Ruslan; Monti, Johanna; Corpas Pastor, Gloria; Seretan, Violeta
File in questo prodotto:
File Dimensione Formato  
cilt.341.cover.hb.pdf

solo utenti autorizzati

Descrizione: copertina
Tipologia: Documento in Post-print
Licenza: NON PUBBLICO - Accesso privato/ristretto
Dimensione 44.53 kB
Formato Adobe PDF
44.53 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
cilt.341.2e.kader.pdf

solo utenti autorizzati

Descrizione: interno del libro
Tipologia: Documento in Post-print
Licenza: NON PUBBLICO - Accesso privato/ristretto
Dimensione 3.89 MB
Formato Adobe PDF
3.89 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11574/181297
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
social impact