Searching parallel corpora for contextually equivalent terms

AuthorSearch for: ; Search for:
Proceedings titleProceedings of the 15th Conference of the European Association for Machine Translation
Conference15th Annual Conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium
Pages105112; # of pages: 8
AbstractIn this paper, we show how a large bilingual English-French parallel corpus can be brought to bear in terminology search. First, we demonstrate that the coverage of available corpora has become substantially more extensive than that of mainstream term banks. One potential drawback in searching large unstructured corpora is that large numbers of search results may need to be examined before finding a relevant match. We argue that this problem can be alleviated by contextualizing the search process: instead of looking up isolated terms one searches for terms appearing in a context that is similar to that of the term to be translated. We present an experiment on contextbased re-ranking and report highly positive results. We conclude that translators will increasingly rely on very large scale corpora for searching term equivalents.
Publication date
PublisherEuropean Association for Machine Translation
AffiliationNRC Institute for Information Technology; National Research Council Canada
Peer reviewedYes
NPARC number21268095
Export citationExport as RIS
Report a correctionReport a correction
Record identifiercb344a25-d0c4-4d78-9d46-d02956480d42
Record created2013-04-09
Record modified2016-05-09
Bookmark and share
  • Share this page with Facebook (Opens in a new window)
  • Share this page with Twitter (Opens in a new window)
  • Share this page with Google+ (Opens in a new window)
  • Share this page with Delicious (Opens in a new window)