AMBER : A modified BLEU, enhanced ranking metric

  1. (PDF, 298 KB)
AuthorSearch for: ; Search for:
Proceedings titleProceedings of the 6th Workshop on Statistical Machine Translation
ConferenceWMT 2011: Sixth Workshop on Statistical Machine translation, July 30-31, 2011, Edinburgh, Scotland
Pages7177; # of pages: 7
AbstractThis paper proposes a new automatic machine translation evaluation metric: AMBER, which is based on the metric BLEU but incorporates recall, extra penalties, and some text processing variants. There is very little linguistic information in AMBER. We evaluate its system-level correlation and sentence-level consistency scores with human rankings from the WMT shared evaluation task; AMBER achieves state-of-the-art performance.
Publication date
AffiliationNRC Institute for Information Technology; National Research Council Canada
Peer reviewedNo
NPARC number18533380
Export citationExport as RIS
Report a correctionReport a correction
Record identifier493f767b-84cf-4ecb-b6f9-2f779e258000
Record created2011-09-03
Record modified2016-05-09
Bookmark and share
  • Share this page with Facebook (Opens in a new window)
  • Share this page with Twitter (Opens in a new window)
  • Share this page with Google+ (Opens in a new window)
  • Share this page with Delicious (Opens in a new window)