BLEU

The Bilingual Evaluation Understudy (BLEU) is a popular metric used to assess machine translation systems. It measures how many n-grams of the predicted translation also appear in the reference translation(s): the score is the geometric mean of the modified (clipped) n-gram precisions, generally for n = 1, 2, 3, 4, multiplied by a brevity penalty that lowers the score of predictions shorter than the reference.
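The definition above can be sketched in a few lines of Python. This is a minimal, sentence-level illustration and not a reference implementation: the original metric is computed over a whole test corpus, and the function names (ngrams, bleu), the whitespace tokenization in the usage example, and the choice to return 0 without smoothing when any precision is zero are assumptions made here for clarity.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count all n-grams (as tuples) in a list of tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def bleu(candidate, references, max_n=4):
    """Sentence-level BLEU for one tokenized candidate and a list of tokenized references."""
    log_precisions = []
    for n in range(1, max_n + 1):
        cand_counts = ngrams(candidate, n)
        # For each n-gram, record its highest count in any single reference.
        max_ref_counts = Counter()
        for ref in references:
            for gram, count in ngrams(ref, n).items():
                max_ref_counts[gram] = max(max_ref_counts[gram], count)
        # Clip candidate counts so repeated n-grams are not over-credited.
        clipped = sum(min(count, max_ref_counts[gram])
                      for gram, count in cand_counts.items())
        total = sum(cand_counts.values())
        if clipped == 0 or total == 0:
            return 0.0  # assumption: no smoothing, any zero precision gives 0
        log_precisions.append(math.log(clipped / total))

    # Brevity penalty: compare the candidate length c with the closest
    # reference length r (ties go to the shorter reference).
    c = len(candidate)
    r = min((len(ref) for ref in references),
            key=lambda length: (abs(length - c), length))
    brevity_penalty = 1.0 if c > r else math.exp(1 - r / c)

    # Geometric mean of the precisions, with uniform weights 1 / max_n.
    return brevity_penalty * math.exp(sum(log_precisions) / max_n)


reference = "the cat sat on the mat".split()
print(bleu("the cat sat on the mat".split(), [reference]))
print(bleu("the cat sat on".split(), [reference]))
```

In the second call every n-gram of the prediction appears in the reference, so all precisions equal 1 and the score below 1 comes entirely from the brevity penalty.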
Related concepts:
N-Gram, Word Error Rate, Perplexity
External reference:
https://aclanthology.org/P02-1040.pdf