BLEU
The Bilingual Evaluation Understudy (BLEU) is a popular metric used to assess machine translation systems. It is based on the geometric mean of precision scores computed from n-grams (generally for n = 1, 2, 3, 4) of the reference(s) and predicted translation, with a penalty for shorter sentences.