BLEU
- Bilingual Evaluation Understudy
- MT quality metric
- Precision Based
- Precisionn=p∈hyp∑n−gram∈p∑Count(n−gram)p∈hyp∑n−gram∈p∑Countclip(n−gram)
- Countclip(n−gram)=min(matched n-gram count,maxr∈Ref(n-gram count in r))
- A weighted logarithmic average : Consider the exponential decay observed in the n-gram precision
∑n=1Nwn∏n=1Npnwn=∑n=1Nwn1exp(∑n=1Nwn∗lnpn)=exp(N1∗∑n=1Nlnpn)
Details
- Brevity Penalty
- BP={1e(1−cr)if c>rif c≤r
- BLEU=BP⋅exp(n=1∑Nwnlogpn)
- logBLEU=min(1−cr,0)+n=1∑Nwnlogpn
ROUGE
Recall-Oriented Understudy for Gisting Evaluation
ROUGE-N
An n-gram recall between a candidate summary and a set of reference summaries
Recall=TP+FNTP
OverView