ROUGE (Recall-Oriented Understudy for Gisting Evaluation)

A family of metrics that evaluate automatic summarization and machine translation models. ROUGE metrics determine the degree to which a reference text overlaps an ML model's generated text. Each member of the ROUGE family measures overlap in a different way. Higher ROUGE scores indicate more similarity between the reference text and generated text than lower ROUGE scores.

Each ROUGE family member typically generates the following metrics:

Precision

Recall

F1

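Note: ROUGE computes precision and recall from the units shared by the generated text and the reference text (for example, N-grams or subsequences), not from the true and false positives of a classification model.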
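To make the overlap-based precision and recall concrete, here is a minimal sketch of a ROUGE-N style calculation. It assumes lowercased whitespace tokenization and a single reference, and it also reports the harmonic mean of precision and recall (F1). The function names `ngrams` and `rouge_n` are illustrative, not taken from any particular library, and production implementations typically add stemming and more careful tokenization.

```python
from collections import Counter


def ngrams(tokens, n):
    """Count every contiguous n-token window in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n(reference, generated, n=1):
    """Illustrative ROUGE-N: precision, recall, and F1 from clipped N-gram overlap."""
    # Assumption: simple lowercased whitespace tokenization.
    ref_counts = ngrams(reference.lower().split(), n)
    gen_counts = ngrams(generated.lower().split(), n)

    # Counter intersection keeps the minimum count of each shared N-gram,
    # so a repeated N-gram is only credited as often as it appears in both texts.
    overlap = sum((ref_counts & gen_counts).values())
    ref_total = sum(ref_counts.values())
    gen_total = sum(gen_counts.values())

    # Recall: share of the reference's N-grams found in the generated text.
    recall = overlap / ref_total if ref_total else 0.0
    # Precision: share of the generated text's N-grams found in the reference.
    precision = overlap / gen_total if gen_total else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return {"precision": precision, "recall": recall, "f1": f1}


scores = rouge_n("the cat sat on the mat", "the cat sat")
print(scores)
```

In this example every word of the short generated text appears in the reference, so precision is 1.0, but the generated text covers only half of the reference's words, so recall is 0.5.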

For details and examples, see:

ROUGE-L

ROUGE-N

ROUGE-S
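As an illustration of how family members measure overlap differently, the following sketch scores two texts by their longest common subsequence, the idea behind ROUGE-L. It assumes lowercased whitespace tokenization and a plain F1 rather than a weighted F-measure, and the names `lcs_length` and `rouge_l` are hypothetical helpers rather than a library API.

```python
def lcs_length(a, b):
    """Length of the longest common subsequence of two token lists."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                # Extend the best subsequence that ends before both tokens.
                curr.append(prev[j - 1] + 1)
            else:
                # Otherwise carry forward the better of the two neighbors.
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l(reference, generated):
    """Illustrative ROUGE-L: precision, recall, and F1 from LCS length."""
    # Assumption: simple lowercased whitespace tokenization.
    ref = reference.lower().split()
    gen = generated.lower().split()
    lcs = lcs_length(ref, gen)

    recall = lcs / len(ref) if ref else 0.0
    precision = lcs / len(gen) if gen else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return {"precision": precision, "recall": recall, "f1": f1}


result = rouge_l("the cat sat on the mat", "the cat on the mat sat")
print(result)
```

Both texts here contain exactly the same words, so a unigram overlap would be perfect, but the longest in-order match ("the cat on the mat") has only five of the six tokens, so the subsequence-based score is lower. Word order affects this kind of measure in a way it does not affect unigram overlap.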

Note: BLEU and BLEURT emphasize precision, while ROUGE emphasizes recall. Consequently, BLEU and BLEURT are generally preferred for evaluating machine translation (where the focus is precision), while ROUGE is generally preferred for evaluating summarization (where the focus is recall).

Real-world uses

Created for this library

1. A summarization team reports ROUGE scores against editor-written reference summaries to compare model versions.
2. A research team reports ROUGE in its preprint so other researchers can compare summarization performance.
3. A news platform uses ROUGE as an offline metric to compare summarization models before paying for human review.
