ROUGE-L

A member of the ROUGE family focused on the length of the longest common subsequence in the reference text and generated text. The following formulas calculate recall and precision for ROUGE-L:

You can then use F1 to roll up ROUGE-L recall and ROUGE-L precision into a single metric:

Click the icon for an example calculation of ROUGE-L.

ROUGE-L ignores any newlines in the reference text and generated text, so the longest common subsequence could cross multiple sentences. When the reference text and generated text involve multiple sentences, a variation of ROUGE-L called ROUGE-Lsum is generally a better metric. ROUGE-Lsum determines the longest common subsequence for each sentence in a passage and then calculates the mean of those longest common subsequences.

Click the icon for an example calculation of ROUGE-Lsum.

Real-world uses

Created for this library

1.
A summarization team reports ROUGE-L alongside ROUGE-1 and ROUGE-2 to capture longest common subsequence between generated and reference summaries.
2.
A research team uses ROUGE-L on long-form generation to capture sequence-level similarity that unigram metrics miss.
3.
A news platform reports ROUGE-L weekly to detect drift in its summarization model's structural fidelity.

Back to glossary

A member of the ROUGE family focused on the length of the longest common subsequence in the reference text and generated text. The following formulas calculate recall and precision for ROUGE-L:

You can then use F1 to roll up ROUGE-L recall and ROUGE-L precision into a single metric:

Click the icon for an example calculation of ROUGE-L.

Click the icon for an example calculation of ROUGE-Lsum.

Real-world uses

Created for this library

1.
A summarization team reports ROUGE-L alongside ROUGE-1 and ROUGE-2 to capture longest common subsequence between generated and reference summaries.
2.
A research team uses ROUGE-L on long-form generation to capture sequence-level similarity that unigram metrics miss.
3.
A news platform reports ROUGE-L weekly to detect drift in its summarization model's structural fidelity.

Back to glossary

ROUGE-L

Real-world uses

Related terms

Loading…

ROUGE-L

Real-world uses

Related terms