Inter-Rater Agreement

A measurement of how often human raters agree when doing a task. If raters disagree, the task instructions may need to be improved. Also sometimes called inter-annotator agreement or inter-rater reliability. See also Cohen's kappa, which is one of the most popular inter-rater agreement measurements.

See Categorical data: Common issues in Machine Learning Crash Course for more information.

Real-world uses

Created for this library

1.
A search-quality team monitors inter-rater agreement on relevance judgments so label noise stays under control.
2.
A medical labeling team monitors inter-rater agreement among radiologists to confirm annotation guidelines are interpreted consistently.
3.
A research lab uses inter-rater agreement as a quality gate before accepting a new annotation batch into training data.

Definition source: Google for Developers Machine Learning Glossary | Creative Commons Attribution 4.0 License

Back to glossary

Real-world uses

Loading…

Real-world uses