Glossary term
Evaluation and Benchmarks
WiC: Abbreviation for Words in Context.
Created for this library
An LLM evaluation team uses WiC to test word-sense disambiguation across model versions.
A research lab reports WiC scores in its model card so downstream users can compare lexical reasoning across versions.
A model release team includes WiC in its standard benchmark suite to detect regressions in word-sense disambiguation.
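The regression check described in the examples above can be sketched as follows. This is a minimal illustration, not an official WiC harness: it assumes each item pairs a target word with two sentences and a gold label saying whether the word has the same sense in both. The names `WicItem`, `wic_accuracy`, `has_regressed`, the 0.01 tolerance, the four toy items, and both stand-in predictors are hypothetical and were made up for this sketch.

```python
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass(frozen=True)
class WicItem:
    """One WiC-style item (hypothetical container, not an official schema)."""
    word: str
    sentence_a: str
    sentence_b: str
    same_sense: bool  # gold label: is the word used in the same sense in both?


# A predictor takes (word, sentence_a, sentence_b) and returns a same-sense guess.
Predictor = Callable[[str, str, str], bool]


def wic_accuracy(predict: Predictor, items: Sequence[WicItem]) -> float:
    """Fraction of items where the prediction matches the gold label."""
    if not items:
        raise ValueError("need at least one item")
    correct = sum(
        predict(it.word, it.sentence_a, it.sentence_b) == it.same_sense
        for it in items
    )
    return correct / len(items)


def has_regressed(old_acc: float, new_acc: float, tolerance: float = 0.01) -> bool:
    """True if the new score is lower than the old one by more than `tolerance`."""
    return new_acc < old_acc - tolerance


# Toy items written for this sketch; they are not taken from the real dataset.
ITEMS = [
    WicItem("bank", "She sat on the bank of the river.",
            "He deposited cash at the bank.", False),
    WicItem("run", "I run every morning.",
            "They run along the beach.", True),
    WicItem("bat", "The bat flew out of the cave.",
            "He swung the bat hard.", False),
    WicItem("book", "I read a book last night.",
            "She wrote a book about birds.", True),
]

# Stand-in for an earlier model version: looks up the gold label directly.
_GOLD = {(it.word, it.sentence_a, it.sentence_b): it.same_sense for it in ITEMS}


def old_model(word: str, a: str, b: str) -> bool:
    return _GOLD[(word, a, b)]


# Stand-in for a newer model version: always answers "same sense".
def new_model(word: str, a: str, b: str) -> bool:
    return True


old_acc = wic_accuracy(old_model, ITEMS)
new_acc = wic_accuracy(new_model, ITEMS)
print(f"old={old_acc:.2f} new={new_acc:.2f} regressed={has_regressed(old_acc, new_acc)}")
```

In practice the two predictors would wrap real model versions, and the tolerance would be chosen to reflect run-to-run noise on the evaluation set.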
Definition source: Google for Developers Machine Learning Glossary | Creative Commons Attribution 4.0 License