Typologically Diverse Question Answering (TyDi QA)

A large dataset for evaluating an LLM's proficiency in answering questions. The dataset contains question and answer pairs in many languages.

For details, see TyDi QA: A Benchmark for Information-Seeking Question Answering in Typologically Diverse Languages.

Created for this library

1.
An LLM evaluation team uses TyDi QA to measure question answering across typologically diverse languages.
2.
A multilingual NLP team reports TyDi QA scores in its model card to communicate cross-language performance.
3.
A research lab uses TyDi QA in its multilingual benchmark suite to ensure language coverage in evaluation.

Loading…