Glossary term
Glossary term
Agentic Systems
Abbreviation for one right answer.
Created for this library
A factual QA team labels its evaluation set as ORA so the grader checks for the exact correct answer.
An LLM evaluation team treats arithmetic prompts as ORA and grades against the exact final value.
A code-generation team uses ORA-style evaluation on coding benchmarks where correctness is whether tests pass.
Definition source: Google for Developers Machine Learning Glossary | Creative Commons Attribution 4.0 License