Glossary term
Glossary term
Agentic Systems
Primarily used as an abbreviation for LLM evaluations. More broadly, evals is an abbreviation for any form of evaluation.
Created for this library
An LLM product team runs a suite of evals nightly so any prompt or model change must show measurable quality improvement before launch.
An evaluation team curates domain-specific evals for its legal-tech product so model selection is grounded in client-relevant tasks.
A search team runs evals on a frozen set of human-rated queries every time the ranker is retrained, blocking releases that regress on key segments.
Definition source: Google for Developers Machine Learning Glossary | Creative Commons Attribution 4.0 License