Glossary term
Agentic Systems
Golden response: a response known to be good. For example, given the following prompt:
2 + 2
The golden response is hopefully:
4
Note: Some organizations define additional terms such as silver response and platinum response for responses of lower or higher quality, respectively, than the golden response. For example, an organization might use platinum response to indicate a golden response generated by an expert and then further vetted by other experts.
Created for this library
An LLM evaluation team maintains a library of golden responses per prompt so automatic graders can score new model outputs against the expected answer.
A customer support team writes a golden response for each common ticket type so the model fine-tuning loop has a clear target.
A research lab maintains golden responses for its benchmark tasks so evaluation results stay comparable across model versions.
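The grading use case above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the names GOLDEN_RESPONSES, normalize, grade, and accuracy are hypothetical, and exact match after light normalization is just one simple way to compare an output with a golden response. Real graders often use fuzzier comparisons.

```python
# Hypothetical golden-response library: one known-good answer per prompt.
GOLDEN_RESPONSES = {
    "2 + 2": "4",
    "3 * 3": "9",
}


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so formatting noise is ignored."""
    return " ".join(text.lower().split())


def grade(prompt: str, model_output: str) -> bool:
    """Return True if the model output matches the golden response."""
    golden = GOLDEN_RESPONSES[prompt]
    return normalize(model_output) == normalize(golden)


def accuracy(outputs: dict[str, str]) -> float:
    """Fraction of prompts whose output matches its golden response."""
    results = [grade(prompt, output) for prompt, output in outputs.items()]
    return sum(results) / len(results)


if __name__ == "__main__":
    model_outputs = {"2 + 2": " 4 ", "3 * 3": "6"}
    print(accuracy(model_outputs))
```

Because the golden responses stay fixed, scores computed this way remain comparable when the model producing the outputs changes.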
Definition source: Google for Developers Machine Learning Glossary | Creative Commons Attribution 4.0 License