Testing

Testing is how we verify that an AI system works as expected before it goes live. It covers accuracy, behavior, and edge cases, so there are no surprises when customers or teams start using it.
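To make the three kinds of checks concrete, here is a minimal sketch in plain Python, written so it can be run with pytest. The `answer` function is a hypothetical stand-in for a real model call, and the canned replies, test questions, and 0.9 accuracy threshold are all assumptions chosen for illustration. It does not use the API of any tool listed under Examples.

```python
# Hypothetical stand-in for a real model call; swap in your own client.
def answer(question: str) -> str:
    canned = {
        "what is the capital of france?": "Paris",
        "what is 2 + 2?": "4",
    }
    key = question.strip().lower()
    if not key:
        # Edge case: empty or whitespace-only input gets a prompt back.
        return "Please ask a question."
    return canned.get(key, "I don't know.")


def accuracy(cases: list[tuple[str, str]]) -> float:
    """Fraction of cases whose expected text appears in the answer."""
    hits = sum(expected.lower() in answer(q).lower() for q, expected in cases)
    return hits / len(cases)


def test_accuracy():
    # Accuracy: known questions should get the expected answers.
    cases = [
        ("What is the capital of France?", "Paris"),
        ("What is 2 + 2?", "4"),
    ]
    assert accuracy(cases) >= 0.9  # illustrative threshold, not a standard


def test_behavior_unknown_question():
    # Behavior: admit uncertainty instead of guessing.
    assert answer("Who won the 2090 World Cup?") == "I don't know."


def test_edge_case_empty_input():
    # Edge case: blank input should not crash or return an empty reply.
    assert answer("   ") == "Please ask a question."
```

With a real model, exact string matches like these are usually too strict, which is one reason teams turn to dedicated evaluation tools that score answers more flexibly.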

Examples

1. LangSmith Evaluations, Promptfoo, and Patronus AI are widely used LLM testing platforms.
2. DeepEval is an open-source pytest-style framework for testing LLM applications.
3. Giskard and Robust Intelligence test models for bias, robustness, and adversarial attacks.
