Agentic AI Library

Curated Open Source Library

Start Here Library Glossary About the CreatorRoadmap Provide Feedback

Glossary term

Loading…

Home/Glossary/MBPP

Evaluation and Benchmarks

MBPPMBPP

Abbreviation for Mostly Basic Python Problems.

Real-world uses

Created for this library

1.
An LLM evaluation team uses MBPP in its standard benchmark suite to measure basic Python programming ability per model release.
2.
A research lab reports MBPP scores in its model card so downstream users can compare basic coding ability across model versions.
3.
A model release team gates promotions on MBPP scores to avoid regressing on simple coding tasks important to enterprise users.

Definition source: Google for Developers Machine Learning Glossary | Creative Commons Attribution 4.0 License

Back to glossary

Agentic AI LibraryOpen Source · Last Reviewed 2026-06-07

Library About the Creator Roadmap PrivacyProvide Feedback LinkedIn Author Portfolio

All Rights Reserved @2026 Georgi Naydenov