Glossary term
Glossary term
Infrastructure and Serving
Deployment is when an AI system goes live for real users, whether integrated into a chatbot, voice assistant, or internal tool, marking the point where performance and reliability truly matter.
AWS SageMaker, Azure ML, and Vertex AI provide enterprise deployment platforms for AI models.
Hugging Face Inference Endpoints lets teams deploy open-source models like Llama and Mistral.
Replicate, Modal, and Fireworks AI are popular serverless deployment platforms for generative AI.