AI Fundamentals

Inference

The process of using a trained model to make predictions on new, unseen data.

In-depth explanation

Once a model is trained, inference is the deployment phase in which the model, with its learned parameters held fixed, processes new inputs and produces outputs. In production systems inference usually has to be fast and inexpensive, because it runs on every request rather than once. Model optimization, quantization (storing weights at lower numeric precision), and specialized hardware such as GPUs and TPUs all help speed up inference.
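As a rough illustration of the idea, the sketch below runs inference with a tiny linear classifier and then repeats the prediction with 8-bit quantized weights. The weights, bias, and function names (`predict`, `quantize`, `predict_quantized`) are hypothetical and chosen only for this example; a real system would load parameters produced by training and would normally rely on a framework's own quantization tooling.

```python
# Hypothetical parameters for a tiny linear classifier. In practice these
# would be loaded from a training run, not written by hand.
WEIGHTS = [0.8, -0.5, 0.3]
BIAS = 0.1


def predict(features, weights=WEIGHTS, bias=BIAS):
    """Inference: apply fixed, already-trained parameters to a new input."""
    score = sum(w * x for w, x in zip(weights, features)) + bias
    return 1 if score > 0 else 0


def quantize(weights, bits=8):
    """Symmetric quantization: map floats to small integers plus one scale."""
    max_abs = max(abs(w) for w in weights)
    levels = 2 ** (bits - 1) - 1  # 127 for 8 bits
    scale = max_abs / levels if max_abs else 1.0
    return [round(w / scale) for w in weights], scale


def predict_quantized(features, q_weights, scale, bias=BIAS):
    """Same prediction, computed from integer weights and rescaled once."""
    score = scale * sum(q * x for q, x in zip(q_weights, features)) + bias
    return 1 if score > 0 else 0


new_input = [1.0, 2.0, 3.0]  # unseen data, not part of training
q_weights, scale = quantize(WEIGHTS)
print(predict(new_input), predict_quantized(new_input, q_weights, scale))
```

No parameter is updated anywhere in this code, which is what separates inference from training. The quantized path trades a small amount of precision in the score for weights that fit in one byte each, which is the kind of saving that makes inference cheaper on larger models.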

Examples

Getting a translation from Google Translate
Face detection in photos
