Evaluation with Inspect AI
You can evaluate your customized Amazon Nova models using Inspect AI
Choose the evaluation approach that best fits your workflow:
-
Inspect AI SDK – Run evaluations interactively from a notebook or local environment against your SageMaker inference endpoint. Best for development, iteration, and quick testing.
-
Inspect AI container – Run evaluations at scale as SageMaker Training Jobs. Best for production evaluation pipelines, chaining multiple benchmarks, and automated workflows.
Recommended workflow: Start with the Inspect AI SDK to build and test your custom evaluation benchmarks using the AI assistant onboarding prompt