InferenceComponentAvailabilityZoneBalance
Configuration for balancing inference component copies across Availability Zones.
Types
Properties
Determines how strictly the Availability Zone balance constraint is enforced.
The maximum allowed difference in the number of inference component copies between any two Availability Zones. This parameter applies only when the endpoint has instances across two or more Availability Zones. A copy placement is allowed if it reduces imbalance or the resulting imbalance is within this value.
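The placement rule described above can be modeled with a short, standalone sketch. It is not part of the SDK and does not reflect how the service is implemented: the function names `imbalance` and `isPlacementAllowed`, the `maxImbalance` parameter name, and the zone labels are all hypothetical. Imbalance is taken here to mean the largest copy count in any zone minus the smallest.

```kotlin
// Hypothetical model of the documented rule; none of these names come from the SDK.

// Largest difference in copy count between any two zones (max minus min).
fun imbalance(copiesPerZone: Map<String, Int>): Int {
    if (copiesPerZone.size < 2) return 0
    return copiesPerZone.values.maxOf { it } - copiesPerZone.values.minOf { it }
}

// A placement is allowed if it reduces the imbalance, or if the resulting
// imbalance does not exceed maxImbalance (assumed parameter name).
fun isPlacementAllowed(
    copiesPerZone: Map<String, Int>,
    targetZone: String,
    maxImbalance: Int,
): Boolean {
    require(targetZone in copiesPerZone) { "unknown zone: $targetZone" }
    // The constraint applies only with two or more Availability Zones.
    if (copiesPerZone.size < 2) return true

    val before = imbalance(copiesPerZone)
    val afterPlacement = copiesPerZone + (targetZone to copiesPerZone.getValue(targetZone) + 1)
    val after = imbalance(afterPlacement)
    return after < before || after <= maxImbalance
}

fun main() {
    val zones = mapOf("az-a" to 3, "az-b" to 1)

    // Placing in az-b lowers the imbalance from 2 to 1, so it is allowed
    // even with a limit of 0.
    println(isPlacementAllowed(zones, "az-b", maxImbalance = 0)) // true

    // Placing in az-a raises the imbalance to 3, which exceeds the limit of 2.
    println(isPlacementAllowed(zones, "az-a", maxImbalance = 2)) // false
}
```

The first call shows why the rule has two branches: a placement that improves balance is accepted even when the endpoint is still outside the configured limit, which lets an unbalanced endpoint converge instead of rejecting every placement.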
Functions
inline fun copy(block: InferenceComponentAvailabilityZoneBalance.Builder.() -> Unit = {}): InferenceComponentAvailabilityZoneBalance