EvaluationReferenceInput
class EvaluationReferenceInput
A reference input containing ground truth data for evaluation, scoped to a specific context level (session or trace) through its span context.
Types
Properties
Link copied to clipboard
A list of assertion statements for session-level evaluation. Each assertion describes an expected behavior or outcome the agent should demonstrate during the session.
Link copied to clipboard
The expected response for trace-level evaluation. Built-in evaluators that support this field compare the agent's actual response against this value for assessment. Custom evaluators can access it through the {expected_response} placeholder in their instructions.
Link copied to clipboard
The expected tool call sequence for session-level trajectory evaluation. Contains a list of tool names representing the tools the agent is expected to invoke.