curl --request GET \
--url https://api.openai.com/v1/evals/{eval_id}/runs \
--header 'Authorization: Bearer <token>'{
"object": "list",
"data": [
{
"object": "eval.run",
"id": "<string>",
"eval_id": "<string>",
"status": "<string>",
"model": "<string>",
"name": "<string>",
"created_at": 123,
"report_url": "<string>",
"result_counts": {
"total": 123,
"errored": 123,
"failed": 123,
"passed": 123
},
"per_model_usage": [
{
"model_name": "<string>",
"invocation_count": 123,
"prompt_tokens": 123,
"completion_tokens": 123,
"total_tokens": 123,
"cached_tokens": 123
}
],
"per_testing_criteria_results": [
{
"testing_criteria": "<string>",
"passed": 123,
"failed": 123
}
],
"data_source": {
"type": "jsonl",
"source": {
"type": "file_content",
"content": [
{
"item": {},
"sample": {}
}
]
}
},
"metadata": {},
"error": {
"code": "<string>",
"message": "<string>"
}
}
],
"first_id": "<string>",
"last_id": "<string>",
"has_more": true
}Get a list of runs for an evaluation.
curl --request GET \
--url https://api.openai.com/v1/evals/{eval_id}/runs \
--header 'Authorization: Bearer <token>'{
"object": "list",
"data": [
{
"object": "eval.run",
"id": "<string>",
"eval_id": "<string>",
"status": "<string>",
"model": "<string>",
"name": "<string>",
"created_at": 123,
"report_url": "<string>",
"result_counts": {
"total": 123,
"errored": 123,
"failed": 123,
"passed": 123
},
"per_model_usage": [
{
"model_name": "<string>",
"invocation_count": 123,
"prompt_tokens": 123,
"completion_tokens": 123,
"total_tokens": 123,
"cached_tokens": 123
}
],
"per_testing_criteria_results": [
{
"testing_criteria": "<string>",
"passed": 123,
"failed": 123
}
],
"data_source": {
"type": "jsonl",
"source": {
"type": "file_content",
"content": [
{
"item": {},
"sample": {}
}
]
}
},
"metadata": {},
"error": {
"code": "<string>",
"message": "<string>"
}
}
],
"first_id": "<string>",
"last_id": "<string>",
"has_more": true
}Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
The ID of the evaluation to retrieve runs for.
Identifier for the last run from the previous pagination request.
Number of runs to retrieve.
Sort order for runs by timestamp. Use asc for ascending order or desc for descending order. Defaults to asc.
asc, desc Filter runs by status. Use "queued" | "in_progress" | "failed" | "completed" | "canceled".
queued, in_progress, completed, canceled, failed A list of runs for the evaluation
An object representing a list of runs for an evaluation.
The type of this object. It is always set to "list".
list An array of eval run objects.
Show child attributes
The identifier of the first eval run in the data array.
The identifier of the last eval run in the data array.
Indicates whether there are more evals available.
Was this page helpful?