curl --request GET \
--url https://api.openai.com/v1/evals/{eval_id}/runs \
--header 'Authorization: Bearer <token>'
{
"object": "list",
"data": [
{
"object": "eval.run",
"id": "<string>",
"eval_id": "<string>",
"status": "<string>",
"model": "<string>",
"name": "<string>",
"created_at": 123,
"report_url": "<string>",
"result_counts": {
"total": 123,
"errored": 123,
"failed": 123,
"passed": 123
},
"per_model_usage": [
{
"model_name": "<string>",
"invocation_count": 123,
"prompt_tokens": 123,
"completion_tokens": 123,
"total_tokens": 123,
"cached_tokens": 123
}
],
"per_testing_criteria_results": [
{
"testing_criteria": "<string>",
"passed": 123,
"failed": 123
}
],
"data_source": {
"type": "jsonl",
"source": {
"type": "file_content",
"content": [
{
"item": {},
"sample": {}
}
]
}
},
"metadata": {},
"error": {
"code": "<string>",
"message": "<string>"
}
}
],
"first_id": "<string>",
"last_id": "<string>",
"has_more": true
}
Get a list of runs for an evaluation.
curl --request GET \
--url https://api.openai.com/v1/evals/{eval_id}/runs \
--header 'Authorization: Bearer <token>'
{
"object": "list",
"data": [
{
"object": "eval.run",
"id": "<string>",
"eval_id": "<string>",
"status": "<string>",
"model": "<string>",
"name": "<string>",
"created_at": 123,
"report_url": "<string>",
"result_counts": {
"total": 123,
"errored": 123,
"failed": 123,
"passed": 123
},
"per_model_usage": [
{
"model_name": "<string>",
"invocation_count": 123,
"prompt_tokens": 123,
"completion_tokens": 123,
"total_tokens": 123,
"cached_tokens": 123
}
],
"per_testing_criteria_results": [
{
"testing_criteria": "<string>",
"passed": 123,
"failed": 123
}
],
"data_source": {
"type": "jsonl",
"source": {
"type": "file_content",
"content": [
{
"item": {},
"sample": {}
}
]
}
},
"metadata": {},
"error": {
"code": "<string>",
"message": "<string>"
}
}
],
"first_id": "<string>",
"last_id": "<string>",
"has_more": true
}
Bearer authentication header of the form Bearer <token>
, where <token>
is your auth token.
The ID of the evaluation to retrieve runs for.
Identifier for the last run from the previous pagination request.
Number of runs to retrieve.
Sort order for runs by timestamp. Use asc
for ascending order or desc
for descending order. Defaults to asc
.
asc
, desc
Filter runs by status. Use "queued" | "in_progress" | "failed" | "completed" | "canceled".
queued
, in_progress
, completed
, canceled
, failed
A list of runs for the evaluation
An object representing a list of runs for an evaluation.
Was this page helpful?