GET /evals/{eval_id}/runs
curl --request GET \
  --url https://api.openai.com/v1/evals/{eval_id}/runs \
  --header 'Authorization: Bearer <token>'
{
  "object": "list",
  "data": [
    {
      "object": "eval.run",
      "id": "<string>",
      "eval_id": "<string>",
      "status": "<string>",
      "model": "<string>",
      "name": "<string>",
      "created_at": 123,
      "report_url": "<string>",
      "result_counts": {
        "total": 123,
        "errored": 123,
        "failed": 123,
        "passed": 123
      },
      "per_model_usage": [
        {
          "model_name": "<string>",
          "invocation_count": 123,
          "prompt_tokens": 123,
          "completion_tokens": 123,
          "total_tokens": 123,
          "cached_tokens": 123
        }
      ],
      "per_testing_criteria_results": [
        {
          "testing_criteria": "<string>",
          "passed": 123,
          "failed": 123
        }
      ],
      "data_source": {
        "type": "jsonl",
        "source": {
          "type": "file_content",
          "content": [
            {
              "item": {},
              "sample": {}
            }
          ]
        }
      },
      "metadata": {},
      "error": {
        "code": "<string>",
        "message": "<string>"
      }
    }
  ],
  "first_id": "<string>",
  "last_id": "<string>",
  "has_more": true
}
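
To make the shape of the example response concrete, here is a minimal sketch of reading the `result_counts` block of each run. The function name `summarize_runs`, the sample payload, and the derived `pass_rate` value (passed divided by total) are assumptions for illustration; only the field names come from the example response above.

```python
from typing import Any, Dict, List


def summarize_runs(payload: Dict[str, Any]) -> List[Dict[str, Any]]:
    """Return one summary row per run in a list-runs response payload."""
    rows = []
    for run in payload.get("data", []):
        counts = run.get("result_counts") or {}
        total = counts.get("total", 0)
        passed = counts.get("passed", 0)
        rows.append({
            "id": run.get("id"),
            "status": run.get("status"),
            "total": total,
            "passed": passed,
            # pass_rate is derived here; it is not a field returned by the API.
            "pass_rate": passed / total if total else None,
        })
    return rows


# Hypothetical payload that mirrors the documented field names.
example = {
    "object": "list",
    "data": [
        {"object": "eval.run", "id": "run_a", "status": "completed",
         "result_counts": {"total": 4, "errored": 0, "failed": 1, "passed": 3}},
        {"object": "eval.run", "id": "run_b", "status": "queued",
         "result_counts": {"total": 0, "errored": 0, "failed": 0, "passed": 0}},
    ],
    "first_id": "run_a",
    "last_id": "run_b",
    "has_more": False,
}

for row in summarize_runs(example):
    print(row)
```

Runs with a `total` of zero get a `pass_rate` of `None` rather than a division error, which seems the safer choice for runs that have not produced results yet.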

Authorizations

Authorization
string
header
required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.

Path Parameters

eval_id
string
required

The ID of the evaluation to retrieve runs for.

Query Parameters

after
string

Identifier for the last run from the previous pagination request.

limit
integer
default:20

Number of runs to retrieve.

order
enum<string>
default:asc

Sort order for runs by timestamp: asc for ascending or desc for descending.

Available options: asc, desc

status
enum<string>

Filter runs by status. Must be one of queued, in_progress, completed, canceled, or failed.

Available options: queued, in_progress, completed, canceled, failed
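
The query parameters above can be combined into a request URL along these lines. This is a sketch, not an official client: `build_runs_url`, `BASE_URL`, `ORDERS`, and `STATUSES` are hypothetical names, and the client-side validation is an assumption layered on the documented option lists.

```python
from typing import Optional
from urllib.parse import quote, urlencode

BASE_URL = "https://api.openai.com/v1"
ORDERS = {"asc", "desc"}
STATUSES = {"queued", "in_progress", "completed", "canceled", "failed"}


def build_runs_url(
    eval_id: str,
    *,
    after: Optional[str] = None,
    limit: Optional[int] = None,
    order: Optional[str] = None,
    status: Optional[str] = None,
) -> str:
    """Build the list-runs URL, omitting any parameter left as None."""
    if order is not None and order not in ORDERS:
        raise ValueError(f"order must be one of {sorted(ORDERS)}")
    if status is not None and status not in STATUSES:
        raise ValueError(f"status must be one of {sorted(STATUSES)}")

    params = {"after": after, "limit": limit, "order": order, "status": status}
    # Drop unset parameters so the server applies its documented defaults.
    query = urlencode({k: v for k, v in params.items() if v is not None})

    url = f"{BASE_URL}/evals/{quote(eval_id, safe='')}/runs"
    return f"{url}?{query}" if query else url


print(build_runs_url("eval_123", limit=50, order="desc", status="completed"))
# https://api.openai.com/v1/evals/eval_123/runs?limit=50&order=desc&status=completed
```

The resulting URL would be sent with the same `Authorization: Bearer <token>` header shown in the curl example.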

Response

200 - application/json

A list of runs for the evaluation

An object representing a list of runs for an evaluation.
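
Because the response carries `last_id` and `has_more`, a caller can walk every page by feeding `last_id` back in as the `after` parameter. The sketch below assumes that pairing, which the parameter description suggests but does not spell out. `iter_runs`, `fake_fetch`, and the sample pages are hypothetical, and the HTTP call is left to a caller-supplied function so the loop stays independent of any particular client library.

```python
from typing import Any, Callable, Dict, Iterator, Optional

Page = Dict[str, Any]


def iter_runs(fetch_page: Callable[[Optional[str]], Page]) -> Iterator[Dict[str, Any]]:
    """Yield every run, requesting pages until has_more is false.

    fetch_page receives the `after` cursor (None for the first page)
    and returns one decoded list-runs response.
    """
    after: Optional[str] = None
    while True:
        page = fetch_page(after)
        yield from page.get("data", [])
        if not page.get("has_more"):
            return
        after = page.get("last_id")
        if after is None:
            # No cursor to continue from; stop rather than loop forever.
            return


# Hypothetical in-memory pages standing in for real HTTP responses.
_PAGES: Dict[Optional[str], Page] = {
    None: {"object": "list", "data": [{"id": "run_1"}, {"id": "run_2"}],
           "first_id": "run_1", "last_id": "run_2", "has_more": True},
    "run_2": {"object": "list", "data": [{"id": "run_3"}],
              "first_id": "run_3", "last_id": "run_3", "has_more": False},
}


def fake_fetch(after: Optional[str]) -> Page:
    return _PAGES[after]


print([run["id"] for run in iter_runs(fake_fetch)])
# ['run_1', 'run_2', 'run_3']
```

In real use, `fetch_page` would issue the GET request with the `after` value in the query string and return the parsed JSON body.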