Get eval runs

curl --request GET \
  --url https://api.openai.com/v1/evals/{eval_id}/runs \
  --header 'Authorization: Bearer <token>'

{
  "object": "list",
  "data": [
    {
      "object": "eval.run",
      "id": "<string>",
      "eval_id": "<string>",
      "status": "<string>",
      "model": "<string>",
      "name": "<string>",
      "created_at": 123,
      "report_url": "<string>",
      "result_counts": {
        "total": 123,
        "errored": 123,
        "failed": 123,
        "passed": 123
      },
      "per_model_usage": [
        {
          "model_name": "<string>",
          "invocation_count": 123,
          "prompt_tokens": 123,
          "completion_tokens": 123,
          "total_tokens": 123,
          "cached_tokens": 123
        }
      ],
      "per_testing_criteria_results": [
        {
          "testing_criteria": "<string>",
          "passed": 123,
          "failed": 123
        }
      ],
      "data_source": {
        "type": "jsonl",
        "source": {
          "type": "file_content",
          "content": [
            {
              "item": {},
              "sample": {}
            }
          ]
        }
      },
      "metadata": {},
      "error": {
        "code": "<string>",
        "message": "<string>"
      }
    }
  ],
  "first_id": "<string>",
  "last_id": "<string>",
  "has_more": true
}

GET

evals

{eval_id}

runs

Get eval runs

curl --request GET \
  --url https://api.openai.com/v1/evals/{eval_id}/runs \
  --header 'Authorization: Bearer <token>'

{
  "object": "list",
  "data": [
    {
      "object": "eval.run",
      "id": "<string>",
      "eval_id": "<string>",
      "status": "<string>",
      "model": "<string>",
      "name": "<string>",
      "created_at": 123,
      "report_url": "<string>",
      "result_counts": {
        "total": 123,
        "errored": 123,
        "failed": 123,
        "passed": 123
      },
      "per_model_usage": [
        {
          "model_name": "<string>",
          "invocation_count": 123,
          "prompt_tokens": 123,
          "completion_tokens": 123,
          "total_tokens": 123,
          "cached_tokens": 123
        }
      ],
      "per_testing_criteria_results": [
        {
          "testing_criteria": "<string>",
          "passed": 123,
          "failed": 123
        }
      ],
      "data_source": {
        "type": "jsonl",
        "source": {
          "type": "file_content",
          "content": [
            {
              "item": {},
              "sample": {}
            }
          ]
        }
      },
      "metadata": {},
      "error": {
        "code": "<string>",
        "message": "<string>"
      }
    }
  ],
  "first_id": "<string>",
  "last_id": "<string>",
  "has_more": true
}

Authorizations

Authorization

string

header

required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.

Path Parameters

eval_id

string

required

The ID of the evaluation to retrieve runs for.

Query Parameters

after

string

Identifier for the last run from the previous pagination request.

limit

integer

default:20

Number of runs to retrieve.

order

enum<string>

default:asc

Sort order for runs by timestamp. Use asc for ascending order or desc for descending order. Defaults to asc.

Available options:

asc,

desc

status

enum<string>

Filter runs by status. Use "queued" | "in_progress" | "failed" | "completed" | "canceled".

Available options:

queued,

in_progress,

completed,

canceled,

failed

Response

200 - application/json

A list of runs for the evaluation

An object representing a list of runs for an evaluation.

Delete an eval

Create eval run

API

Assistants

Audio

Batch

Chat

Completions

Embeddings

Evals

Files

Fine-tuning

Images

Models

Moderations

API Reference

Audit Logs

Certificates

Usage

Invites

Projects

Users

Realtime

Responses

Uploads

Vector stores

Get eval runs

Authorizations

Path Parameters

Query Parameters

Response