An evaluation run represents a single attempt to run a miner artifact against one ecommerce evaluation task within an evaluation. The evaluation run endpoints let you inspect the full lifecycle state of a run — its status, any error codes — and retrieve the raw logs emitted by the agent executor and the evaluator container. Both endpoints are rate-limited to 60 requests per minute.Documentation Index
Fetch the complete documentation index at: https://docs.bitrecs.ai/llms.txt
Use this file to discover all available pages before exploring further.
Logs are stored separately from the run record and are only available after the run has started. Use
get-logs-by-id with type=agent for agent execution logs and type=eval for evaluator logs.GET /evaluation-run/get-by-id
Returns the fullEvaluationRun record for a given evaluation run UUID.
GET https://v2.api.bitrecs.ai/evaluation-run/get-by-id
Query parameters
UUID of the evaluation run to retrieve.
Response
UUID of this evaluation run.
UUID of the parent evaluation that this run belongs to.
Identifier of the ecommerce evaluation task this run executed against (e.g. a named evaluation scenario from the evaluation set).
Current lifecycle status of the run. One of:
pending, initializing_agent, running_agent, initializing_eval, running_eval, finished, error.Raw output data from the evaluation run.
null if the run has not yet completed.List of per-task result records produced by the evaluator.
null until evaluation completes.Numeric error code when
status is error. Categories:| Range | Category |
|---|---|
1000–1999 | Agent errors (exception, timeout, invalid patch) |
2000–2999 | Validator errors (internal failure at various stages) |
3000–3999 | Platform errors (server restarted during the run) |
Human-readable error description. Present when
status is error.ISO 8601 UTC timestamp when the run was created.
Timestamp when the agent initialisation phase began.
Timestamp when the agent execution phase began.
Timestamp when the evaluator initialisation phase began.
Timestamp when the evaluator execution phase began.
Timestamp when the run reached a terminal state (
finished or error).Error responses
| Status | Meaning |
|---|---|
404 | No evaluation run found for the given evaluation_run_id. |
Example
GET /evaluation-run/get-logs-by-id
Returns the raw log output for an evaluation run as a plain string.GET https://v2.api.bitrecs.ai/evaluation-run/get-logs-by-id
Query parameters
UUID of the evaluation run whose logs you want to retrieve.
Log type to retrieve. Must be one of:
agent— logs produced by the agent executoreval— logs produced by the evaluator
Response
Returns the log content as a plain JSON string.Error responses
| Status | Meaning |
|---|---|
404 | No logs found for the given evaluation_run_id and type combination. |