Dataset examples
examples/<name>/input.json layout with
file references such as { "$file": "input/contract.pdf" }. Legacy archives
with manifest.json, input/arguments.json, expected/output.json, or
expected/error.json are rejected; export a fresh ZIP before re-importing.
Experiments
batchId; API and SDK methods call it an
experimentId.
Scores vs feedback
Evaluatorscore values are automated results. Run feedback rating values are
human review verdicts (pass, fail, or partial). Use run feedback endpoints
when humans correct or promote a real run into the dataset.