- TypeScript
- Python
evaluate(options)
Run an evaluation against a dataset.| Name | Type | Default | Description |
|---|---|---|---|
data | EvaluationDataset | Datapoint[] | — | Dataset or array |
executor | (data, ...args) => any | — | Function to evaluate |
evaluators | Record<string, Function or HumanEvaluator> | — | Scoring functions |
name | string | — | Evaluation name |
groupName | string | 'default' | Group name |
metadata | Record<string, any> | — | Evaluation metadata |
config.concurrencyLimit | number | 5 | Parallel executions (min 1) |
config.projectApiKey | string | env | API key |
config.traceExportBatchSize | number | 64 | Batch size |
Promise<EvaluationRunResult | undefined>If invoked in “prepare only” mode, returns undefined.Return shape:LaminarDataset
Load a dataset from Laminar.| Name | Type | Default | Description |
|---|---|---|---|
name | string | — | Dataset name (or use id) |
id | string | — | Dataset ID (or use name) |
fetchSize | number | 25 | Datapoints per fetch |
size()— Returns number of datapointsget(index)— Get datapoint by indexslice(start, end)— Get range of datapointspush(paths, recursive?)— Upload local files
name or id.EvaluationDataset
Abstract base class for custom datasets.size(): Promise<number> | numberget(index: number): Promise<Datapoint> | Datapoint
slice(start, end)— Helper usingget()
HumanEvaluator
Placeholder for human evaluation.HUMAN_EVALUATOR and null scores for later human annotation.