Setup for Evaluations
Create datasets for prompt evaluations
To create datasets for evaluations, you need to correctly set up columns that match your prompt’s variables. There are two methods to make datasets ready for evaluations.
Manual Setup
Start with an empty dataset and create columns manually:
Create an empty dataset
Click the ‘add’ button (+) on the project you want to create a dataset in. Once created, this dataset is available to all resources in the project.
Create and rename columns
-
Click “Add column” to create a new column. Double-click the column name to rename it. Your dataset must have at least the columns exactly matching the variable names in your prompt to run evaluations.
-
Column names must match your prompt variables precisely. If your prompt uses
{{customer_name}}
, create a column named “customer_name”.
Create rows
Create rows for each of the test cases you want to run evaluations on.
Link dataset to evaluator
Navigate to the evaluator you want to link this dataset to. Click on ‘Select dataset’ and choose your newly created dataset. If you don’t see the dataset, double-check that the column names match exactly as variables in your prompt.
Edit anywhere
Once linked, you can edit the dataset from either the variable editor or the dataset page.
Automated Setup
Choose your evaluator
Choose the evaluator you want to link this dataset with.
Create a new dataset
Click “Create new dataset” from the dataset selection dropdown.
Auto-configured columns
A new dataset is created with matching column names as per the variables in your playground.
Create rows
Create rows for each of your test cases you want to run evaluations on.