Once enough evals have been run, we should suggest prompts that might correct the mistakes the evals surfaced, so that users can re-run an eval and compare the results, helping them improve their prompts.
In Review
Feature requests and bug reports
About 1 month ago

Hervé Labas