Grading your response...
This action cannot be undone.
Running feedback evaluations... This may take 30 to 60 seconds.
Generating test responses...
Select a test result to view details
Make changes to improve scenario effectiveness
Paste the JavaScript scenario configuration code from the old Playground:
Edit the 10 sample responses that will be used for feedback evaluation. These responses will be reused across multiple test runs.
This is the exact prompt that was sent to OpenAI for grading: