Test and Evaluate the Full System

testingChallengeNovember 20, 2025

Prompt Content

Create a suite of at least 10 diverse test cases, including legitimate transactions (e.g., DEX swap, liquidity add) and potentially risky ones (e.g., suspicious approvals, flash loans). For each test case, provide the expected 'predicted_intent' and 'identified_risks'. Run your full multi-agent system on these cases and compare the output against your expected results. Document any discrepancies and analyze the reasoning paths of your agents. Configure Prefect to monitor these test runs.

Usage Tips

Copy the prompt and paste it into your preferred AI tool (Claude, ChatGPT, Gemini)

Customize placeholder values with your specific requirements and context

For best results, provide clear examples and test different variations