Test Adaptive Reasoning with Failure Injection

testing | Challenge | November 11, 2025

Prompt Content

Using Gemini 2.5 Pro, test the agent's 'extended thinking' capability by injecting a simulated failure during tool execution (e.g., a tool returns an error for invalid parameters). Observe whether the agent adapts its plan or asks for clarification, and document its decision-making process in response to the failure.
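One way to set up the simulated failure is to wrap a tool so that its first call raises an error, then record what the agent does next. The sketch below is a minimal illustration, not a definitive harness. `FailureInjector`, `ToolError`, `get_weather`, and `stub_agent` are all hypothetical names introduced here. `stub_agent` is only a stand-in for a real model-driven agent loop and makes no calls to any model API.

```python
from dataclasses import dataclass, field
from typing import Any, Callable


class ToolError(Exception):
    """Raised by a tool to signal a simulated failure (hypothetical type)."""


@dataclass
class FailureInjector:
    """Wraps a tool and makes its first `fail_times` calls fail."""
    tool: Callable[..., Any]
    fail_times: int = 1
    message: str = "invalid parameters"
    log: list = field(default_factory=list)

    def __call__(self, **kwargs: Any) -> Any:
        if self.fail_times > 0:
            # Inject the failure and record it for later documentation.
            self.fail_times -= 1
            self.log.append(("error", kwargs, self.message))
            raise ToolError(self.message)
        result = self.tool(**kwargs)
        self.log.append(("ok", kwargs, result))
        return result


def get_weather(city: str) -> str:
    # Hypothetical tool used only for illustration.
    return f"Sunny in {city}"


def stub_agent(tool: Callable[..., Any], max_attempts: int = 3):
    """Stand-in for the real agent: retries after a tool error.

    A real agent might instead change its parameters or ask the user
    for clarification; that is the behavior the test is meant to observe.
    """
    trace = []
    for attempt in range(1, max_attempts + 1):
        try:
            result = tool(city="Paris")
            trace.append(f"attempt {attempt}: success")
            return result, trace
        except ToolError as exc:
            trace.append(f"attempt {attempt}: tool error ({exc}); revising plan")
    return None, trace


flaky_weather = FailureInjector(get_weather, fail_times=1)
result, trace = stub_agent(flaky_weather)
for line in trace:
    print(line)
```

The `log` on the wrapper and the `trace` from the agent together give a record of the failure and the response to it, which can serve as raw material for documenting the agent's decision-making.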

Usage Tips

Copy the prompt and paste it into your preferred AI tool (Claude, ChatGPT, Gemini)

Customize the specifics, such as the simulated failure and the model name, to match your own requirements and context

For best results, provide clear examples and test different variations