Simulate a Long-Running Refactoring Task

testing | Challenge | November 20, 2025

Prompt Content

Set up a simulated codebase (e.g., a small Python project with known refactoring needs) and define a complex, multi-stage refactoring goal (e.g., 'Refactor data access layer to use a repository pattern and improve error handling across the entire API'). Run your full MCP agent system through this task, allowing it to perform multiple iterations over several 'simulated hours' or 'days'. Record the full audit log and the final state of the codebase. Use RAGAS to evaluate the generated code documentation and explanations.
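One way to capture the "full audit log" across simulated hours is a small recorder that stamps each agent iteration with a simulated clock. The following is a minimal sketch, not part of any MCP SDK or of RAGAS: the names `AuditEntry`, `SimulatedRun`, `record`, and `to_jsonl`, the one-hour step, and the file paths are all hypothetical and chosen only for illustration.

```python
import json
from dataclasses import dataclass, field, asdict
from datetime import datetime, timedelta


@dataclass
class AuditEntry:
    """One agent iteration in the audit log (hypothetical schema)."""
    iteration: int
    simulated_time: str
    stage: str
    action: str
    files_touched: list


@dataclass
class SimulatedRun:
    """Tracks a multi-stage refactoring run on a simulated clock."""
    goal: str
    start: datetime
    step: timedelta = timedelta(hours=1)  # assumed: one simulated hour per iteration
    entries: list = field(default_factory=list)

    def record(self, stage, action, files_touched):
        # Each recorded iteration advances the simulated clock by one step.
        iteration = len(self.entries) + 1
        sim_time = self.start + self.step * (iteration - 1)
        entry = AuditEntry(
            iteration, sim_time.isoformat(), stage, action, list(files_touched)
        )
        self.entries.append(entry)
        return entry

    def to_jsonl(self):
        # One JSON object per line, so the log can be diffed or replayed later.
        return "\n".join(json.dumps(asdict(e)) for e in self.entries)


run = SimulatedRun(
    goal="Refactor data access layer to use a repository pattern",
    start=datetime(2025, 1, 1, 9, 0),
)
run.record("analysis", "list modules with inline SQL", ["api/users.py"])
run.record(
    "refactor",
    "extract UserRepository",
    ["api/users.py", "repositories/user.py"],
)
audit_log = run.to_jsonl()
```

In a real run, the agent loop would call `record` after every tool invocation, and the resulting JSONL file would be stored next to a snapshot of the final codebase so both can be reviewed together.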

Usage Tips

Copy the prompt and paste it into your preferred AI tool (Claude, ChatGPT, Gemini)

Replace the example values (the sample project and the refactoring goal) with your specific requirements and context

For best results, provide clear examples and test different variations