Evaluate Recommendation Quality and Efficiency

evaluation | Challenge | October 9, 2025

Prompt Content

Run the full AutoGen system against a set of simulated products and market scenarios. Evaluate the accuracy of its pricing recommendations against a 'gold standard' of optimal prices. Measure response latency and record how often the system uses 'instant' versus 'deep' reasoning modes, then correlate that distribution with the complexity of the market changes. Document how adaptive thinking budgets were used to optimize token consumption and speed.
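As a rough illustration of the measurements the prompt asks for, the sketch below summarizes a batch of already-collected run records. It is not part of AutoGen: the `RunRecord` fields, the `summarize` helper, the 0-to-1 complexity score, and the sample numbers are all hypothetical, and mean absolute percentage error is just one possible accuracy measure against the gold standard.

```python
from collections import Counter
from dataclasses import dataclass
from statistics import mean, median


@dataclass
class RunRecord:
    """One evaluated scenario. Every field name here is an assumption."""
    recommended_price: float  # price the system proposed
    gold_price: float         # 'gold standard' optimal price
    latency_s: float          # wall-clock response time in seconds
    mode: str                 # "instant" or "deep"
    complexity: float         # assumed score: 0.0 (stable) to 1.0 (volatile)
    tokens_used: int          # tokens consumed for this scenario


def summarize(records):
    """Aggregate accuracy, latency, and per-mode statistics."""
    # Absolute percentage error of each recommendation vs. the gold price.
    errors = [abs(r.recommended_price - r.gold_price) / r.gold_price
              for r in records]
    by_mode = {}
    for mode in Counter(r.mode for r in records):
        subset = [r for r in records if r.mode == mode]
        by_mode[mode] = {
            "share": len(subset) / len(records),
            "mean_complexity": mean(r.complexity for r in subset),
            "mean_latency_s": mean(r.latency_s for r in subset),
            "mean_tokens": mean(r.tokens_used for r in subset),
        }
    return {
        "mape": mean(errors),
        "median_latency_s": median(r.latency_s for r in records),
        "by_mode": by_mode,
    }


# Made-up sample data, for illustration only.
records = [
    RunRecord(10.0, 10.0, 0.5, "instant", 0.1, 200),
    RunRecord(22.0, 20.0, 0.7, "instant", 0.2, 300),
    RunRecord(45.0, 50.0, 4.0, "deep", 0.8, 2000),
    RunRecord(108.0, 100.0, 6.0, "deep", 0.9, 3000),
]
summary = summarize(records)
print(summary)
```

Comparing `mean_complexity` across the two modes gives a first, coarse view of whether 'deep' reasoning is being reserved for the more complex market changes; a fuller analysis might bucket scenarios by complexity instead.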

Usage Tips

Copy the prompt and paste it into your preferred AI tool.

Customize the prompt by replacing placeholder values with your specific requirements.

For best results, provide clear context and examples when using this prompt.