This system autonomously reviews and enhances Python code using a multi-agent architecture. The agents collaborate in a loop to generate, critique, test, and refine code until it meets a predefined quality threshold. The goal is to simulate a collaborative programming environment that improves the code on every iteration through structured feedback and testing.
The system is composed of four core agents:
These agents are orchestrated in a LangGraph workflow, which keeps iterating until both the critic score and the test pass rate surpass a threshold, or until a maximum number of iterations is reached.
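
The stopping rule described above can be sketched in plain Python. This is only an illustration of the control flow, not the project's actual LangGraph graph: `ReviewState`, `should_stop`, `run_loop`, `fake_step`, and the threshold values (0.8, 0.9, 5 iterations) are all hypothetical names and numbers chosen for the example. In a LangGraph workflow, a check like `should_stop` would typically sit in the routing function of a conditional edge.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class ReviewState:
    """Hypothetical shared state passed between the agents."""
    code: str
    critic_score: float = 0.0    # assumed to be in the range 0.0-1.0
    test_pass_rate: float = 0.0  # fraction of tests passing, 0.0-1.0
    iteration: int = 0


def should_stop(
    state: ReviewState,
    score_threshold: float = 0.8,  # assumed value, not from the source
    pass_threshold: float = 0.9,   # assumed value, not from the source
    max_iterations: int = 5,       # assumed value, not from the source
) -> bool:
    """Stop when BOTH metrics surpass their thresholds, or the budget is spent."""
    quality_met = (
        state.critic_score > score_threshold
        and state.test_pass_rate > pass_threshold
    )
    return quality_met or state.iteration >= max_iterations


def run_loop(
    state: ReviewState,
    step: Callable[[ReviewState], ReviewState],
    **limits: float,
) -> ReviewState:
    """Apply one generate/critique/test/refine round until should_stop is true."""
    while not should_stop(state, **limits):
        state = step(state)
        state.iteration += 1
    return state


def fake_step(state: ReviewState) -> ReviewState:
    """Stand-in for the real agents: each round raises both metrics by 0.25."""
    state.critic_score = min(1.0, state.critic_score + 0.25)
    state.test_pass_rate = min(1.0, state.test_pass_rate + 0.25)
    return state


if __name__ == "__main__":
    final = run_loop(ReviewState(code="def f(): pass"), fake_step)
    print(final.iteration, final.critic_score, final.test_pass_rate)
```

Keeping the iteration cap in the same predicate as the quality check means the loop always terminates, even if the metrics never reach their thresholds.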