1. Purpose of System

This system is to autonomously review and enhance Python code using a multi-agent architecture. The agents collaborate in a loop to generate, critique, test, and refine code until it meets a predefined quality threshold. The goal is to simulate a collaborative programming environment that continuously improves the code through structured feedback and testing.

2. System Architecture

graph.png

The system is composed of four core agents:

These agents are orchestrated in a LangGraph workflow, which continues iterating until both the critic score and test pass rate surpasses a threshold, or a maximum number of iteration is reached.

3. Used Tools