Agent-EvalKit is an open-source toolkit for systematic AI agent evaluation. It integrates with AI coding assistants like Claude Code and frameworks like Amazon Bedrock, offering six distinct evaluation phases to streamline the assessment of agent performance.
Opening Kapyn…