All agents scored on 7 architectural dimensions. Scores are based on published benchmarks, official documentation, and community data. ยฑ1 point tolerance applies to all scores โ they reflect best-effort research, not hard metrics.
Click [src] to expand sources for each score.
Each dimension measures a specific aspect of agent architecture quality. Max score varies by dimension importance.
Task decomposition, parallel sub-agents, Coordinator mode, agent isolation. Higher impact on complex multi-file tasks. Sources: Requesty 2026, Dev.to 2026, Packmind matrix.
Cross-session persistence, context window size, memory types, retrieval quality, auto-consolidation.
Number and quality of tools, MCP support, lifecycle management, extensibility. Source: Packmind coding-agents-matrix, MCP Atlas leaderboard.
Token optimization, prompt cache strategy, BYOK flexibility, cost efficiency, cache-break tracking.
Permission chain depth, sandboxing, side-model classification, command vetting, attestation. Sources: Anthropic docs, OpenAI Codex sandbox docs.
Error handling, retry logic, timeout management, git-based recovery, graceful degradation.
GitHub stars, update frequency, documentation quality, plugin/MCP ecosystem, community responsiveness.