Updated 4 months ago

agentic-qe

Name: agentic-qe
Rating: 4.3 (159 reviews)
Author: proffesor-for-testing

proffesor-for-testing/agentic-qe

💡 Summary

Agentic Quality Engineering Fleet automates quality engineering tasks using AI agents for efficient testing and analysis.

🎯 Target Audience

Software Developers, Quality Assurance Engineers, DevOps Professionals, Project Managers, Technical Leads

🤖 AI Roast: “Powerful, but the setup might scare off the impatient.”

Security Analysis: Medium Risk

Risk: Medium. Review: shell/CLI command execution; outbound network access (SSRF, data egress); API keys/tokens handling and storage; dependency pinning and supply-chain risk. Run with least privilege and audit before enabling in production.

Agentic Quality Engineering Fleet

V3 brings Domain-Driven Design architecture, 12 bounded contexts, 51 specialized QE agents, TinyDancer intelligent model routing, ReasoningBank learning with Dream cycles, HNSW vector search, mathematical Coherence verification, full MinCut/Consensus integration across all 12 domains, and deep integration with Claude Flow and Agentic Flow.

What's New in v3.4.2

Skill Validation System - 4-layer trust tiers with schemas, validators, and evaluation suites (ADR-056)
63 QE Skills with Trust Tiers - 46 Tier 3 (Verified), 7 Tier 2, 5 Tier 1, 5 Tier 0 skills
CLI Validation Commands - aqe skill report, aqe eval run, regression detection

What's New in v3.4.0

AG-UI Protocol - CopilotKit's streaming Agent-User Interaction protocol with real-time progress updates
A2A Protocol - Google's agent-to-agent interoperability standard for cross-tool communication
A2UI Components - Unified UI combining AG-UI streaming with A2A event handling

⚡ Quick Start

Install & Initialize

# Install globally
npm install -g agentic-qe

# Initialize your project
cd your-project
aqe init --wizard

# Or with auto-configuration
aqe init --auto

# Add MCP server to Claude Code (pick one)
# Option 1: Global install (recommended after npm install -g)
claude mcp add aqe -- aqe-mcp

# Option 2: Via npx (no global install needed)
claude mcp add aqe -- npx agentic-qe mcp

# Verify connection
claude mcp list

Use from Claude Code CLI

Ask Claude to use QE agents directly from your terminal:

# Generate comprehensive tests with learning
claude "Use qe-test-architect to create tests for src/services/user-service.ts with 95% coverage"

# Run full quality pipeline with Queen coordination
claude "Use qe-queen-coordinator to orchestrate: test generation, coverage analysis, security scan, and quality gate"

# Detect flaky tests with root cause analysis
claude "Use qe-flaky-hunter to analyze the last 100 test runs and stabilize flaky tests"
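
The flaky-test request above depends on run history. As a rough illustration of one common heuristic (not necessarily what qe-flaky-hunter does internally), the sketch below flags a test as flaky when the same commit produced both a pass and a failure. The `TestRun` shape and the `findFlakyTests` name are hypothetical.

```typescript
// Hypothetical record shape; AQE's real run-history format is not shown in this document.
interface TestRun {
  testName: string;
  commit: string;
  passed: boolean;
}

// Flags a test as flaky when one commit produced both a pass and a failure for it.
function findFlakyTests(runs: TestRun[]): string[] {
  const seen = new Map<string, { testName: string; outcomes: Set<boolean> }>();
  for (const run of runs) {
    // Group outcomes by (test, commit) pair.
    const key = JSON.stringify([run.testName, run.commit]);
    let entry = seen.get(key);
    if (!entry) {
      entry = { testName: run.testName, outcomes: new Set<boolean>() };
      seen.set(key, entry);
    }
    entry.outcomes.add(run.passed);
  }
  const flaky = new Set<string>();
  seen.forEach((entry) => {
    // Two distinct outcomes on unchanged code point to nondeterminism.
    if (entry.outcomes.size === 2) flaky.add(entry.testName);
  });
  return Array.from(flaky).sort();
}
```

A test that fails on one commit and passes on another is not flagged, since the code change itself may explain the difference.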

What V3 provides:

✅ 12 DDD Bounded Contexts: Organized by business domain (test-generation, coverage-analysis, security-compliance, etc.)
✅ 51 QE Agents: Including Queen Coordinator for hierarchical orchestration (44 main + 7 TDD subagents)
✅ TinyDancer Model Routing: 3-tier intelligent routing (Haiku/Sonnet/Opus) for cost optimization
✅ ReasoningBank Learning: HNSW-indexed pattern storage with experience replay
✅ O(log n) Coverage Analysis: Sublinear algorithms for efficient gap detection
✅ Claude Flow Integration: Deep integration with MCP tools and swarm orchestration
✅ Memory Coordination: Cross-agent communication via aqe/v3/* namespaces
✅ Coherence Verification (v3.3.0): Mathematical proof of belief consistency using WASM engines
✅ V2 Backward Compatibility: All V2 agents map to V3 equivalents
✅ 63 QE Skills with Trust Tiers: 46 Tier 3 (Verified) skills with full validation suites
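
To make the 3-tier routing idea concrete, here is a minimal sketch of complexity-based model selection. The tier names come from the list above; the `RoutableTask` shape, the `routeToModel` function, and both cutoff values are assumptions for illustration and do not describe TinyDancer's actual scoring.

```typescript
type ModelTier = "haiku" | "sonnet" | "opus";

// Hypothetical task descriptor; how complexity is estimated is outside this sketch.
interface RoutableTask {
  description: string;
  complexity: number; // 0 (trivial) to 1 (hardest)
}

// Picks the cheapest tier whose cutoff the task stays under (cutoffs are illustrative).
function routeToModel(task: RoutableTask, lowCutoff = 0.3, highCutoff = 0.7): ModelTier {
  // The negated range check also rejects NaN.
  if (!(task.complexity >= 0 && task.complexity <= 1)) {
    throw new RangeError("complexity must be within [0, 1]");
  }
  if (task.complexity < lowCutoff) return "haiku";
  if (task.complexity < highCutoff) return "sonnet";
  return "opus";
}
```

Routing simple work to the smallest model is where the cost saving would come from; only tasks above the upper cutoff reach the most expensive tier.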

🚀 Get Value in 60 Seconds

# 1. Install
npm install -g agentic-qe

# 2. Initialize (auto-detects your project, enables all 12 domains)
cd your-project && aqe init --auto

# 3. Generate tests immediately
claude "Generate comprehensive tests for src/services/"

# 4. Run quality assessment
claude "Assess code quality and provide deployment recommendation"

What happens:

Auto-configuration detects your tech stack (TypeScript/JS, testing framework, CI setup)
All 12 DDD domains enabled automatically - no "No factory registered" errors
Pattern learning kicks in - your project's test patterns are learned and reused
AI agents generate tests, analyze coverage, and provide actionable recommendations
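
Pattern reuse of this kind usually comes down to a nearest-neighbor lookup over embeddings. The sketch below uses a brute-force cosine-similarity scan as a stand-in; an HNSW index would replace the linear scan with an approximate search at scale. The `StoredPattern` shape and function names are hypothetical and do not reflect ReasoningBank's storage format.

```typescript
// Hypothetical stored pattern: an identifier plus its embedding vector.
interface StoredPattern {
  id: string;
  embedding: number[];
}

function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) throw new Error("embedding lengths differ");
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  // A zero vector has no direction, so treat it as dissimilar to everything.
  if (normA === 0 || normB === 0) return 0;
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Returns the k stored patterns most similar to the query (linear scan, O(n) per query).
function nearestPatterns(query: number[], patterns: StoredPattern[], k: number): StoredPattern[] {
  return patterns
    .map((pattern) => ({ pattern, score: cosineSimilarity(query, pattern.embedding) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, k)
    .map((entry) => entry.pattern);
}
```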

🎯 Why AQE?

| Problem | AQE Solution |
|---------|--------------|
| Writing comprehensive tests is tedious and time-consuming | AI agents generate tests automatically with pattern reuse across projects |
| Test suites become slow and expensive at scale | Sublinear O(log n) algorithms for coverage analysis and intelligent test selection |
| Flaky tests waste developer time debugging false failures | ML-powered detection with root cause analysis and fix recommendations |
| AI testing tools are expensive | TinyDancer 3-tier model routing reduces costs by matching task complexity to appropriate model |
| No memory between test runs: every analysis starts from scratch | ReasoningBank remembers patterns, strategies, and what works for your codebase |
| Agents waste tokens reading irrelevant code | Code Intelligence provides token reduction with semantic search and knowledge graphs |
| Quality engineering requires complex coordination | Queen Coordinator orchestrates 51 agents across 12 domains with consensus and MinCut topology |
| Tools don't understand your testing frameworks | Works with Jest, Cypress, Playwright, Vitest, Mocha, Jasmine, AVA |
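
The source does not describe how AQE achieves O(log n) coverage analysis. Purely to show what a logarithmic coverage lookup can look like, the sketch below binary-searches a sorted list of covered line ranges. `LineRange` and `isLineCovered` are hypothetical names, and the input is assumed to be sorted and non-overlapping.

```typescript
// Inclusive range of covered source lines.
interface LineRange {
  start: number;
  end: number;
}

// Binary search over sorted, non-overlapping ranges: O(log n) per lookup.
function isLineCovered(sortedRanges: LineRange[], line: number): boolean {
  let lo = 0;
  let hi = sortedRanges.length - 1;
  while (lo <= hi) {
    const mid = (lo + hi) >>> 1;
    const range = sortedRanges[mid];
    if (line < range.start) {
      hi = mid - 1; // target lies before this range
    } else if (line > range.end) {
      lo = mid + 1; // target lies after this range
    } else {
      return true; // target falls inside this range
    }
  }
  return false;
}
```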

✨ V3 Features

🏗️ Domain-Driven Design Architecture

V3 is built on 12 DDD Bounded Contexts, each with dedicated agents and clear responsibilities:

| Domain | Purpose | Key Agents |
|--------|---------|------------|
| test-generation | AI-powered test creation | qe-test-architect, qe-tdd-specialist |
| test-execution | Parallel execution & retry | qe-parallel-executor, qe-retry-handler |
| coverage-analysis | O(log n) gap detection | qe-coverage-specialist, qe-gap-detector |
| quality-assessment | Quality gates & decisions | qe-quality-gate, qe-risk-assessor |
| defect-intelligence | Prediction & root cause | qe-defect-predictor, qe-root-cause-analyzer |
| requirements-validation | BDD & testability | qe-requirements-validator, qe-bdd-generator |
| code-intelligence | Knowledge graph & search | qe-code-intelligence, qe-kg-builder |
| security-compliance | SAST/DAST & audit | qe-security-scanner, qe-security-auditor |
| contract-testing | API contracts & GraphQL | qe-contract-validator, qe-graphql-tester |
| visual-accessibility | Visual regression & a11y | qe-visual-tester, qe-accessibility-auditor |
| chaos-resilience | Chaos engineering & load | qe-chaos-engineer, qe-load-tester |
| learning-optimization | Cross-domain learning | qe-learning-coordinator, qe-pattern-learner |
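
The domain-to-agent mapping in the table can be expressed as a typed lookup, which is handy when scripting around the fleet. The domain and agent names below are copied from the table; the `DOMAIN_AGENTS` constant and `domainOfAgent` helper are hypothetical and are not part of the agentic-qe API.

```typescript
// Domain and key-agent names are taken from the table; the structure itself is illustrative.
const DOMAIN_AGENTS = {
  "test-generation": ["qe-test-architect", "qe-tdd-specialist"],
  "test-execution": ["qe-parallel-executor", "qe-retry-handler"],
  "coverage-analysis": ["qe-coverage-specialist", "qe-gap-detector"],
  "quality-assessment": ["qe-quality-gate", "qe-risk-assessor"],
  "defect-intelligence": ["qe-defect-predictor", "qe-root-cause-analyzer"],
  "requirements-validation": ["qe-requirements-validator", "qe-bdd-generator"],
  "code-intelligence": ["qe-code-intelligence", "qe-kg-builder"],
  "security-compliance": ["qe-security-scanner", "qe-security-auditor"],
  "contract-testing": ["qe-contract-validator", "qe-graphql-tester"],
  "visual-accessibility": ["qe-visual-tester", "qe-accessibility-auditor"],
  "chaos-resilience": ["qe-chaos-engineer", "qe-load-tester"],
  "learning-optimization": ["qe-learning-coordinator", "qe-pattern-learner"],
} as const;

type QeDomain = keyof typeof DOMAIN_AGENTS;

// Reverse lookup: which bounded context lists the given agent as a key agent?
function domainOfAgent(agent: string): QeDomain | undefined {
  const domains = Object.keys(DOMAIN_AGENTS) as QeDomain[];
  return domains.find((domain) => {
    const agents: readonly string[] = DOMAIN_AGENTS[domain];
    return agents.indexOf(agent) !== -1;
  });
}
```

Only the two key agents per domain are listed here, so the map covers 24 of the 51 agents.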

✅ Skill Trust Tiers (v3.4.2)

All 63 QE skills are categorized by trust tier based on their validation infrastructure:

| Tier | Count | Description |
|------|-------|-------------|
| Tier 3 - Verified | 46 | Full evaluation test suite |
| Tier 2 - Validated | 7 | Has executable validator |
| Tier 1 - Structured | 5 | Has JSON output schema |
| Tier 0 - Advisory | 5 | SKILL.md guidance only |

Tier 3 Skills are recommended for production use - they have:

JSON Schema validation for output structure
Executable validator scripts for correctness
Evaluation test suites with multi-model testing
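
The tier rules can be read as a simple classification over which validation artifacts a skill ships. The sketch below assumes the tiers are cumulative (a Tier 2 skill also has a schema, and so on), which the Tier 3 list suggests but the source does not state outright. `SkillArtifacts` and `trustTier` are hypothetical names.

```typescript
// Which validation artifacts a skill provides (hypothetical shape).
interface SkillArtifacts {
  hasOutputSchema: boolean; // JSON Schema for output structure
  hasValidator: boolean; // executable validator script
  hasEvalSuite: boolean; // evaluation test suite
}

type TrustTier = 0 | 1 | 2 | 3;

// Assumes cumulative tiers: each tier requires everything the tier below it has.
function trustTier(skill: SkillArtifacts): TrustTier {
  if (!skill.hasOutputSchema) return 0; // Advisory: SKILL.md guidance only
  if (!skill.hasValidator) return 1; // Structured
  if (!skill.hasEvalSuite) return 2; // Validated
  return 3; // Verified
}
```

Under this reading, a skill with an evaluation suite but no output schema would still be classified as Tier 0.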

# Check skill trust tier
aqe eval status --skill security-testing

# Run skill evaluation
aqe eval run --skill security-testing --model claude-sonnet-4

# View all trust tiers
cat .claude/skills/TRUST-TIERS.md

Full documentation: docs/guides/skill-validation.md

🌐 Browser Automation Integration (v3.1.0)

V3.1.0 adds full browser automation support via @claude-flow/browser integration:

| Component | Description |
|-----------|-------------|
| BrowserSwarmCoordinator | Parallel multi-viewport testing (4x faster) |
| BrowserSecurityScanner | URL validation, PII detection with auto-masking |
| 9 Workflow Templates | YAML-based reusable browser workflows |
| TrajectoryAdapter | SONA learning integration with HNSW indexing |
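
As a rough idea of what URL validation and PII auto-masking involve, the sketch below allows only http(s) URLs and masks email addresses in captured text. It is a simplified illustration, not BrowserSecurityScanner's implementation; the function names and the `[EMAIL]` placeholder are assumptions, and real PII detection covers far more than email addresses.

```typescript
// Simplified email matcher; real-world PII detection needs many more patterns.
const EMAIL_PATTERN = /[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}/g;

// Replaces every email address in the text with a fixed placeholder.
function maskEmails(text: string): string {
  return text.replace(EMAIL_PATTERN, "[EMAIL]");
}

// Accepts only http and https URLs with a non-empty host part.
function isAllowedUrl(candidate: string): boolean {
  return /^https?:\/\/[^\s/]+/i.test(candidate);
}
```

Rejecting schemes such as `javascript:` and `file:` up front keeps a browser agent from being pointed at script payloads or local files.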

Available Workflow Templates:

login-flow, oauth-flow - Authentication testing
form-validation, navigation-flow - User journey testing
visual-regression, accessibility-audit - Quality validation
performance-audit, api-integration, scraping-workflow - Advanced workflows

# Use browser automation from Claude Code
claude "Use security-visual-testing skill to test https://example.com across mobile, tablet, desktop viewports"

# Load and execute a workflow template
aqe workflow load login-flow --vars '{"username": "test", "password": "secret"}'

👑 Queen Coordinator & Hierarchical Orchestration

The qe-queen-coordinator manages the entire fleet with intelligent task distribution:

                    qe-queen-coordinator
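
Hierarchical orchestration of this kind can be pictured as a coordinator that owns a registry of domain agents and dispatches each task to the agent registered for its domain. The sketch below is a deliberately small, synchronous illustration; `QueenSketch`, `FleetTask`, and `FleetResult` are hypothetical names, and the real qe-queen-coordinator (with consensus and MinCut topology) is not described in enough detail here to reproduce.

```typescript
interface FleetTask {
  domain: string;
  payload: string;
}

interface FleetResult {
  domain: string;
  agent: string;
  output: string;
}

type AgentFn = (payload: string) => string;

// Minimal coordinator: one registered agent per domain, tasks dispatched in order.
class QueenSketch {
  private readonly agents = new Map<string, { name: string; run: AgentFn }>();

  register(domain: string, name: string, run: AgentFn): void {
    this.agents.set(domain, { name, run });
  }

  dispatch(tasks: FleetTask[]): FleetResult[] {
    return tasks.map((task) => {
      const agent = this.agents.get(task.domain);
      // Fail fast when a task targets a domain nobody has registered for.
      if (!agent) throw new Error(`No agent registered for domain "${task.domain}"`);
      return { domain: task.domain, agent: agent.name, output: agent.run(task.payload) };
    });
  }
}
```

A usage example would register, say, `qe-test-architect` for `test-generation` and `qe-quality-gate` for `quality-assessment`, then pass both tasks to `dispatch` in pipeline order.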

5-Dim Analysis

Clarity: 8/10
Novelty: 9/10
Utility: 9/10
Completeness: 9/10
Maintainability: 8/10

Pros & Cons

Pros

Automates test generation and analysis.
Supports multiple testing frameworks.
Utilizes advanced AI for efficiency.
Offers a structured approach to quality engineering.

Cons

Complex setup process for new users.
May require familiarity with AI concepts.
Dependency on external tools like Claude.
Potential performance overhead with AI processing.

Disclaimer: This content is sourced from GitHub open source projects for display and rating purposes only.