Co-Pilot
Updated 24 days ago

agentic-qe

proffesor-for-testing
0.2k
proffesor-for-testing/agentic-qe
86
Agent Score

💡 Summary

Agentic Quality Engineering Fleet automates quality engineering tasks using AI agents for efficient testing and analysis.

🎯 Target Audience

Software Developers, Quality Assurance Engineers, DevOps Professionals, Project Managers, Technical Leads

🤖 AI Roast: Powerful, but the setup might scare off the impatient.

Security Analysis: Medium Risk

Risk: Medium. Review: shell/CLI command execution; outbound network access (SSRF, data egress); API keys/tokens handling and storage; dependency pinning and supply-chain risk. Run with least privilege and audit before enabling in production.

Agentic Quality Engineering Fleet

npm version License: MIT TypeScript Node.js

V3 brings Domain-Driven Design architecture, 12 bounded contexts, 51 specialized QE agents, TinyDancer intelligent model routing, ReasoningBank learning with Dream cycles, HNSW vector search, mathematical Coherence verification, full MinCut/Consensus integration across all 12 domains, and deep integration with Claude Flow and Agentic Flow.

What's New in v3.4.2

  • Skill Validation System - 4-layer trust tiers with schemas, validators, and evaluation suites (ADR-056)
  • 63 QE Skills with Trust Tiers - 46 Tier 3 (Verified), 7 Tier 2, 5 Tier 1, 5 Tier 0 skills
  • CLI Validation Commands - aqe skill report, aqe eval run, regression detection

What's New in v3.4.0

  • AG-UI Protocol - CopilotKit's streaming agent-to-user interaction protocol with real-time progress updates
  • A2A Protocol - Google's agent-to-agent interoperability standard for cross-tool communication
  • A2UI Components - Unified UI combining AG-UI streaming with A2A event handling

🏗️ DDD Architecture | 🧠 ReasoningBank + Dream Cycles | 🎯 TinyDancer Model Routing | 🔍 HNSW Vector Search | 👑 Queen Coordinator | 📊 O(log n) Coverage | 🔗 Claude Flow Integration | 🎯 12 Bounded Contexts | 📚 63 QE Skills | 🧬 Coherence Verification | ✅ Trust Tiers


⚡ Quick Start

Install & Initialize

    # Install globally
    npm install -g agentic-qe

    # Initialize your project
    cd your-project
    aqe init --wizard

    # Or with auto-configuration
    aqe init --auto

    # Add MCP server to Claude Code (pick one)
    # Option 1: Global install (recommended after npm install -g)
    claude mcp add aqe -- aqe-mcp

    # Option 2: Via npx (no global install needed)
    claude mcp add aqe -- npx agentic-qe mcp

    # Verify connection
    claude mcp list

Use from Claude Code CLI

Ask Claude to use QE agents directly from your terminal:

    # Generate comprehensive tests with learning
    claude "Use qe-test-architect to create tests for src/services/user-service.ts with 95% coverage"

    # Run full quality pipeline with Queen coordination
    claude "Use qe-queen-coordinator to orchestrate: test generation, coverage analysis, security scan, and quality gate"

    # Detect flaky tests with root cause analysis
    claude "Use qe-flaky-hunter to analyze the last 100 test runs and stabilize flaky tests"

What V3 provides:

  • 12 DDD Bounded Contexts: Organized by business domain (test-generation, coverage-analysis, security-compliance, etc.)
  • 51 QE Agents: Including Queen Coordinator for hierarchical orchestration (44 main + 7 TDD subagents)
  • TinyDancer Model Routing: 3-tier intelligent routing (Haiku/Sonnet/Opus) for cost optimization
  • ReasoningBank Learning: HNSW-indexed pattern storage with experience replay
  • O(log n) Coverage Analysis: Sublinear algorithms for efficient gap detection
  • Claude Flow Integration: Deep integration with MCP tools and swarm orchestration
  • Memory Coordination: Cross-agent communication via aqe/v3/* namespaces
  • Coherence Verification (v3.3.0): Mathematical proof of belief consistency using WASM engines
  • V2 Backward Compatibility: All V2 agents map to V3 equivalents
  • 63 QE Skills with Trust Tiers: 46 Tier 3 (Verified) skills with full validation suites
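
The 3-tier routing idea in the list above can be illustrated with a minimal sketch. This is not TinyDancer's actual API: the `QeTask` shape, the `complexityScore` heuristic, and the 0.3 / 0.7 thresholds are all assumptions made up for illustration. It only shows the general pattern of sending cheap tasks to a small model and escalating as complexity grows.

```typescript
// Hypothetical sketch of 3-tier model routing. All names and thresholds
// here are illustrative assumptions, not TinyDancer's real interface.
type ModelTier = "haiku" | "sonnet" | "opus";

interface QeTask {
  description: string;
  filesTouched: number;    // how many source files the task must read
  needsReasoning: boolean; // e.g. root cause analysis vs. a simple lookup
}

// Score complexity on a 0..1 scale from two cheap, observable signals.
function complexityScore(task: QeTask): number {
  const sizeSignal = Math.min(task.filesTouched / 20, 1); // saturates at 20 files
  const reasoningSignal = task.needsReasoning ? 1 : 0;
  return 0.5 * sizeSignal + 0.5 * reasoningSignal;
}

// Map the score onto one of the three tiers (assumed cut-offs).
function routeModel(task: QeTask): ModelTier {
  const score = complexityScore(task);
  if (score < 0.3) return "haiku";
  if (score < 0.7) return "sonnet";
  return "opus";
}
```

Under these assumed thresholds, a one-file formatting check would go to the cheapest tier, while a reasoning-heavy task spanning many files would be escalated to the largest model.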

🚀 Get Value in 60 Seconds

    # 1. Install
    npm install -g agentic-qe

    # 2. Initialize (auto-detects your project, enables all 12 domains)
    cd your-project && aqe init --auto

    # 3. Generate tests immediately
    claude "Generate comprehensive tests for src/services/"

    # 4. Run quality assessment
    claude "Assess code quality and provide deployment recommendation"

What happens:

  1. Auto-configuration detects your tech stack (TypeScript/JS, testing framework, CI setup)
  2. All 12 DDD domains enabled automatically - no "No factory registered" errors
  3. Pattern learning kicks in - your project's test patterns are learned and reused
  4. AI agents generate tests, analyze coverage, and provide actionable recommendations

🎯 Why AQE?

| Problem | AQE Solution |
|---------|--------------|
| Writing comprehensive tests is tedious and time-consuming | AI agents generate tests automatically with pattern reuse across projects |
| Test suites become slow and expensive at scale | Sublinear O(log n) algorithms for coverage analysis and intelligent test selection |
| Flaky tests waste developer time debugging false failures | ML-powered detection with root cause analysis and fix recommendations |
| AI testing tools are expensive | TinyDancer 3-tier model routing reduces costs by matching task complexity to appropriate model |
| No memory between test runs; every analysis starts from scratch | ReasoningBank remembers patterns, strategies, and what works for your codebase |
| Agents waste tokens reading irrelevant code | Code Intelligence provides token reduction with semantic search and knowledge graphs |
| Quality engineering requires complex coordination | Queen Coordinator orchestrates 51 agents across 12 domains with consensus and MinCut topology |
| Tools don't understand your testing frameworks | Works with Jest, Cypress, Playwright, Vitest, Mocha, Jasmine, AVA |


✨ V3 Features

🏗️ Domain-Driven Design Architecture

V3 is built on 12 DDD Bounded Contexts, each with dedicated agents and clear responsibilities:

| Domain | Purpose | Key Agents |
|--------|---------|------------|
| test-generation | AI-powered test creation | qe-test-architect, qe-tdd-specialist |
| test-execution | Parallel execution & retry | qe-parallel-executor, qe-retry-handler |
| coverage-analysis | O(log n) gap detection | qe-coverage-specialist, qe-gap-detector |
| quality-assessment | Quality gates & decisions | qe-quality-gate, qe-risk-assessor |
| defect-intelligence | Prediction & root cause | qe-defect-predictor, qe-root-cause-analyzer |
| requirements-validation | BDD & testability | qe-requirements-validator, qe-bdd-generator |
| code-intelligence | Knowledge graph & search | qe-code-intelligence, qe-kg-builder |
| security-compliance | SAST/DAST & audit | qe-security-scanner, qe-security-auditor |
| contract-testing | API contracts & GraphQL | qe-contract-validator, qe-graphql-tester |
| visual-accessibility | Visual regression & a11y | qe-visual-tester, qe-accessibility-auditor |
| chaos-resilience | Chaos engineering & load | qe-chaos-engineer, qe-load-tester |
| learning-optimization | Cross-domain learning | qe-learning-coordinator, qe-pattern-learner |
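
To make the domain-to-agent mapping concrete, here is a small sketch of a lookup table built from the rows above. The `DOMAIN_AGENTS` map and the `agentsFor` helper are hypothetical names used only for illustration; the real fleet presumably registers agents through its own factories, which this does not attempt to reproduce.

```typescript
// Hypothetical lookup built from the domain table above. The map and the
// helper name are illustrative, not part of the agentic-qe API.
const DOMAIN_AGENTS = new Map<string, readonly string[]>([
  ["test-generation", ["qe-test-architect", "qe-tdd-specialist"]],
  ["test-execution", ["qe-parallel-executor", "qe-retry-handler"]],
  ["coverage-analysis", ["qe-coverage-specialist", "qe-gap-detector"]],
  ["quality-assessment", ["qe-quality-gate", "qe-risk-assessor"]],
  ["defect-intelligence", ["qe-defect-predictor", "qe-root-cause-analyzer"]],
  ["requirements-validation", ["qe-requirements-validator", "qe-bdd-generator"]],
  ["code-intelligence", ["qe-code-intelligence", "qe-kg-builder"]],
  ["security-compliance", ["qe-security-scanner", "qe-security-auditor"]],
  ["contract-testing", ["qe-contract-validator", "qe-graphql-tester"]],
  ["visual-accessibility", ["qe-visual-tester", "qe-accessibility-auditor"]],
  ["chaos-resilience", ["qe-chaos-engineer", "qe-load-tester"]],
  ["learning-optimization", ["qe-learning-coordinator", "qe-pattern-learner"]],
]);

// Return the key agents for a domain, failing loudly on an unknown name.
function agentsFor(domain: string): readonly string[] {
  const agents = DOMAIN_AGENTS.get(domain);
  if (agents === undefined) {
    throw new Error(`Unknown domain: ${domain}`);
  }
  return agents;
}
```

Failing loudly on an unknown domain is a design choice in this sketch: a typo in a domain name surfaces immediately instead of silently routing work nowhere.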


✅ Skill Trust Tiers (v3.4.2)

All 63 QE skills are categorized by trust tier based on their validation infrastructure:

| Tier | Badge | Count | Description |
|------|-------|-------|-------------|
| Tier 3 - Verified | Tier 3 | 46 | Full evaluation test suite |
| Tier 2 - Validated | Tier 2 | 7 | Has executable validator |
| Tier 1 - Structured | Tier 1 | 5 | Has JSON output schema |
| Tier 0 - Advisory | Tier 0 | 5 | SKILL.md guidance only |

Tier 3 Skills are recommended for production use - they have:

  • JSON Schema validation for output structure
  • Executable validator scripts for correctness
  • Evaluation test suites with multi-model testing

    # Check skill trust tier
    aqe eval status --skill security-testing

    # Run skill evaluation
    aqe eval run --skill security-testing --model claude-sonnet-4

    # View all trust tiers
    cat .claude/skills/TRUST-TIERS.md

[Full documentation: docs/guides/skill-validation.md]
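
The tier definitions can be read as a simple classification rule. The sketch below assumes the tiers are cumulative (a validator implies a schema, an evaluation suite implies both), which matches the Tier 3 description but is not stated explicitly for Tiers 1 and 2. The `SkillArtifacts` interface and `trustTier` function are hypothetical names, not the project's validation code.

```typescript
// Hypothetical classification of a skill by its validation artifacts.
// Assumes tiers are cumulative; the field names are illustrative only.
interface SkillArtifacts {
  hasOutputSchema: boolean; // JSON Schema for output structure
  hasValidator: boolean;    // executable validator script
  hasEvalSuite: boolean;    // evaluation test suite
}

type TrustTier = 0 | 1 | 2 | 3;

function trustTier(skill: SkillArtifacts): TrustTier {
  if (skill.hasOutputSchema && skill.hasValidator && skill.hasEvalSuite) return 3; // Verified
  if (skill.hasOutputSchema && skill.hasValidator) return 2; // Validated
  if (skill.hasOutputSchema) return 1; // Structured
  return 0; // Advisory: SKILL.md guidance only
}
```

Under this reading, a skill with an evaluation suite but no output schema would still fall back to Tier 0, because each tier builds on the one below it.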


🌐 Browser Automation Integration (v3.1.0)

V3.1.0 adds full browser automation support via @claude-flow/browser integration:

| Component | Description |
|-----------|-------------|
| BrowserSwarmCoordinator | Parallel multi-viewport testing (4x faster) |
| BrowserSecurityScanner | URL validation, PII detection with auto-masking |
| 9 Workflow Templates | YAML-based reusable browser workflows |
| TrajectoryAdapter | SONA learning integration with HNSW indexing |

Available Workflow Templates:

  • login-flow, oauth-flow - Authentication testing
  • form-validation, navigation-flow - User journey testing
  • visual-regression, accessibility-audit - Quality validation
  • performance-audit, api-integration, scraping-workflow - Advanced workflows

    # Use browser automation from Claude Code
    claude "Use security-visual-testing skill to test https://example.com across mobile, tablet, desktop viewports"

    # Load and execute a workflow template
    aqe workflow load login-flow --vars '{"username": "test", "password": "secret"}'
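
The BrowserSecurityScanner row mentions URL validation and PII masking. The sketch below shows one plausible shape for those two checks; it is not the scanner's implementation. The function names, the http/https allow-list, and the email-only masking pattern are assumptions chosen to keep the example small.

```typescript
// Hypothetical sketch of URL validation and PII masking. The names,
// the scheme allow-list, and the email-only pattern are assumptions.

// Accept only well-formed http(s) URLs; reject everything else.
function isAllowedUrl(raw: string): boolean {
  let url: URL;
  try {
    url = new URL(raw); // throws on malformed input
  } catch {
    return false;
  }
  return url.protocol === "https:" || url.protocol === "http:";
}

// Deliberately simple email matcher; real PII detection covers far more.
const EMAIL_PATTERN = /[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}/g;

// Replace every email address in the text with a fixed placeholder.
function maskEmails(text: string): string {
  return text.replace(EMAIL_PATTERN, "[EMAIL]");
}
```

Validating the scheme after parsing, rather than with a string prefix check, avoids accepting inputs such as `javascript:` URLs that parse successfully but should never be opened by an automated browser.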

👑 Queen Coordinator & Hierarchical Orchestration

The qe-queen-coordinator manages the entire fleet with intelligent task distribution:

                    qe-queen-coordinator

5-Dim Analysis

  • Clarity: 8/10
  • Novelty: 9/10
  • Utility: 9/10
  • Completeness: 9/10
  • Maintainability: 8/10

Pros & Cons

Pros

  • Automates test generation and analysis.
  • Supports multiple testing frameworks.
  • Utilizes advanced AI for efficiency.
  • Offers a structured approach to quality engineering.

Cons

  • Complex setup process for new users.
  • May require familiarity with AI concepts.
  • Dependency on external tools like Claude.
  • Potential performance overhead with AI processing.

Disclaimer: This content is sourced from GitHub open source projects for display and rating purposes only.

Copyright belongs to the original author proffesor-for-testing.