agentic-qe
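💡 Summary
The Agentic Quality Engineering Fleet uses AI agents to automate quality engineering tasks for efficient testing and analysis.
🎯 Target Audience
🤖 AI Roast: "Looks like it can hold its own, but don't let the configuration scare people off."
Risk: Medium. Recommended checks: whether it executes shell/command-line instructions; whether it makes external network requests (SSRF/data exfiltration); how API keys/tokens are obtained, stored, and potentially leaked; dependency pinning and supply-chain risk. Run with least privilege, and audit the code and dependencies before enabling it in production.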
Agentic Quality Engineering Fleet
V3 brings Domain-Driven Design architecture, 12 bounded contexts, 51 specialized QE agents, TinyDancer intelligent model routing, ReasoningBank learning with Dream cycles, HNSW vector search, mathematical Coherence verification, full MinCut/Consensus integration across all 12 domains, and deep integration with Claude Flow and Agentic Flow.
What's New in v3.4.2
- Skill Validation System - 4-layer trust tiers with schemas, validators, and evaluation suites (ADR-056)
- 63 QE Skills with Trust Tiers - 46 Tier 3 (Verified), 7 Tier 2, 5 Tier 1, 5 Tier 0 skills
- CLI Validation Commands - aqe skill report, aqe eval run, regression detection
What's New in v3.4.0
- AG-UI Protocol - An open streaming agent-to-user interface protocol with real-time progress updates
- A2A Protocol - Google's agent-to-agent interoperability standard for cross-tool communication
- A2UI Components - Unified UI combining AG-UI streaming with A2A event handling
🏗️ DDD Architecture | 🧠 ReasoningBank + Dream Cycles | 🎯 TinyDancer Model Routing | 🔍 HNSW Vector Search | 👑 Queen Coordinator | 📊 O(log n) Coverage | 🔗 Claude Flow Integration | 🎯 12 Bounded Contexts | 📚 63 QE Skills | 🧬 Coherence Verification | ✅ Trust Tiers
⚡ Quick Start
Install & Initialize
    # Install globally
    npm install -g agentic-qe

    # Initialize your project
    cd your-project
    aqe init --wizard

    # Or with auto-configuration
    aqe init --auto

    # Add MCP server to Claude Code (pick one)
    # Option 1: Global install (recommended after npm install -g)
    claude mcp add aqe -- aqe-mcp

    # Option 2: Via npx (no global install needed)
    claude mcp add aqe -- npx agentic-qe mcp

    # Verify connection
    claude mcp list
Use from Claude Code CLI
Ask Claude to use QE agents directly from your terminal:
    # Generate comprehensive tests with learning
    claude "Use qe-test-architect to create tests for src/services/user-service.ts with 95% coverage"

    # Run full quality pipeline with Queen coordination
    claude "Use qe-queen-coordinator to orchestrate: test generation, coverage analysis, security scan, and quality gate"

    # Detect flaky tests with root cause analysis
    claude "Use qe-flaky-hunter to analyze the last 100 test runs and stabilize flaky tests"
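To make the flaky-test request more concrete, here is a minimal sketch of the simplest possible flakiness check: a test counts as flaky when it has both passed and failed across the recorded runs. This is a plain heuristic for illustration, not the ML-powered detection that qe-flaky-hunter is described as using, and the `TestRun` shape and `findFlakyTests` name are assumptions made for this example.

```typescript
// Hypothetical sketch: flag tests whose outcome varies across recorded runs.
// TestRun and findFlakyTests are illustrative names, not part of agentic-qe.
interface TestRun {
  testName: string;
  passed: boolean;
}

function findFlakyTests(runs: TestRun[]): string[] {
  const outcomes = new Map<string, { passes: number; failures: number }>();

  // Tally passes and failures per test name.
  for (const run of runs) {
    const tally = outcomes.get(run.testName) ?? { passes: 0, failures: 0 };
    if (run.passed) {
      tally.passes += 1;
    } else {
      tally.failures += 1;
    }
    outcomes.set(run.testName, tally);
  }

  // A test is treated as flaky when it has both passed and failed.
  return Array.from(outcomes.entries())
    .filter(([, tally]) => tally.passes > 0 && tally.failures > 0)
    .map(([name]) => name)
    .sort();
}
```

A check like this only makes sense when the runs were taken against the same code revision; otherwise a genuine regression would look like flakiness.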
What V3 provides:
- ✅ 12 DDD Bounded Contexts: Organized by business domain (test-generation, coverage-analysis, security-compliance, etc.)
- ✅ 51 QE Agents: Including Queen Coordinator for hierarchical orchestration (44 main + 7 TDD subagents)
- ✅ TinyDancer Model Routing: 3-tier intelligent routing (Haiku/Sonnet/Opus) for cost optimization
- ✅ ReasoningBank Learning: HNSW-indexed pattern storage with experience replay
- ✅ O(log n) Coverage Analysis: Sublinear algorithms for efficient gap detection
- ✅ Claude Flow Integration: Deep integration with MCP tools and swarm orchestration
- ✅ Memory Coordination: Cross-agent communication via aqe/v3/* namespaces
- ✅ Coherence Verification (v3.3.0): Mathematical proof of belief consistency using WASM engines
- ✅ V2 Backward Compatibility: All V2 agents map to V3 equivalents
- ✅ 63 QE Skills with Trust Tiers: 46 Tier 3 (Verified) skills with full validation suites
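The 3-tier routing idea in the list above can be pictured as a function from an estimated task complexity to a model tier. The sketch below is illustrative only: the `Task` shape, the `complexity` score, and both thresholds are assumptions made for this example, not TinyDancer's actual API or heuristics.

```typescript
// Hypothetical sketch of 3-tier model routing; not TinyDancer's real API.
type ModelTier = "haiku" | "sonnet" | "opus";

interface Task {
  description: string;
  // Assumed complexity score in [0, 1]; how AQE estimates it is not covered here.
  complexity: number;
}

// Illustrative thresholds, chosen only for this example.
const SIMPLE_MAX = 0.3;
const MODERATE_MAX = 0.7;

function routeModel(task: Task): ModelTier {
  const c = task.complexity;
  if (!Number.isFinite(c) || c < 0 || c > 1) {
    throw new RangeError("complexity must be a number in [0, 1]");
  }
  if (c <= SIMPLE_MAX) return "haiku"; // cheap model for simple tasks
  if (c <= MODERATE_MAX) return "sonnet"; // mid-tier model for moderate tasks
  return "opus"; // most capable model for the hardest tasks
}
```

The cost saving comes from the first two branches: any task that can be handled by a smaller model never reaches the most expensive one.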
🚀 Get Value in 60 Seconds
    # 1. Install
    npm install -g agentic-qe

    # 2. Initialize (auto-detects your project, enables all 12 domains)
    cd your-project && aqe init --auto

    # 3. Generate tests immediately
    claude "Generate comprehensive tests for src/services/"

    # 4. Run quality assessment
    claude "Assess code quality and provide deployment recommendation"
What happens:
- Auto-configuration detects your tech stack (TypeScript/JS, testing framework, CI setup)
- All 12 DDD domains enabled automatically - no "No factory registered" errors
- Pattern learning kicks in - your project's test patterns are learned and reused
- AI agents generate tests, analyze coverage, and provide actionable recommendations
🎯 Why AQE?
| Problem | AQE Solution |
|---------|--------------|
| Writing comprehensive tests is tedious and time-consuming | AI agents generate tests automatically with pattern reuse across projects |
| Test suites become slow and expensive at scale | Sublinear O(log n) algorithms for coverage analysis and intelligent test selection |
| Flaky tests waste developer time debugging false failures | ML-powered detection with root cause analysis and fix recommendations |
| AI testing tools are expensive | TinyDancer 3-tier model routing reduces costs by matching task complexity to appropriate model |
| No memory between test runs; every analysis starts from scratch | ReasoningBank remembers patterns, strategies, and what works for your codebase |
| Agents waste tokens reading irrelevant code | Code Intelligence provides token reduction with semantic search and knowledge graphs |
| Quality engineering requires complex coordination | Queen Coordinator orchestrates 51 agents across 12 domains with consensus and MinCut topology |
| Tools don't understand your testing frameworks | Works with Jest, Cypress, Playwright, Vitest, Mocha, Jasmine, AVA |
✨ V3 Features
🏗️ Domain-Driven Design Architecture
V3 is built on 12 DDD Bounded Contexts, each with dedicated agents and clear responsibilities:
| Domain | Purpose | Key Agents |
|--------|---------|------------|
| test-generation | AI-powered test creation | qe-test-architect, qe-tdd-specialist |
| test-execution | Parallel execution & retry | qe-parallel-executor, qe-retry-handler |
| coverage-analysis | O(log n) gap detection | qe-coverage-specialist, qe-gap-detector |
| quality-assessment | Quality gates & decisions | qe-quality-gate, qe-risk-assessor |
| defect-intelligence | Prediction & root cause | qe-defect-predictor, qe-root-cause-analyzer |
| requirements-validation | BDD & testability | qe-requirements-validator, qe-bdd-generator |
| code-intelligence | Knowledge graph & search | qe-code-intelligence, qe-kg-builder |
| security-compliance | SAST/DAST & audit | qe-security-scanner, qe-security-auditor |
| contract-testing | API contracts & GraphQL | qe-contract-validator, qe-graphql-tester |
| visual-accessibility | Visual regression & a11y | qe-visual-tester, qe-accessibility-auditor |
| chaos-resilience | Chaos engineering & load | qe-chaos-engineer, qe-load-tester |
| learning-optimization | Cross-domain learning | qe-learning-coordinator, qe-pattern-learner |
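One way to picture the domain-to-agent mapping in the table is as a typed lookup. The sketch below copies the domain and agent names from the table; the `KEY_AGENTS` constant and the `isDomain` and `keyAgentsFor` helpers are hypothetical names for this example, not agentic-qe's actual registry or API, and only the key agents listed above are included.

```typescript
// Illustrative registry built from the table above; all identifiers defined
// here are assumptions for this sketch, not agentic-qe's real API.
const KEY_AGENTS = {
  "test-generation": ["qe-test-architect", "qe-tdd-specialist"],
  "test-execution": ["qe-parallel-executor", "qe-retry-handler"],
  "coverage-analysis": ["qe-coverage-specialist", "qe-gap-detector"],
  "quality-assessment": ["qe-quality-gate", "qe-risk-assessor"],
  "defect-intelligence": ["qe-defect-predictor", "qe-root-cause-analyzer"],
  "requirements-validation": ["qe-requirements-validator", "qe-bdd-generator"],
  "code-intelligence": ["qe-code-intelligence", "qe-kg-builder"],
  "security-compliance": ["qe-security-scanner", "qe-security-auditor"],
  "contract-testing": ["qe-contract-validator", "qe-graphql-tester"],
  "visual-accessibility": ["qe-visual-tester", "qe-accessibility-auditor"],
  "chaos-resilience": ["qe-chaos-engineer", "qe-load-tester"],
  "learning-optimization": ["qe-learning-coordinator", "qe-pattern-learner"],
} as const;

type Domain = keyof typeof KEY_AGENTS;

// Type guard: narrows an arbitrary string to one of the 12 known domains.
function isDomain(name: string): name is Domain {
  return Object.prototype.hasOwnProperty.call(KEY_AGENTS, name);
}

// Returns the key agents for a domain, or throws for an unknown name.
function keyAgentsFor(name: string): readonly string[] {
  if (!isDomain(name)) {
    throw new Error(`Unknown domain: ${name}`);
  }
  return KEY_AGENTS[name];
}
```

Keeping the domain names as a literal type means a typo such as a misspelled domain is caught at compile time wherever `Domain` is used directly.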
✅ Skill Trust Tiers (v3.4.2)
All 63 QE skills are categorized by trust tier based on their validation infrastructure:
| Tier | Count | Description |
|------|-------|-------------|
| Tier 3 - Verified | 46 | Full evaluation test suite |
| Tier 2 - Validated | 7 | Has executable validator |
| Tier 1 - Structured | 5 | Has JSON output schema |
| Tier 0 - Advisory | 5 | SKILL.md guidance only |
Tier 3 Skills are recommended for production use - they have:
- JSON Schema validation for output structure
- Executable validator scripts for correctness
- Evaluation test suites with multi-model testing
    # Check skill trust tier
    aqe eval status --skill security-testing

    # Run skill evaluation
    aqe eval run --skill security-testing --model claude-sonnet-4

    # View all trust tiers
    cat .claude/skills/TRUST-TIERS.md
[Full documentation: docs/guides/skill-validation.md]
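The tier definitions in this section can be sketched as a small classification function. It assumes the tiers are cumulative (a Tier 2 skill also has a schema, and a Tier 3 skill has all three artifacts), which matches the Tier 3 description but is not stated outright for Tiers 1 and 2. The `SkillArtifacts` shape and `trustTier` name are hypothetical; the real validation system (ADR-056) may decide tiers differently.

```typescript
// Hypothetical sketch: derive a trust tier from the validation artifacts a skill ships.
// Assumes tiers are cumulative; this is an assumption, not confirmed by the source.
type TrustTier = 0 | 1 | 2 | 3;

interface SkillArtifacts {
  hasOutputSchema: boolean; // JSON Schema for output structure
  hasValidator: boolean; // executable validator script
  hasEvalSuite: boolean; // evaluation test suite
}

function trustTier(skill: SkillArtifacts): TrustTier {
  if (skill.hasOutputSchema && skill.hasValidator && skill.hasEvalSuite) {
    return 3; // Verified
  }
  if (skill.hasOutputSchema && skill.hasValidator) {
    return 2; // Validated
  }
  if (skill.hasOutputSchema) {
    return 1; // Structured
  }
  return 0; // Advisory: SKILL.md guidance only
}
```

Under this reading, a skill with an evaluation suite but no schema would still land in Tier 0, since each tier builds on the one below it.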
🌐 Browser Automation Integration (v3.1.0)
V3.1.0 adds full browser automation support via @claude-flow/browser integration:
| Component | Description |
|-----------|-------------|
| BrowserSwarmCoordinator | Parallel multi-viewport testing (4x faster) |
| BrowserSecurityScanner | URL validation, PII detection with auto-masking |
| 9 Workflow Templates | YAML-based reusable browser workflows |
| TrajectoryAdapter | SONA learning integration with HNSW indexing |
Available Workflow Templates:
- login-flow, oauth-flow - Authentication testing
- form-validation, navigation-flow - User journey testing
- visual-regression, accessibility-audit - Quality validation
- performance-audit, api-integration, scraping-workflow - Advanced workflows
    # Use browser automation from Claude Code
    claude "Use security-visual-testing skill to test https://example.com across mobile, tablet, desktop viewports"

    # Load and execute a workflow template
    aqe workflow load login-flow --vars '{"username": "test", "password": "secret"}'
👑 Queen Coordinator & Hierarchical Orchestration
The qe-queen-coordinator manages the entire fleet with intelligent task distribution.
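Pros
- Automated test generation and analysis.
- Supports multiple testing frameworks.
- Uses advanced AI to improve efficiency.
- Provides a structured approach to quality engineering.
Cons
- Setup is complex for new users.
- May require familiarity with AI concepts.
- Depends on external tools such as Claude.
- AI processing may add performance overhead.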
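Disclaimer: This content comes from an open-source GitHub project and is provided for display and rating analysis only.
Copyright belongs to the original author, proffesor-for-testing.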
