Co-Pilot (Assisted)

Updated 4 months ago

agentic-qe

Name: agentic-qe
Rating: 4.3 (159 reviews)
Author: proffesor-for-testing

proffesor-for-testing/agentic-qe

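💡 Summary

The Agentic Quality Engineering Fleet uses AI agents to automate quality engineering tasks, making testing and analysis more efficient.

🎯 Target Audience

Software developers
Quality assurance engineers
DevOps professionals
Project managers
Technical leads

🤖 AI Roast: "Looks like it can throw a punch, but don't let the configuration scare people off."

Security Analysis: Medium Risk

Risk: Medium. Recommended checks: whether it executes shell/command-line instructions; whether it makes outbound network requests (SSRF/data exfiltration); how API keys/tokens are obtained, stored, and potentially leaked; dependency pinning and supply-chain risk. Run with least privilege, and audit the code and dependencies before enabling it in production.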

Agentic Quality Engineering Fleet

V3 brings Domain-Driven Design architecture, 12 bounded contexts, 51 specialized QE agents, TinyDancer intelligent model routing, ReasoningBank learning with Dream cycles, HNSW vector search, mathematical Coherence verification, full MinCut/Consensus integration across all 12 domains, and deep integration with Claude Flow and Agentic Flow.

What's New in v3.4.2

Skill Validation System - 4-layer trust tiers with schemas, validators, and evaluation suites (ADR-056)
63 QE Skills with Trust Tiers - 46 Tier 3 (Verified), 7 Tier 2, 5 Tier 1, 5 Tier 0 skills
CLI Validation Commands - aqe skill report, aqe eval run, regression detection

What's New in v3.4.0

AG-UI Protocol - CopilotKit's streaming agent-to-user interaction protocol with real-time progress updates
A2A Protocol - Google's agent-to-agent interoperability standard for cross-tool communication
A2UI Components - Unified UI combining AG-UI streaming with A2A event handling

⚡ Quick Start

Install & Initialize

# Install globally
npm install -g agentic-qe

# Initialize your project
cd your-project
aqe init --wizard

# Or with auto-configuration
aqe init --auto

# Add MCP server to Claude Code (pick one)
# Option 1: Global install (recommended after npm install -g)
claude mcp add aqe -- aqe-mcp

# Option 2: Via npx (no global install needed)
claude mcp add aqe -- npx agentic-qe mcp

# Verify connection
claude mcp list

Use from Claude Code CLI

Ask Claude to use QE agents directly from your terminal:

# Generate comprehensive tests with learning
claude "Use qe-test-architect to create tests for src/services/user-service.ts with 95% coverage"

# Run full quality pipeline with Queen coordination
claude "Use qe-queen-coordinator to orchestrate: test generation, coverage analysis, security scan, and quality gate"

# Detect flaky tests with root cause analysis
claude "Use qe-flaky-hunter to analyze the last 100 test runs and stabilize flaky tests"
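
To give a feel for what flaky-test detection involves, here is a minimal sketch. It is not how qe-flaky-hunter works internally (the README does not say); it assumes a hypothetical `TestRunRecord` shape and uses the simplest possible rule: a test is flagged when the same commit produced both a pass and a failure.

```typescript
// Hypothetical record shape; agentic-qe's real run-history format is not shown in this README.
interface TestRunRecord {
  testName: string;
  commit: string;
  passed: boolean;
}

// Flags a test as flaky when one commit produced both a pass and a failure for it.
function findFlakyTests(runs: TestRunRecord[]): string[] {
  const seen: Record<string, { testName: string; pass: boolean; fail: boolean }> = {};
  for (const run of runs) {
    const key = run.testName + "@" + run.commit;
    const entry = seen[key] || { testName: run.testName, pass: false, fail: false };
    if (run.passed) {
      entry.pass = true;
    } else {
      entry.fail = true;
    }
    seen[key] = entry;
  }

  const flaky: string[] = [];
  for (const key of Object.keys(seen)) {
    const entry = seen[key];
    if (entry && entry.pass && entry.fail && flaky.indexOf(entry.testName) === -1) {
      flaky.push(entry.testName);
    }
  }
  return flaky.sort();
}

const sampleRuns: TestRunRecord[] = [
  { testName: "login redirects to dashboard", commit: "a1b2c3", passed: true },
  { testName: "login redirects to dashboard", commit: "a1b2c3", passed: false },
  { testName: "checkout computes total", commit: "a1b2c3", passed: false },
  { testName: "checkout computes total", commit: "a1b2c3", passed: false },
  { testName: "signup sends email", commit: "a1b2c3", passed: true },
  { testName: "signup sends email", commit: "d4e5f6", passed: false },
];

// Only the login test mixes outcomes on a single commit.
console.log(findFlakyTests(sampleRuns));
```

A consistently failing test, or one that starts failing on a new commit, is deliberately not flagged: those look like real regressions rather than flakiness.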

What V3 provides:

✅ 12 DDD Bounded Contexts: Organized by business domain (test-generation, coverage-analysis, security-compliance, etc.)
✅ 51 QE Agents: Including Queen Coordinator for hierarchical orchestration (44 main + 7 TDD subagents)
✅ TinyDancer Model Routing: 3-tier intelligent routing (Haiku/Sonnet/Opus) for cost optimization
✅ ReasoningBank Learning: HNSW-indexed pattern storage with experience replay
✅ O(log n) Coverage Analysis: Sublinear algorithms for efficient gap detection
✅ Claude Flow Integration: Deep integration with MCP tools and swarm orchestration
✅ Memory Coordination: Cross-agent communication via aqe/v3/* namespaces
✅ Coherence Verification (v3.3.0): Mathematical proof of belief consistency using WASM engines
✅ V2 Backward Compatibility: All V2 agents map to V3 equivalents
✅ 63 QE Skills with Trust Tiers: 46 Tier 3 (Verified) skills with full validation suites
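
The TinyDancer bullet above describes matching task complexity to a model tier. A minimal sketch of that idea follows. The 0-to-1 complexity score, the 0.3 and 0.7 thresholds, and the `routeByComplexity` name are assumptions made for illustration; the README does not document how TinyDancer actually scores or routes tasks.

```typescript
type ModelTier = "haiku" | "sonnet" | "opus";

// Hypothetical thresholds on a 0..1 complexity score (assumption, not TinyDancer's real rule).
function routeByComplexity(complexity: number): ModelTier {
  // The negated range check also rejects NaN.
  if (!(complexity >= 0 && complexity <= 1)) {
    throw new RangeError("complexity must be within [0, 1]");
  }
  if (complexity < 0.3) return "haiku"; // cheap tier for simple tasks
  if (complexity < 0.7) return "sonnet"; // middle tier for moderate tasks
  return "opus"; // most capable tier for the hardest tasks
}

console.log(routeByComplexity(0.1), routeByComplexity(0.5), routeByComplexity(0.9));
```

The cost saving comes from the first two branches: if most tasks score low, most calls never reach the most expensive tier.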

🚀 Get Value in 60 Seconds

# 1. Install
npm install -g agentic-qe

# 2. Initialize (auto-detects your project, enables all 12 domains)
cd your-project && aqe init --auto

# 3. Generate tests immediately
claude "Generate comprehensive tests for src/services/"

# 4. Run quality assessment
claude "Assess code quality and provide deployment recommendation"

What happens:

Auto-configuration detects your tech stack (TypeScript/JS, testing framework, CI setup)
All 12 DDD domains enabled automatically - no "No factory registered" errors
Pattern learning kicks in - your project's test patterns are learned and reused
AI agents generate tests, analyze coverage, and provide actionable recommendations
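
Pattern reuse of the kind described above boils down to "find the stored pattern most similar to the current situation". The sketch below shows that lookup with a plain linear scan over cosine similarity. It is a stand-in, not ReasoningBank's implementation: an HNSW index would replace the linear scan with an approximate graph search, and the `StoredPattern` shape and three-dimensional embeddings are invented for the example.

```typescript
// Hypothetical pattern record; real embeddings would have hundreds of dimensions.
interface StoredPattern {
  label: string;
  embedding: number[];
}

function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) throw new Error("embedding dimensions differ");
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  if (normA === 0 || normB === 0) return 0; // a zero vector has no direction
  return dot / Math.sqrt(normA * normB);
}

// Brute-force nearest-pattern lookup: O(n) per query, unlike an HNSW index.
function closestPattern(query: number[], patterns: StoredPattern[]): StoredPattern | undefined {
  let best: StoredPattern | undefined;
  let bestScore = -Infinity;
  for (const pattern of patterns) {
    const score = cosineSimilarity(query, pattern.embedding);
    if (score > bestScore) {
      bestScore = score;
      best = pattern;
    }
  }
  return best;
}

const learnedPatterns: StoredPattern[] = [
  { label: "mock external HTTP calls", embedding: [1, 0, 0] },
  { label: "table-driven edge cases", embedding: [0, 1, 0] },
];

const nearest = closestPattern([0.9, 0.1, 0], learnedPatterns);
console.log(nearest ? nearest.label : "no patterns stored");
```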

🎯 Why AQE?

| Problem | AQE Solution |
|---------|--------------|
| Writing comprehensive tests is tedious and time-consuming | AI agents generate tests automatically with pattern reuse across projects |
| Test suites become slow and expensive at scale | Sublinear O(log n) algorithms for coverage analysis and intelligent test selection |
| Flaky tests waste developer time debugging false failures | ML-powered detection with root cause analysis and fix recommendations |
| AI testing tools are expensive | TinyDancer 3-tier model routing reduces costs by matching task complexity to appropriate model |
| No memory between test runs; every analysis starts from scratch | ReasoningBank remembers patterns, strategies, and what works for your codebase |
| Agents waste tokens reading irrelevant code | Code Intelligence provides token reduction with semantic search and knowledge graphs |
| Quality engineering requires complex coordination | Queen Coordinator orchestrates 51 agents across 12 domains with consensus and MinCut topology |
| Tools don't understand your testing frameworks | Works with Jest, Cypress, Playwright, Vitest, Mocha, Jasmine, AVA |
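
The README does not describe the sublinear coverage algorithms themselves. As a generic illustration of where an O(log n) bound can come from, the sketch below answers "is this line covered?" with a binary search over sorted, non-overlapping covered ranges. The `LineRange` shape and `isLineCovered` name are assumptions for this example, not part of AQE.

```typescript
// Inclusive, 1-based line range (hypothetical shape for illustration).
interface LineRange {
  start: number;
  end: number;
}

// Assumes ranges are sorted by start and do not overlap.
// Each iteration halves the search window, so a lookup costs O(log n) in the number of ranges.
function isLineCovered(ranges: LineRange[], line: number): boolean {
  let lo = 0;
  let hi = ranges.length - 1;
  while (lo <= hi) {
    const mid = (lo + hi) >> 1;
    const range = ranges[mid];
    if (line < range.start) {
      hi = mid - 1;
    } else if (line > range.end) {
      lo = mid + 1;
    } else {
      return true;
    }
  }
  return false;
}

const coveredRanges: LineRange[] = [
  { start: 1, end: 10 },
  { start: 15, end: 20 },
  { start: 30, end: 42 },
];

console.log(isLineCovered(coveredRanges, 17)); // inside the second range
console.log(isLineCovered(coveredRanges, 12)); // falls in the gap between ranges
```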

✨ V3 Features

🏗️ Domain-Driven Design Architecture

V3 is built on 12 DDD Bounded Contexts, each with dedicated agents and clear responsibilities:

| Domain | Purpose | Key Agents |
|--------|---------|------------|
| test-generation | AI-powered test creation | qe-test-architect, qe-tdd-specialist |
| test-execution | Parallel execution & retry | qe-parallel-executor, qe-retry-handler |
| coverage-analysis | O(log n) gap detection | qe-coverage-specialist, qe-gap-detector |
| quality-assessment | Quality gates & decisions | qe-quality-gate, qe-risk-assessor |
| defect-intelligence | Prediction & root cause | qe-defect-predictor, qe-root-cause-analyzer |
| requirements-validation | BDD & testability | qe-requirements-validator, qe-bdd-generator |
| code-intelligence | Knowledge graph & search | qe-code-intelligence, qe-kg-builder |
| security-compliance | SAST/DAST & audit | qe-security-scanner, qe-security-auditor |
| contract-testing | API contracts & GraphQL | qe-contract-validator, qe-graphql-tester |
| visual-accessibility | Visual regression & a11y | qe-visual-tester, qe-accessibility-auditor |
| chaos-resilience | Chaos engineering & load | qe-chaos-engineer, qe-load-tester |
| learning-optimization | Cross-domain learning | qe-learning-coordinator, qe-pattern-learner |
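
For readers who prefer data to tables, the domain-to-agent mapping above can be written as a simple lookup. This is only a restatement of the table as a TypeScript record; `domainAgents` and `agentsForDomain` are illustrative names, not identifiers from the agentic-qe codebase.

```typescript
// Domain names and key agents copied from the table above.
const domainAgents: Record<string, string[]> = {
  "test-generation": ["qe-test-architect", "qe-tdd-specialist"],
  "test-execution": ["qe-parallel-executor", "qe-retry-handler"],
  "coverage-analysis": ["qe-coverage-specialist", "qe-gap-detector"],
  "quality-assessment": ["qe-quality-gate", "qe-risk-assessor"],
  "defect-intelligence": ["qe-defect-predictor", "qe-root-cause-analyzer"],
  "requirements-validation": ["qe-requirements-validator", "qe-bdd-generator"],
  "code-intelligence": ["qe-code-intelligence", "qe-kg-builder"],
  "security-compliance": ["qe-security-scanner", "qe-security-auditor"],
  "contract-testing": ["qe-contract-validator", "qe-graphql-tester"],
  "visual-accessibility": ["qe-visual-tester", "qe-accessibility-auditor"],
  "chaos-resilience": ["qe-chaos-engineer", "qe-load-tester"],
  "learning-optimization": ["qe-learning-coordinator", "qe-pattern-learner"],
};

// Hypothetical helper: fail loudly on an unknown domain instead of returning undefined.
function agentsForDomain(domain: string): string[] {
  if (!Object.prototype.hasOwnProperty.call(domainAgents, domain)) {
    throw new Error(`Unknown domain: ${domain}`);
  }
  return domainAgents[domain];
}

console.log(Object.keys(domainAgents).length); // number of bounded contexts listed
console.log(agentsForDomain("coverage-analysis"));
```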

✅ Skill Trust Tiers (v3.4.2)

All 63 QE skills are categorized by trust tier based on their validation infrastructure:

| Tier | Count | Description |
|------|-------|-------------|
| Tier 3 - Verified | 46 | Full evaluation test suite |
| Tier 2 - Validated | 7 | Has executable validator |
| Tier 1 - Structured | 5 | Has JSON output schema |
| Tier 0 - Advisory | 5 | SKILL.md guidance only |

Tier 3 Skills are recommended for production use - they have:

JSON Schema validation for output structure
Executable validator scripts for correctness
Evaluation test suites with multi-model testing
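
One way to read the tier table is as a ladder where each tier adds one artifact on top of the previous one. The sketch below encodes that reading and checks that the published counts add up to 63. The cumulative rule is an inference from the table (only Tier 3 is explicitly stated to have all three artifacts), and `SkillArtifacts` and `trustTier` are hypothetical names.

```typescript
// Hypothetical description of which validation artifacts a skill ships with.
interface SkillArtifacts {
  hasOutputSchema: boolean;
  hasValidator: boolean;
  hasEvalSuite: boolean;
}

// Assumed cumulative rule: schema -> Tier 1, plus validator -> Tier 2, plus eval suite -> Tier 3.
function trustTier(skill: SkillArtifacts): 0 | 1 | 2 | 3 {
  if (!skill.hasOutputSchema) return 0;
  if (!skill.hasValidator) return 1;
  if (!skill.hasEvalSuite) return 2;
  return 3;
}

// Counts per tier as published in the table above.
const tierCounts = { tier3: 46, tier2: 7, tier1: 5, tier0: 5 };
const totalSkills = tierCounts.tier3 + tierCounts.tier2 + tierCounts.tier1 + tierCounts.tier0;

console.log(trustTier({ hasOutputSchema: true, hasValidator: true, hasEvalSuite: false }));
console.log(totalSkills);
```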

# Check skill trust tier
aqe eval status --skill security-testing

# Run skill evaluation
aqe eval run --skill security-testing --model claude-sonnet-4

# View all trust tiers
cat .claude/skills/TRUST-TIERS.md

Full documentation: docs/guides/skill-validation.md

🌐 Browser Automation Integration (v3.1.0)

V3.1.0 adds full browser automation support via @claude-flow/browser integration:

| Component | Description |
|-----------|-------------|
| BrowserSwarmCoordinator | Parallel multi-viewport testing (4x faster) |
| BrowserSecurityScanner | URL validation, PII detection with auto-masking |
| 9 Workflow Templates | YAML-based reusable browser workflows |
| TrajectoryAdapter | SONA learning integration with HNSW indexing |
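
To illustrate what "URL validation" and "PII detection with auto-masking" can mean in practice, here is a small sketch. It does not reproduce BrowserSecurityScanner, whose rules and API are not shown in this README; `isAllowedUrl` and `maskEmails` are hypothetical helpers, and the regular expressions cover only http(s) URLs and simple email addresses.

```typescript
// Accept only absolute http(s) URLs with a non-empty host part.
function isAllowedUrl(candidate: string): boolean {
  return /^https?:\/\/[^\s\/?#]+/i.test(candidate);
}

// Keep the first character of the local part and the domain, mask everything in between.
function maskEmails(text: string): string {
  const emailPattern = /([A-Za-z0-9._%+-])[A-Za-z0-9._%+-]*@([A-Za-z0-9.-]+\.[A-Za-z]{2,})/g;
  return text.replace(emailPattern, "$1***@$2");
}

console.log(isAllowedUrl("https://example.com")); // http(s) scheme with a host
console.log(isAllowedUrl("javascript:alert(1)")); // non-http scheme is rejected
console.log(maskEmails("Contact jane.doe@example.com today"));
```

Real PII detection would need to handle far more than email addresses (phone numbers, card numbers, names), so treat this as the shape of the idea rather than a usable scanner.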

Available Workflow Templates:

login-flow, oauth-flow - Authentication testing
form-validation, navigation-flow - User journey testing
visual-regression, accessibility-audit - Quality validation
performance-audit, api-integration, scraping-workflow - Advanced workflows

# Use browser automation from Claude Code
claude "Use security-visual-testing skill to test https://example.com across mobile, tablet, desktop viewports"

# Load and execute a workflow template
aqe workflow load login-flow --vars '{"username": "test", "password": "secret"}'

👑 Queen Coordinator & Hierarchical Orchestration

The qe-queen-coordinator manages the entire fleet with intelligent task distribution:

                    qe-queen-coordinator
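
As a rough picture of task distribution under a single coordinator, the sketch below assigns each task to an agent of its owning domain, rotating round-robin within the domain. This is an assumption-laden illustration: the README does not describe the Queen Coordinator's real scheduling policy, and `FleetTask` and `distributeTasks` are hypothetical names.

```typescript
// Hypothetical task shape: each task names the domain that owns it.
interface FleetTask {
  id: string;
  domain: string;
}

// Round-robin assignment within each domain's agent list (assumed policy, for illustration only).
function distributeTasks(
  tasks: FleetTask[],
  agentsByDomain: Record<string, string[]>
): Record<string, string> {
  const nextIndex: Record<string, number> = {};
  const assignment: Record<string, string> = {};
  for (const task of tasks) {
    const known = Object.prototype.hasOwnProperty.call(agentsByDomain, task.domain);
    const agents = known ? agentsByDomain[task.domain] : [];
    if (agents.length === 0) {
      throw new Error(`No agents available for domain: ${task.domain}`);
    }
    const index = nextIndex[task.domain] || 0;
    assignment[task.id] = agents[index % agents.length];
    nextIndex[task.domain] = index + 1;
  }
  return assignment;
}

const fleetAgents: Record<string, string[]> = {
  "test-generation": ["qe-test-architect", "qe-tdd-specialist"],
  "security-compliance": ["qe-security-scanner", "qe-security-auditor"],
};

const fleetTasks: FleetTask[] = [
  { id: "t1", domain: "test-generation" },
  { id: "t2", domain: "test-generation" },
  { id: "t3", domain: "test-generation" },
  { id: "t4", domain: "security-compliance" },
];

console.log(distributeTasks(fleetTasks, fleetAgents));
```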

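Five-Dimension Analysis

Clarity: 8/10

Innovation: 9/10

Practicality: 9/10

Completeness: 9/10

Maintainability: 8/10

Pros and Cons

Pros

Automated test generation and analysis.
Supports multiple testing frameworks.
Uses advanced AI to improve efficiency.
Provides a structured approach to quality engineering.

Cons

Setup is complex for new users.
May require familiarity with AI concepts.
Depends on external tools such as Claude.
AI processing may add performance overhead.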