π‘ Summary
A comprehensive framework for AI-assisted software development with automated validation and agent evaluation.
π― Target Audience
π€ AI Roast: βPowerful, but the setup might scare off the impatient.β
Risk: Medium. Review: shell/CLI command execution; outbound network access (SSRF, data egress); API keys/tokens handling and storage; filesystem read/write scope and path traversal; dependency pinning and supply-chain risk. Run with least privilege and audit before enabling in production.
AI Development Standards
Version 3.1.0 | Last Updated: 2025-11-24
A comprehensive framework of specialized AI skills, MCP servers, and development tools for AI-assisted software development. Features automated validation, agent evaluation, and quality assurance.
π Current Resources
Total: 199 Resources
| Category | Count | Description | | ---------------- | ----- | ------------------------------------------------- | | Skills | 64 | Specialized AI methodologies and workflows | | MCPs | 51 | Model Context Protocol servers (executable tools) | | Tools | 4 | Core utility scripts | | Components | 75 | Reusable UI and system components | | Integrations | 5 | Third-party service connectors |
MCP Coverage: 79.7% (51 MCPs supporting 64 Skills)
β¨ What's New in 3.1.0
π― Agent Evaluation System (Phase 5.12)
Implement Eval-Driven Development (EDD) for continuous agent quality assurance:
- Automated testing against golden datasets
- Multiple grading strategies (exact, regex, LLM-based)
- Performance metrics and regression tracking
- 100% test pass rate in validation suite
# Run agent evaluations node scripts/run-agent-evals.js --dataset tests/fixtures/golden-dataset-example.json --mock
β‘ Two-Tier Validation System
- Quick Validation (10-30s): Registry consistency, documentation
- Full Validation (2-5min): Includes linting, type checking, tests, agent evals
npm run validate:quick # Fast feedback npm run validate:full # Comprehensive checks
π Enhanced Documentation
.claude/CLAUDE.md- Complete Claude Code configuration guideFINAL-RESOURCE-COUNTS.md- Resource tracking and metricsdocs/VALIDATION-SYSTEM.md- Validation methodology
π Quick Start
π New to this repository? Check out our Installation Guide and Quick Start Guide for step-by-step instructions.
For New Projects
# Clone the repository git clone https://github.com/daffy0208/ai-dev-standards.git cd ai-dev-standards # Install dependencies npm install # Run validation to ensure everything works npm run validate
For Existing Projects
# Clone as a reference git clone https://github.com/daffy0208/ai-dev-standards.git ~/ai-dev-standards # Reference skills and patterns in your .cursorrules or .claude/claude.md # See docs/EXISTING-PROJECTS.md for integration guide
Using with Claude Code
-
Open your project in Claude Code
-
Reference this repository in your project instructions:
You have access to ai-dev-standards at ~/ai-dev-standards When needed, reference skills from skills/ and patterns from standards/ Use the skill-registry.json to find relevant skills for tasks -
Claude will automatically discover and use appropriate skills
π What This Repository Does
Think of this as a shared knowledge base between you and Claude:
π 64 Specialized Skills
Methodologies Claude follows automatically:
- Product: mvp-builder, product-strategist, go-to-market-planner
- AI/ML: rag-implementer, multi-agent-architect, knowledge-graph-builder
- Development: frontend-builder, api-designer, backend-architect
- Infrastructure: deployment-advisor, security-engineer, performance-optimizer
- Design: ux-designer, visual-designer, design-system-architect
- Quality: testing-strategist, quality-auditor, agent-evaluator
π§ 51 MCP Servers
Executable tools that extend Claude's capabilities:
- Search: semantic-search-mcp, dark-matter-analyzer-mcp
- Quality: code-quality-scanner-mcp, security-scanner-mcp, test-runner-mcp
- AI/Data: vector-database-mcp, embedding-generator-mcp, knowledge-base-mcp
- Design: figma-sync-mcp, design-token-manager-mcp, theme-builder-mcp
- DevOps: deployment-orchestrator-mcp, database-migration-mcp
π Architecture Patterns
Proven approaches for complex systems:
- RAG architectures (Naive, Advanced, Modular)
- Multi-agent coordination patterns
- Event-driven systems
- Real-time data pipelines
- Authentication patterns
π‘οΈ Quality Assurance
- Automated validation system (2-tier)
- Agent evaluation framework (EDD)
- Security best practices
- Performance standards
- Accessibility guidelines
π‘ Key Features
β‘ Automated Validation
# Quick validation (10-30 seconds) npm run validate:quick # Full validation (2-5 minutes) npm run validate:full # Agent evaluation only node scripts/run-agent-evals.js --dataset tests/fixtures/golden-dataset-example.json --mock
Validates:
- β Registry consistency
- β Documentation accuracy
- β Code quality (ESLint)
- β Type safety (TypeScript)
- β Test coverage
- β Agent performance (NEW)
π€ Agent Evaluation System
Test AI agents against golden datasets to ensure consistent, high-quality outputs:
{ "tests": [ { "id": "T001", "input": "Create a React button component with TypeScript", "expected": "import React from 'react';", "grading": { "type": "contains", "threshold": 0.8 } } ] }
Features:
- Multiple grading types (exact match, contains, regex, LLM-graded)
- Performance metrics (latency, success rate, score)
- Historical tracking and regression detection
- Custom dataset support
π Comprehensive Documentation
- For Developers:
docs/GETTING-STARTED.md,docs/QUICK-START.md - For AI:
meta/PROJECT-CONTEXT.md,meta/HOW-TO-USE.md - Configuration:
.claude/CLAUDE.md,FINAL-RESOURCE-COUNTS.md - Validation:
docs/VALIDATION-SYSTEM.md
π― Smart Resource Discovery
# Find skills for a task grep -r "mvp" meta/skill-registry.json # Search all resources grep -r "authentication" meta/ # View resource counts cat FINAL-RESOURCE-COUNTS.md
ποΈ Repository Structure
ai-dev-standards/
βββ skills/ # 64 specialized methodologies
β βββ mvp-builder/ # MVP development & prioritization
β βββ rag-implementer/ # RAG system implementation
β βββ api-designer/ # API design patterns
β βββ [61 more...]
β
βββ mcp-servers/ # 51 executable tools
β βββ semantic-search-mcp/ # Semantic code search
β βββ vector-database-mcp/ # Vector DB integration
β βββ code-quality-scanner-mcp/
β βββ [48 more...]
β
βββ standards/ # Architecture & best practices
β βββ architecture-patterns/
β βββ best-practices/
β βββ coding-conventions/
β βββ project-structure/
β
βββ meta/ # Resource registry & context
β βββ registry.json # Master resource registry
β βββ skill-registry.json # Skill catalog
β βββ mcp-registry.json # MCP catalog
β βββ PROJECT-CONTEXT.md # For AI assistants
β
βββ docs/ # Comprehensive documentation
β βββ GETTING-STARTED.md
β βββ VALIDATION-SYSTEM.md
β βββ AGENT-VALIDATION.md # NEW!
β βββ [40+ more guides...]
β
βββ scripts/ # Automation & validation
β βββ run-agent-evals.js # NEW! Agent evaluation
β βββ validate-full.sh # Full validation suite
β βββ [20+ more scripts...]
β
βββ tests/ # Test suites & fixtures
βββ fixtures/
β βββ golden-dataset-example.json # NEW!
βββ [150+ test files...]
π― Usage Examples
Example 1: Starting a New Project
User: "I want to build a SaaS product for invoice management"
Claude uses:
1. product-strategist β Validate problem-solution fit
2. mvp-builder β Identify P0 features (invoicing, payment tracking)
3. frontend-builder β React/Next.js structure
4. api-designer β REST API design
5. deployment-advisor β Vercel + Railway recommendation
6. security-engineer β Auth, data encryption, PCI compliance
Example 2: Implementing AI Search
User: "Add AI-powered search to our documentation"
Claude uses:
1. rag-implementer β RAG methodology
2. rag-pattern.md β Advanced RAG architecture
3. vector-database-mcp β Pinecone integration
4. embedding-generator-mcp β OpenAI embeddings
5. semantic-search-mcp β Search implementation
Example 3: Code Quality Audit
User: "Audit our codebase for quality issues"
Claude uses:
1. quality-auditor β Comprehensive audit methodology
2. code-quality-scanner-mcp β Static analysis
3. security-scanner-mcp β Vulnerability detection
4. performance-profiler-mcp β Performance bottlenecks
5. test-runner-mcp β Test coverage analysis
6. agent-evaluator β AI agent quality checks (NEW!)
π Finding Skills
By Task
# Search skills by keyword grep -i "authentication" meta/skill-registry.json grep -i "database" meta/skill-registry.json grep -i "testing" meta/skill-registry.json
By Category
View meta/skill-registry.json for complete categorization:
- Product & Business (8 skills)
- AI & Machine Learning (10 skills)
- Frontend Development (6 skills)
- Backend Development (8 skills)
- Infrastructure & DevOps (8 skills)
- Design & UX (12 skills)
- Quality & Testing (12 skills)
Auto-Discovery
Skills activate automatically based on your conversation with Claude. Just describe what you want to build!
βοΈ Validation System
Two-Tier Approach
Tier 1: Quick Validation (10-30 seconds)
npm run validate:quick
Checks:
- Registry consistency
- Documentation accurac
Pros
- Comprehensive validation system
- Specialized AI skills
- Automated agent evaluation
- Extensive documentation
Cons
- Complex setup for beginners
- Potentially overwhelming documentation
- Requires familiarity with AI concepts
- Dependency on external tools
Related Skills
useful-ai-prompts
AβA treasure trove of prompts, but donβt expect them to write your novel for you.β
mcpspy
AβMCPSpy: because who doesn't want to spy on their AI's secrets?β
fastmcp
AβFastMCP: because who doesn't love a little complexity with their AI?β
Disclaimer: This content is sourced from GitHub open source projects for display and rating purposes only.
Copyright belongs to the original author daffy0208.
