Co-Pilot
Updated a month ago

ai-dev-standards

Ddaffy0208
0.0k
daffy0208/ai-dev-standards
82
Agent Score

πŸ’‘ Summary

A comprehensive framework for AI-assisted software development with automated validation and agent evaluation.

🎯 Target Audience

AI developersSoftware engineersProduct managersData scientistsDevOps professionals

πŸ€– AI Roast: β€œPowerful, but the setup might scare off the impatient.”

Security AnalysisMedium Risk

Risk: Medium. Review: shell/CLI command execution; outbound network access (SSRF, data egress); API keys/tokens handling and storage; filesystem read/write scope and path traversal; dependency pinning and supply-chain risk. Run with least privilege and audit before enabling in production.

AI Development Standards

CI Status License: MIT TypeScript Node

Version 3.1.0 | Last Updated: 2025-11-24

A comprehensive framework of specialized AI skills, MCP servers, and development tools for AI-assisted software development. Features automated validation, agent evaluation, and quality assurance.

πŸ“Š Current Resources

Total: 199 Resources

| Category | Count | Description | | ---------------- | ----- | ------------------------------------------------- | | Skills | 64 | Specialized AI methodologies and workflows | | MCPs | 51 | Model Context Protocol servers (executable tools) | | Tools | 4 | Core utility scripts | | Components | 75 | Reusable UI and system components | | Integrations | 5 | Third-party service connectors |

MCP Coverage: 79.7% (51 MCPs supporting 64 Skills)


✨ What's New in 3.1.0

🎯 Agent Evaluation System (Phase 5.12)

Implement Eval-Driven Development (EDD) for continuous agent quality assurance:

  • Automated testing against golden datasets
  • Multiple grading strategies (exact, regex, LLM-based)
  • Performance metrics and regression tracking
  • 100% test pass rate in validation suite
# Run agent evaluations node scripts/run-agent-evals.js --dataset tests/fixtures/golden-dataset-example.json --mock

⚑ Two-Tier Validation System

  • Quick Validation (10-30s): Registry consistency, documentation
  • Full Validation (2-5min): Includes linting, type checking, tests, agent evals
npm run validate:quick # Fast feedback npm run validate:full # Comprehensive checks

πŸ“š Enhanced Documentation

  • .claude/CLAUDE.md - Complete Claude Code configuration guide
  • FINAL-RESOURCE-COUNTS.md - Resource tracking and metrics
  • docs/VALIDATION-SYSTEM.md - Validation methodology

πŸš€ Quick Start

πŸ“– New to this repository? Check out our Installation Guide and Quick Start Guide for step-by-step instructions.

For New Projects

# Clone the repository git clone https://github.com/daffy0208/ai-dev-standards.git cd ai-dev-standards # Install dependencies npm install # Run validation to ensure everything works npm run validate

For Existing Projects

# Clone as a reference git clone https://github.com/daffy0208/ai-dev-standards.git ~/ai-dev-standards # Reference skills and patterns in your .cursorrules or .claude/claude.md # See docs/EXISTING-PROJECTS.md for integration guide

Using with Claude Code

  1. Open your project in Claude Code

  2. Reference this repository in your project instructions:

    You have access to ai-dev-standards at ~/ai-dev-standards When needed, reference skills from skills/ and patterns from standards/ Use the skill-registry.json to find relevant skills for tasks
  3. Claude will automatically discover and use appropriate skills


πŸ“– What This Repository Does

Think of this as a shared knowledge base between you and Claude:

πŸŽ“ 64 Specialized Skills

Methodologies Claude follows automatically:

  • Product: mvp-builder, product-strategist, go-to-market-planner
  • AI/ML: rag-implementer, multi-agent-architect, knowledge-graph-builder
  • Development: frontend-builder, api-designer, backend-architect
  • Infrastructure: deployment-advisor, security-engineer, performance-optimizer
  • Design: ux-designer, visual-designer, design-system-architect
  • Quality: testing-strategist, quality-auditor, agent-evaluator

πŸ”§ 51 MCP Servers

Executable tools that extend Claude's capabilities:

  • Search: semantic-search-mcp, dark-matter-analyzer-mcp
  • Quality: code-quality-scanner-mcp, security-scanner-mcp, test-runner-mcp
  • AI/Data: vector-database-mcp, embedding-generator-mcp, knowledge-base-mcp
  • Design: figma-sync-mcp, design-token-manager-mcp, theme-builder-mcp
  • DevOps: deployment-orchestrator-mcp, database-migration-mcp

πŸ“ Architecture Patterns

Proven approaches for complex systems:

  • RAG architectures (Naive, Advanced, Modular)
  • Multi-agent coordination patterns
  • Event-driven systems
  • Real-time data pipelines
  • Authentication patterns

πŸ›‘οΈ Quality Assurance

  • Automated validation system (2-tier)
  • Agent evaluation framework (EDD)
  • Security best practices
  • Performance standards
  • Accessibility guidelines

πŸ’‘ Key Features

⚑ Automated Validation

# Quick validation (10-30 seconds) npm run validate:quick # Full validation (2-5 minutes) npm run validate:full # Agent evaluation only node scripts/run-agent-evals.js --dataset tests/fixtures/golden-dataset-example.json --mock

Validates:

  • βœ… Registry consistency
  • βœ… Documentation accuracy
  • βœ… Code quality (ESLint)
  • βœ… Type safety (TypeScript)
  • βœ… Test coverage
  • βœ… Agent performance (NEW)

πŸ€– Agent Evaluation System

Test AI agents against golden datasets to ensure consistent, high-quality outputs:

{ "tests": [ { "id": "T001", "input": "Create a React button component with TypeScript", "expected": "import React from 'react';", "grading": { "type": "contains", "threshold": 0.8 } } ] }

Features:

  • Multiple grading types (exact match, contains, regex, LLM-graded)
  • Performance metrics (latency, success rate, score)
  • Historical tracking and regression detection
  • Custom dataset support

πŸ“Š Comprehensive Documentation

  • For Developers: docs/GETTING-STARTED.md, docs/QUICK-START.md
  • For AI: meta/PROJECT-CONTEXT.md, meta/HOW-TO-USE.md
  • Configuration: .claude/CLAUDE.md, FINAL-RESOURCE-COUNTS.md
  • Validation: docs/VALIDATION-SYSTEM.md

🎯 Smart Resource Discovery

# Find skills for a task grep -r "mvp" meta/skill-registry.json # Search all resources grep -r "authentication" meta/ # View resource counts cat FINAL-RESOURCE-COUNTS.md

πŸ—‚οΈ Repository Structure

ai-dev-standards/
β”œβ”€β”€ skills/                    # 64 specialized methodologies
β”‚   β”œβ”€β”€ mvp-builder/          # MVP development & prioritization
β”‚   β”œβ”€β”€ rag-implementer/      # RAG system implementation
β”‚   β”œβ”€β”€ api-designer/         # API design patterns
β”‚   └── [61 more...]
β”‚
β”œβ”€β”€ mcp-servers/              # 51 executable tools
β”‚   β”œβ”€β”€ semantic-search-mcp/  # Semantic code search
β”‚   β”œβ”€β”€ vector-database-mcp/  # Vector DB integration
β”‚   β”œβ”€β”€ code-quality-scanner-mcp/
β”‚   └── [48 more...]
β”‚
β”œβ”€β”€ standards/                # Architecture & best practices
β”‚   β”œβ”€β”€ architecture-patterns/
β”‚   β”œβ”€β”€ best-practices/
β”‚   β”œβ”€β”€ coding-conventions/
β”‚   └── project-structure/
β”‚
β”œβ”€β”€ meta/                     # Resource registry & context
β”‚   β”œβ”€β”€ registry.json         # Master resource registry
β”‚   β”œβ”€β”€ skill-registry.json   # Skill catalog
β”‚   β”œβ”€β”€ mcp-registry.json     # MCP catalog
β”‚   └── PROJECT-CONTEXT.md    # For AI assistants
β”‚
β”œβ”€β”€ docs/                     # Comprehensive documentation
β”‚   β”œβ”€β”€ GETTING-STARTED.md
β”‚   β”œβ”€β”€ VALIDATION-SYSTEM.md
β”‚   β”œβ”€β”€ AGENT-VALIDATION.md   # NEW!
β”‚   └── [40+ more guides...]
β”‚
β”œβ”€β”€ scripts/                  # Automation & validation
β”‚   β”œβ”€β”€ run-agent-evals.js    # NEW! Agent evaluation
β”‚   β”œβ”€β”€ validate-full.sh      # Full validation suite
β”‚   └── [20+ more scripts...]
β”‚
└── tests/                    # Test suites & fixtures
    β”œβ”€β”€ fixtures/
    β”‚   └── golden-dataset-example.json  # NEW!
    └── [150+ test files...]

🎯 Usage Examples

Example 1: Starting a New Project

User: "I want to build a SaaS product for invoice management"

Claude uses:
1. product-strategist β†’ Validate problem-solution fit
2. mvp-builder β†’ Identify P0 features (invoicing, payment tracking)
3. frontend-builder β†’ React/Next.js structure
4. api-designer β†’ REST API design
5. deployment-advisor β†’ Vercel + Railway recommendation
6. security-engineer β†’ Auth, data encryption, PCI compliance

Example 2: Implementing AI Search

User: "Add AI-powered search to our documentation"

Claude uses:
1. rag-implementer β†’ RAG methodology
2. rag-pattern.md β†’ Advanced RAG architecture
3. vector-database-mcp β†’ Pinecone integration
4. embedding-generator-mcp β†’ OpenAI embeddings
5. semantic-search-mcp β†’ Search implementation

Example 3: Code Quality Audit

User: "Audit our codebase for quality issues"

Claude uses:
1. quality-auditor β†’ Comprehensive audit methodology
2. code-quality-scanner-mcp β†’ Static analysis
3. security-scanner-mcp β†’ Vulnerability detection
4. performance-profiler-mcp β†’ Performance bottlenecks
5. test-runner-mcp β†’ Test coverage analysis
6. agent-evaluator β†’ AI agent quality checks (NEW!)

πŸ” Finding Skills

By Task

# Search skills by keyword grep -i "authentication" meta/skill-registry.json grep -i "database" meta/skill-registry.json grep -i "testing" meta/skill-registry.json

By Category

View meta/skill-registry.json for complete categorization:

  • Product & Business (8 skills)
  • AI & Machine Learning (10 skills)
  • Frontend Development (6 skills)
  • Backend Development (8 skills)
  • Infrastructure & DevOps (8 skills)
  • Design & UX (12 skills)
  • Quality & Testing (12 skills)

Auto-Discovery

Skills activate automatically based on your conversation with Claude. Just describe what you want to build!


βš™οΈ Validation System

Two-Tier Approach

Tier 1: Quick Validation (10-30 seconds)

npm run validate:quick

Checks:

  • Registry consistency
  • Documentation accurac
5-Dim Analysis
Clarity8/10
Novelty7/10
Utility9/10
Completeness9/10
Maintainability8/10
Pros & Cons

Pros

  • Comprehensive validation system
  • Specialized AI skills
  • Automated agent evaluation
  • Extensive documentation

Cons

  • Complex setup for beginners
  • Potentially overwhelming documentation
  • Requires familiarity with AI concepts
  • Dependency on external tools

Related Skills

useful-ai-prompts

A
toolCo-Pilot
88/ 100

β€œA treasure trove of prompts, but don’t expect them to write your novel for you.”

mcpspy

A
toolCo-Pilot
86/ 100

β€œMCPSpy: because who doesn't want to spy on their AI's secrets?”

fastmcp

A
toolCo-Pilot
86/ 100

β€œFastMCP: because who doesn't love a little complexity with their AI?”

Disclaimer: This content is sourced from GitHub open source projects for display and rating purposes only.

Copyright belongs to the original author daffy0208.