Co-Pilot
Updated 25 days ago

claude-data-analysis-ultra-main

Lliangdabiao
0.1k
liangdabiao/claude-data-analysis-ultra-main
80
Agent Score

๐Ÿ’ก Summary

An AI-powered platform for comprehensive and automated data analysis using intuitive commands.

๐ŸŽฏ Target Audience

Data AnalystsBusiness Intelligence ProfessionalsResearchersMarketing AnalystsE-commerce Managers

๐Ÿค– AI Roast: โ€œPowerful, but the setup might scare off the impatient.โ€

Security AnalysisMedium Risk

Risk: Medium. Review: shell/CLI command execution; outbound network access (SSRF, data egress); filesystem read/write scope and path traversal. Run with least privilege and audit before enabling in production.

Claude Data Analysis Assistant

A modern, intelligent data analysis platform built with Claude Code's sub-agents, slash-commands, skills, and hooks. Transform your data analysis workflow with AI-powered assistance and specialized analysis tools.

็ฎ€ๅ•็š„ไธ€ๅฅ่ฏ๏ผš 2ไธชๅ‘ฝไปค๏ผŒ /do-all ๅธธ่ง„ๆ•ฐๆฎๅˆ†ๆž ๏ผ› /do-more ไบ’่”็ฝ‘ๆ•ฐๆฎๅˆ†ๆž ใ€‚ ่€Œๅˆ†ๆžๆ•ฐๆฎๆ˜ฏๆ”พๅœจ /data_storage ใ€‚ๅฐฑ่ฟ™ไนˆ็ฎ€ๅ•๏ผŒ็”จ่ตทๆฅๅง๏ผ

ๆณจๆ„๏ผš ไธ‹่ฝฝ้กน็›ฎไธ‹ๆฅ๏ผŒๅˆ†ๆžๆ•ฐๆฎๆ˜ฏๆ”พๅœจ /data_storage [ๅˆ ๅŽปๅŽŸๆฅ็š„demoๆ•ฐๆฎ] ๏ผŒไฝ ้œ€่ฆๅ…ˆๅˆ ้™ค complete_analysis ๅ’Œ do_more_analysis ่ฟ™ไธคไธชๆ–‡ไปถๅคนใ€‚ๆˆ‘่ฟ™้‡Œๆ”พ็€ๆ˜ฏ็ป™ไฝ ๅ‚่€ƒๆœ€็ปˆ็š„ๅˆ†ๆž็ป“ๆžœ๏ผŒไฝœไธบไพ‹ๅญใ€‚

๐Ÿš€ Quick Start

1. Set Up Your Data

Place your dataset in the data_storage/ directory:

cp your_data.csv ./data_storage/

2. Start Analysis

Use intuitive slash commands to analyze your data:

# Complete interactive workflow with human feedback checkpoints /do-all # โญ NEW: Automatic multi-skill analysis /do-more # Basic exploratory analysis /analyze user_behavior_sample.csv exploratory # Create visualizations /visualize user_behavior_sample.csv all # Generate analysis code /generate python data-cleaning # Create comprehensive report /report user_behavior_sample.csv complete markdown

๐ŸŽฏ Key Features

โญ /do-more vs /do-all: Which Should You Use?

/do-more: Automatic Multi-Skill Analysis

Best for: Quick, automated analysis without configuration

/do-more # No parameters needed!

What it does:

  • โœ… Automatically scans data_storage/ directory
  • โœ… Identifies data types (e-commerce, user behavior, etc.)
  • โœ… Intelligently matches 7+ relevant skills
  • โœ… Executes skills in optimal order
  • โœ… Generates comprehensive HTML report
  • โœ… No human intervention required
  • โœ… Fast execution (2-5 minutes)

Output: do_more_analysis/integrated_results/Comprehensive_Analysis_Report.html


/do-all: Complete Interactive Analysis Workflow

Best for: Thorough analysis with human oversight and feedback

/do-all

What it does:

  • โœ… Reads data from data_storage/ (no parameters needed!)
  • โœ… 6-stage workflow with quality checks
  • โœ… 3 Human feedback checkpoints at critical stages
  • โœ… Interactive hypothesis generation
  • โœ… Custom code generation
  • โœ… Comprehensive documentation
  • โœ… Multiple output formats (HTML, PDF, Markdown, DOCX)

Workflow Stages:

  1. Data Quality Assessment โ†’ โš ๏ธ [human checkpoint #1] - Confirm data quality
  2. Exploratory Analysis - Statistical summaries, patterns, trends
  3. Hypothesis Generation โ†’ โš ๏ธ [human checkpoint #2] - Review research directions
  4. Visualization โ†’ โš ๏ธ [human checkpoint #3] - Approve visualization strategy
  5. Code Generation - Reproducible analysis pipeline
  6. Report Generation - Comprehensive final report

Output Directory:

complete_analysis/
โ”œโ”€โ”€ data_quality_report/          # Stage 1 output
โ”œโ”€โ”€ exploratory_analysis/         # Stage 2 output
โ”œโ”€โ”€ hypothesis_reports/           # Stage 3 output
โ”œโ”€โ”€ visualizations/               # Stage 4 output
โ”œโ”€โ”€ generated_code/               # Stage 5 output
โ”œโ”€โ”€ final_report/                 # Stage 6 output
โ””โ”€โ”€ workflow_log/                 # Execution logs

Execution Time: 10-30 minutes (depends on data size)


Comparison Summary

| Feature | /do-more | /do-all | |---------|-----------|-----------| | Data Source | Auto-scans data_storage/ | Reads from data_storage/ | | Parameters | None required | None | | Human Feedback | No | Yes (3 checkpoints) | | Execution Time | 2-5 minutes | 10-30 minutes | | Skills Used | 7+ auto-selected | Complete workflow (no skills) | | Output Format | HTML report | Multi-format (HTML/PDF/MD/DOCX) | | Code Generation | No | Yes (complete pipeline) | | Analysis Stages | Integrated execution | 6 separate stages | | Interactive | No | Yes (at checkpoints) | | Report Detail | Comprehensive | Extensive + technical | | Best For | Quick insights | Thorough analysis | | Customization | Automatic | Interactive |

Specialized Analysis Skills

12 domain-specific skills for expert-level analysis:

Customer Analysis:

  • rfm-customer-segmentation - Customer value segmentation
  • ltv-predictor - Lifetime value prediction
  • retention-analysis - Customer retention and churn
  • user-profiling-analysis - User behavior profiling

Marketing Analysis:

  • attribution-analysis-modeling - Marketing attribution
  • growth-model-analyzer - Growth hacking analysis
  • ab-testing-analyzer - A/B test validation
  • funnel-analysis - Conversion funnels

Data Analysis:

  • data-exploration-visualization - Automated EDA
  • regression-analysis-modeling - Predictive modeling
  • content-analysis - Text and NLP analysis
  • recommender-system - Recommendation engines

Intelligent Sub-Agents

  • data-explorer: Expert statistical analysis and pattern discovery
  • visualization-specialist: Beautiful, insightful charts and graphs
  • code-generator: Production-ready analysis code
  • report-writer: Comprehensive analysis reports
  • quality-assurance: Data validation and quality control
  • hypothesis-generator: Research hypothesis and insights

Intuitive Slash Commands

  • /do-more - โญ RECOMMENDED Automatic multi-skill analysis (no parameters)
  • /do-all - Complete interactive workflow with human feedback (no parameters)
  • /analyze [dataset] [type] - Perform data analysis
  • /visualize [dataset] [type] - Create visualizations
  • /generate [language] [type] - Generate analysis code
  • /report [dataset] [format] - Generate reports
  • /quality [dataset] [action] - Quality assurance
  • /hypothesis [dataset] [domain] - Generate hypotheses

Automated Workflows

  • Data Validation: Automatic quality checks on data upload
  • Smart Context: Project-aware analysis suggestions
  • Reproducible Analysis: Complete documentation and code generation
  • Beautiful Reports: HTML, Markdown, and PDF output formats

๐Ÿ“Š Usage Examples

โญ Automatic Multi-Skill Analysis

# Easiest way - no parameters needed! /do-more # Output (2-5 minutes): # do_more_analysis/integrated_results/ # โ””โ”€โ”€ Comprehensive_Analysis_Report.html

Interactive Complete Analysis

# For thorough analysis with human feedback checkpoints /do-all # Includes: # โœ“ Data Quality Assessment โ†’ [your confirmation] # โœ“ Exploratory Analysis # โœ“ Hypothesis Generation โ†’ [your approval] # โœ“ Visualizations โ†’ [your review] # โœ“ Code Generation # โœ“ Comprehensive Report

E-commerce Data Analysis

# Quick automated analysis /do-more # Or specific customer analysis /rfm-customer-segmentation olist_orders.csv /ltv-predictor order_items.csv /retention-analysis orders.csv customers.csv

User Behavior Analysis

# Complete analysis workflow /analyze user_behavior.csv exploratory /visualize user_behavior.csv trends /quality user_behavior.csv clean /report user_behavior.csv complete html /generate python user-segmentation

Sales Data Analysis

# Sales performance analysis /analyze sales_data.csv statistical /visualize sales_data.csv trends /generate sql revenue-analysis /report sales_data.csv executive pdf

Customer Analytics

# Customer segmentation /analyze customer_data.csv predictive /visualize customer_data.csv distribution /generate r clustering-analysis /hypothesis customer_data churn-prediction

๐Ÿ› ๏ธ Project Structure

claude-data-analysis/
โ”œโ”€โ”€ .claude/
โ”‚   โ”œโ”€โ”€ agents/          # Sub-agent configurations
โ”‚   โ”œโ”€โ”€ commands/        # Slash command definitions
โ”‚   โ”‚   โ”œโ”€โ”€ do-more.md   # โญ NEW! Automatic multi-skill analysis
โ”‚   โ”œโ”€โ”€ hooks/          # Automation scripts
โ”‚   โ”œโ”€โ”€ settings.json   # Claude Code settings
โ”‚   โ””โ”€โ”€ skills/         # โญ 12 Specialized analysis skills
โ”‚       โ”œโ”€โ”€ rfm-customer-segmentation/
โ”‚       โ”œโ”€โ”€ ltv-predictor/
โ”‚       โ”œโ”€โ”€ retention-analysis/
โ”‚       โ”œโ”€โ”€ funnel-analysis/
โ”‚       โ”œโ”€โ”€ growth-model-analyzer/
โ”‚       โ”œโ”€โ”€ content-analysis/
โ”‚       โ””โ”€โ”€ ... (9 more skills)
โ”œโ”€โ”€ data_storage/       # Your data files
โ”‚   โ”œโ”€โ”€ Orders.csv
โ”‚   โ”œโ”€โ”€ Customers.csv
โ”‚   โ””โ”€โ”€ ... (Olist datasets included)
โ”œโ”€โ”€ do_more_analysis/   # โญ NEW! /do-more output directory
โ”‚   โ”œโ”€โ”€ skill_execution/  # Individual skill results
โ”‚   โ”‚   โ”œโ”€โ”€ data-exploration-visualization/
โ”‚   โ”‚   โ”œโ”€โ”€ rfm-customer-segmentation/
โ”‚   โ”‚   โ”œโ”€โ”€ ltv-predictor/
โ”‚   โ”‚   โ”œโ”€โ”€ retention-analysis/
โ”‚   โ”‚   โ”œโ”€โ”€ funnel-analysis/
โ”‚   โ”‚   โ”œโ”€โ”€ growth-model-analyzer/
โ”‚   โ”‚   โ””โ”€โ”€ content-analysis/
โ”‚   โ””โ”€โ”€ integrated_results/
โ”‚       โ””โ”€โ”€ Comprehensive_Analysis_Report.html  # โญ Interactive report
โ”œโ”€โ”€ analysis_reports/   # Generated analysis reports
โ”œโ”€โ”€ visualizations/     # Generated charts
โ”œโ”€โ”€ generated_code/     # Analysis code
โ””โ”€โ”€ examples/          # Example datasets

๐ŸŽจ Sample Data

The project includes Olist Brazilian E-commerce datasets in data_storage/:

  • Orders.csv (99,441 records): Order information, status, timestamps
  • Customers.csv (99,441 records): Customer demographics, location
  • Order Items.csv: Order details, products, pricing
  • Order Payments.csv: Payment methods, installments
  • Products.csv: Product catalog, categories
  • Reviews.csv (99,224 records): Customer reviews, ratings, comments
  • Categories.csv: Product categories
  • Sellers.csv: Seller information
  • Geolocation.csv: Geographic data

Sample Workflow:

# 1. Data already in data_storage/ # 2. Run automatic analysis /do-more # 3. View results # Open: do_more_analysis/integrated_results/Comprehensive_Analysis_Report.html

๐Ÿ”ง Configuration

Environment Setup

The project uses Claude Code's configuration system. Key settings:

  1. Hooks: Automated validation and context loading
  2. Sub-agents: Specialized AI assistants for different tasks
  3. Commands: Custom slash commands for common operations

Requirements

  • Python 3.8+ for data analysis
  • Claude Code with sub-agents enabled
  • Data files in CSV, JSON, or Excel format

๐Ÿ“š Getting Started Guide

For New Users

  1. Place your data in data_storage/
  2. **Run exploratory a
5-Dim Analysis
Clarity8/10
Novelty8/10
Utility9/10
Completeness8/10
Maintainability7/10
Pros & Cons

Pros

  • Intuitive slash commands for easy use.
  • Automated multi-skill analysis saves time.
  • Comprehensive reporting in multiple formats.

Cons

  • Requires data to be in specific formats.
  • May need human oversight for complex analyses.
  • Dependency on Claude Code framework.

Related Skills

spark-engineer

A
toolCo-Pilot
86/ 100

โ€œThis skill is like a Swiss Army knife for big dataโ€”just don't expect it to cut through all the noise.โ€

whodb

A
toolCo-Pilot
84/ 100

โ€œPowerful, but the setup might scare off the impatient.โ€

exa-search

A
toolCo-Pilot
84/ 100

โ€œPowerful, but the setup might scare off the impatient.โ€

Disclaimer: This content is sourced from GitHub open source projects for display and rating purposes only.

Copyright belongs to the original author liangdabiao.