claude-data-analysis-ultra-main
💡 摘要
一个基于AI的综合自动数据分析平台,使用直观的命令。
🎯 适合人群
🤖 AI 吐槽: “看起来很能打,但别让配置把人劝退。”
风险:Medium。建议检查:是否执行 shell/命令行指令;是否发起外网请求(SSRF/数据外发);文件读写范围与路径穿越风险。以最小权限运行,并在生产环境启用前审计代码与依赖。
Claude Data Analysis Assistant
A modern, intelligent data analysis platform built with Claude Code's sub-agents, slash-commands, skills, and hooks. Transform your data analysis workflow with AI-powered assistance and specialized analysis tools.
简单的一句话: 2个命令, /do-all 常规数据分析 ; /do-more 互联网数据分析 。 而分析数据是放在 /data_storage 。就这么简单,用起来吧!
注意: 下载项目下来,分析数据是放在 /data_storage [删去原来的demo数据] ,你需要先删除 complete_analysis 和 do_more_analysis 这两个文件夹。我这里放着是给你参考最终的分析结果,作为例子。
🚀 Quick Start
1. Set Up Your Data
Place your dataset in the data_storage/ directory:
cp your_data.csv ./data_storage/
2. Start Analysis
Use intuitive slash commands to analyze your data:
# Complete interactive workflow with human feedback checkpoints /do-all # ⭐ NEW: Automatic multi-skill analysis /do-more # Basic exploratory analysis /analyze user_behavior_sample.csv exploratory # Create visualizations /visualize user_behavior_sample.csv all # Generate analysis code /generate python data-cleaning # Create comprehensive report /report user_behavior_sample.csv complete markdown
🎯 Key Features
⭐ /do-more vs /do-all: Which Should You Use?
/do-more: Automatic Multi-Skill Analysis
Best for: Quick, automated analysis without configuration
/do-more # No parameters needed!
What it does:
- ✅ Automatically scans
data_storage/directory - ✅ Identifies data types (e-commerce, user behavior, etc.)
- ✅ Intelligently matches 7+ relevant skills
- ✅ Executes skills in optimal order
- ✅ Generates comprehensive HTML report
- ✅ No human intervention required
- ✅ Fast execution (2-5 minutes)
Output: do_more_analysis/integrated_results/Comprehensive_Analysis_Report.html
/do-all: Complete Interactive Analysis Workflow
Best for: Thorough analysis with human oversight and feedback
/do-all
What it does:
- ✅ Reads data from
data_storage/(no parameters needed!) - ✅ 6-stage workflow with quality checks
- ✅ 3 Human feedback checkpoints at critical stages
- ✅ Interactive hypothesis generation
- ✅ Custom code generation
- ✅ Comprehensive documentation
- ✅ Multiple output formats (HTML, PDF, Markdown, DOCX)
Workflow Stages:
- Data Quality Assessment → ⚠️ [human checkpoint #1] - Confirm data quality
- Exploratory Analysis - Statistical summaries, patterns, trends
- Hypothesis Generation → ⚠️ [human checkpoint #2] - Review research directions
- Visualization → ⚠️ [human checkpoint #3] - Approve visualization strategy
- Code Generation - Reproducible analysis pipeline
- Report Generation - Comprehensive final report
Output Directory:
complete_analysis/
├── data_quality_report/ # Stage 1 output
├── exploratory_analysis/ # Stage 2 output
├── hypothesis_reports/ # Stage 3 output
├── visualizations/ # Stage 4 output
├── generated_code/ # Stage 5 output
├── final_report/ # Stage 6 output
└── workflow_log/ # Execution logs
Execution Time: 10-30 minutes (depends on data size)
Comparison Summary
| Feature | /do-more | /do-all |
|---------|-----------|-----------|
| Data Source | Auto-scans data_storage/ | Reads from data_storage/ |
| Parameters | None required | None |
| Human Feedback | No | Yes (3 checkpoints) |
| Execution Time | 2-5 minutes | 10-30 minutes |
| Skills Used | 7+ auto-selected | Complete workflow (no skills) |
| Output Format | HTML report | Multi-format (HTML/PDF/MD/DOCX) |
| Code Generation | No | Yes (complete pipeline) |
| Analysis Stages | Integrated execution | 6 separate stages |
| Interactive | No | Yes (at checkpoints) |
| Report Detail | Comprehensive | Extensive + technical |
| Best For | Quick insights | Thorough analysis |
| Customization | Automatic | Interactive |
Specialized Analysis Skills
12 domain-specific skills for expert-level analysis:
Customer Analysis:
rfm-customer-segmentation- Customer value segmentationltv-predictor- Lifetime value predictionretention-analysis- Customer retention and churnuser-profiling-analysis- User behavior profiling
Marketing Analysis:
attribution-analysis-modeling- Marketing attributiongrowth-model-analyzer- Growth hacking analysisab-testing-analyzer- A/B test validationfunnel-analysis- Conversion funnels
Data Analysis:
data-exploration-visualization- Automated EDAregression-analysis-modeling- Predictive modelingcontent-analysis- Text and NLP analysisrecommender-system- Recommendation engines
Intelligent Sub-Agents
- data-explorer: Expert statistical analysis and pattern discovery
- visualization-specialist: Beautiful, insightful charts and graphs
- code-generator: Production-ready analysis code
- report-writer: Comprehensive analysis reports
- quality-assurance: Data validation and quality control
- hypothesis-generator: Research hypothesis and insights
Intuitive Slash Commands
/do-more- ⭐ RECOMMENDED Automatic multi-skill analysis (no parameters)/do-all- Complete interactive workflow with human feedback (no parameters)/analyze [dataset] [type]- Perform data analysis/visualize [dataset] [type]- Create visualizations/generate [language] [type]- Generate analysis code/report [dataset] [format]- Generate reports/quality [dataset] [action]- Quality assurance/hypothesis [dataset] [domain]- Generate hypotheses
Automated Workflows
- Data Validation: Automatic quality checks on data upload
- Smart Context: Project-aware analysis suggestions
- Reproducible Analysis: Complete documentation and code generation
- Beautiful Reports: HTML, Markdown, and PDF output formats
📊 Usage Examples
⭐ Automatic Multi-Skill Analysis
# Easiest way - no parameters needed! /do-more # Output (2-5 minutes): # do_more_analysis/integrated_results/ # └── Comprehensive_Analysis_Report.html
Interactive Complete Analysis
# For thorough analysis with human feedback checkpoints /do-all # Includes: # ✓ Data Quality Assessment → [your confirmation] # ✓ Exploratory Analysis # ✓ Hypothesis Generation → [your approval] # ✓ Visualizations → [your review] # ✓ Code Generation # ✓ Comprehensive Report
E-commerce Data Analysis
# Quick automated analysis /do-more # Or specific customer analysis /rfm-customer-segmentation olist_orders.csv /ltv-predictor order_items.csv /retention-analysis orders.csv customers.csv
User Behavior Analysis
# Complete analysis workflow /analyze user_behavior.csv exploratory /visualize user_behavior.csv trends /quality user_behavior.csv clean /report user_behavior.csv complete html /generate python user-segmentation
Sales Data Analysis
# Sales performance analysis /analyze sales_data.csv statistical /visualize sales_data.csv trends /generate sql revenue-analysis /report sales_data.csv executive pdf
Customer Analytics
# Customer segmentation /analyze customer_data.csv predictive /visualize customer_data.csv distribution /generate r clustering-analysis /hypothesis customer_data churn-prediction
🛠️ Project Structure
claude-data-analysis/
├── .claude/
│ ├── agents/ # Sub-agent configurations
│ ├── commands/ # Slash command definitions
│ │ ├── do-more.md # ⭐ NEW! Automatic multi-skill analysis
│ ├── hooks/ # Automation scripts
│ ├── settings.json # Claude Code settings
│ └── skills/ # ⭐ 12 Specialized analysis skills
│ ├── rfm-customer-segmentation/
│ ├── ltv-predictor/
│ ├── retention-analysis/
│ ├── funnel-analysis/
│ ├── growth-model-analyzer/
│ ├── content-analysis/
│ └── ... (9 more skills)
├── data_storage/ # Your data files
│ ├── Orders.csv
│ ├── Customers.csv
│ └── ... (Olist datasets included)
├── do_more_analysis/ # ⭐ NEW! /do-more output directory
│ ├── skill_execution/ # Individual skill results
│ │ ├── data-exploration-visualization/
│ │ ├── rfm-customer-segmentation/
│ │ ├── ltv-predictor/
│ │ ├── retention-analysis/
│ │ ├── funnel-analysis/
│ │ ├── growth-model-analyzer/
│ │ └── content-analysis/
│ └── integrated_results/
│ └── Comprehensive_Analysis_Report.html # ⭐ Interactive report
├── analysis_reports/ # Generated analysis reports
├── visualizations/ # Generated charts
├── generated_code/ # Analysis code
└── examples/ # Example datasets
🎨 Sample Data
The project includes Olist Brazilian E-commerce datasets in data_storage/:
- Orders.csv (99,441 records): Order information, status, timestamps
- Customers.csv (99,441 records): Customer demographics, location
- Order Items.csv: Order details, products, pricing
- Order Payments.csv: Payment methods, installments
- Products.csv: Product catalog, categories
- Reviews.csv (99,224 records): Customer reviews, ratings, comments
- Categories.csv: Product categories
- Sellers.csv: Seller information
- Geolocation.csv: Geographic data
Sample Workflow:
# 1. Data already in data_storage/ # 2. Run automatic analysis /do-more # 3. View results # Open: do_more_analysis/integrated_results/Comprehensive_Analysis_Report.html
🔧 Configuration
Environment Setup
The project uses Claude Code's configuration system. Key settings:
- Hooks: Automated validation and context loading
- Sub-agents: Specialized AI assistants for different tasks
- Commands: Custom slash commands for common operations
Requirements
- Python 3.8+ for data analysis
- Claude Code with sub-agents enabled
- Data files in CSV, JSON, or Excel format
📚 Getting Started Guide
For New Users
- Place your data in
data_storage/ - **Run exploratory a
优点
- 直观的斜杠命令,易于使用。
- 自动化的多技能分析节省时间。
- 多种格式的综合报告。
缺点
- 需要数据以特定格式存在。
- 复杂分析可能需要人工监督。
- 依赖Claude Code框架。
相关技能
免责声明:本内容来源于 GitHub 开源项目,仅供展示和评分分析使用。
版权归原作者所有 liangdabiao.
