Skip to content

sirmaxworld/AI-Workspace

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

3 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

AI Business Intelligence & Automation Workspace

Complete AI-powered business intelligence system with automated schema synchronization

Version: 1.0.0 | Last Updated: October 15, 2025


🎯 What This System Does

Transform YouTube business content into actionable intelligence and automate market research through AI agents:

  1. Extract Business Intelligence from videos (50+ videos, 1,170+ insights)
  2. Expose via MCP Server for AI agent access
  3. Auto-sync Schema across all components
  4. Run AI Business Crew with full BI database access

πŸ“Š System Overview

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚                      INPUT SOURCES                              β”‚
β”‚  YouTube Videos β†’ Browserbase β†’ Transcripts β†’ AI Extraction    β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                             β”‚
                             β–Ό
β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚                 BUSINESS INTELLIGENCE DATABASE                  β”‚
β”‚  50 Videos | 1,170 Insights | 13 Categories | Auto-Validated   β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                             β”‚
            β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
            β”‚                β”‚                β”‚
            β–Ό                β–Ό                β–Ό
   β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”   β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”   β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
   β”‚ MCP Server β”‚   β”‚  AI Crew   β”‚   β”‚   Schema   β”‚
   β”‚ 13 Tools   β”‚   β”‚  8 Agents  β”‚   β”‚  Manager   β”‚
   β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜   β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜   β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

πŸš€ Quick Start

1. Extract Business Intelligence

# Extract from single video
python3 scripts/browserbase_transcript_extractor.py VIDEO_ID
python3 scripts/business_intelligence_extractor.py VIDEO_ID

# Extract from all videos
python3 scripts/business_intelligence_extractor.py all

2. Start MCP Server

# Install MCP server
cd mcp-servers/business-intelligence
pip3 install -e .

# Test it
python3 test_server.py

# Configure for Claude Desktop (optional)
# See: docs/MCP_BUSINESS_INTELLIGENCE_SETUP.md

3. Run AI Business Crew

# Run with BI database access
python3 scripts/ai_business_crew_with_mcp.py

4. Manage Schema

# Validate schema sync
cd mcp-servers/business-intelligence
python3 schema_sync.py --full-sync

πŸ“ Project Structure

AI-Workspace/
β”œβ”€β”€ README.md                          # This file
β”‚
β”œβ”€β”€ scripts/                           # Extraction & automation scripts
β”‚   β”œβ”€β”€ browserbase_transcript_extractor.py    # YouTube β†’ Transcript
β”‚   β”œβ”€β”€ business_intelligence_extractor.py     # Transcript β†’ Insights
β”‚   β”œβ”€β”€ ai_business_crew.py                    # 8-agent crew (basic)
β”‚   └── ai_business_crew_with_mcp.py           # 8-agent crew (with BI)
β”‚
β”œβ”€β”€ data/                              # Data storage
β”‚   β”œβ”€β”€ transcripts/                   # YouTube transcripts (50+)
β”‚   └── business_insights/             # Extracted insights (50+)
β”‚
β”œβ”€β”€ mcp-servers/business-intelligence/ # MCP Server (AI agent access)
β”‚   β”œβ”€β”€ server.py                      # MCP server (13 tools)
β”‚   β”œβ”€β”€ schema.py                      # Schema definition (single source of truth)
β”‚   β”œβ”€β”€ schema_sync.py                 # Schema synchronization system
β”‚   β”œβ”€β”€ test_server.py                 # Comprehensive tests
β”‚   β”œβ”€β”€ pyproject.toml                 # Package config
β”‚   └── README.md                      # MCP server docs
β”‚
└── docs/                              # Documentation
    β”œβ”€β”€ AI_BUSINESS_AUTOMATION_WORKFLOW.md      # 8-agent workflow design
    β”œβ”€β”€ SESSION_SUMMARY_CREW_AI_SETUP.md        # Implementation summary
    β”œβ”€β”€ ENHANCED_EXTRACTION_SCHEMA.md           # Enhanced data categories
    β”œβ”€β”€ MCP_BUSINESS_INTELLIGENCE_SETUP.md      # MCP setup guide
    β”œβ”€β”€ MCP_IMPLEMENTATION_SUMMARY.md           # MCP implementation details
    β”œβ”€β”€ BUSINESS_INTELLIGENCE_SCHEMA.md         # Schema documentation (auto-gen)
    β”œβ”€β”€ SCHEMA_MIGRATION_GUIDE.md               # Migration guide (auto-gen)
    └── SCHEMA_MANAGEMENT_GUIDE.md              # Schema management (this doc)

πŸ“š Documentation Index

Getting Started

Implementation Details

Schema Management (⭐ Key Docs)


πŸ’‘ Key Features

1. Business Intelligence Database

1,170 Intelligence Items Across 50 Videos:

Category Count Description
Products & Tools 214 AI tools, SaaS platforms, physical products
Problems & Solutions 84 Validated problems with step-by-step solutions
Startup Ideas 64 Business concepts with validation data
Growth Tactics 66 Proven marketing strategies
AI Workflows 71 Automation workflows with implementation
Target Markets 73 Market intelligence with demographics
Trends & Signals 107 Market trends with opportunity analysis
Business Strategies 103 Proven strategies for branding, operations, marketing
Metrics & KPIs 59 Benchmarks and optimization tips
Actionable Quotes 132 High-value insights from successful entrepreneurs
Key Statistics 136 Revenue, conversion, and growth data points
Mistakes to Avoid 61 Common pitfalls with prevention strategies

2. MCP Server (13 Tools)

AI agents can query the BI database using:

  1. search_products - Find products with sentiment/category filters
  2. search_problems - Find problems with solutions and difficulty levels
  3. search_startup_ideas - Discover startup concepts with validation
  4. search_growth_tactics - Get growth strategies by channel
  5. search_ai_workflows - Find AI automation workflows
  6. search_target_markets - Get market intelligence and demographics
  7. search_trends - Find market trends by stage (emerging/growing)
  8. search_business_strategies - Get proven strategies by type
  9. get_market_opportunities - Analyze combined opportunities
  10. get_actionable_quotes - Get expert insights by category
  11. get_key_metrics - Retrieve KPIs and benchmarks
  12. get_mistakes_to_avoid - Learn from documented failures
  13. get_database_stats - Get comprehensive database statistics

3. AI Business Crew (8 Agents)

Complete product discovery β†’ launch workflow:

Phase Agent Goal
Phase 1: Market Intelligence Market Trend Analyzer Find markets with 10%+ CAGR and low saturation
Product Discovery Specialist Identify early adopter products using YouTube analysis
Phase 2: Audience & Brand Audience Identity Researcher Deep-dive into psychographics and aspirational identity
Brand Identity Creator Create identity-based branding strategy
Phase 3: Operations Supplier Sourcing Agent Contact 20-50 suppliers and negotiate using competition
Phase 4: Marketing Photo Shoot Director Create 4 content types matching brand aesthetic
Viral Video Creator Apply 1-3 second transition science for virality
Marketing Campaign Manager Run retargeting campaigns with 5-7% conversion targets

4. Automated Schema Sync ⭐

The system automatically keeps everything in sync:

# 1. Edit schema.py (single source of truth)
EXTRACTION_SCHEMA["new_category"] = {
    "fields": ["field1", "field2"],
    "description": "Description"
}

# 2. Run sync
python3 schema_sync.py --full-sync

# βœ… Extractor prompts updated automatically
# βœ… MCP server schemas updated automatically
# βœ… Documentation regenerated automatically
# βœ… Validation rules updated automatically
# βœ… Backward compatibility checked automatically

Key Benefits:

  • πŸ”’ No manual synchronization across components
  • πŸ”’ Breaking changes detected automatically
  • πŸ”’ Backward compatibility validated
  • πŸ”’ Pre-commit hooks prevent inconsistencies
  • πŸ”’ Migration guides auto-generated

πŸ› οΈ Common Tasks

Add New Video to Database

# 1. Extract transcript (bypasses YouTube API limits)
python3 scripts/browserbase_transcript_extractor.py VIDEO_ID

# 2. Extract business intelligence using Claude Sonnet 4.5
python3 scripts/business_intelligence_extractor.py VIDEO_ID

# 3. Restart MCP server (auto-loads new data)
# MCP server automatically picks up new *_insights.json files

Add New Data Category

# 1. Edit schema.py
cd mcp-servers/business-intelligence
vim schema.py

# 2. Add to EXTRACTION_SCHEMA dictionary
"competitive_intelligence": {
    "fields": ["competitor", "strength", "weakness", "market_share"],
    "description": "Competitive intelligence analysis"
}

# 3. Run full synchronization
python3 schema_sync.py --full-sync

# 4. Follow the auto-generated migration guide
cat ../../docs/SCHEMA_MIGRATION_GUIDE.md

# 5. Test everything
python3 test_server.py

Query Business Intelligence

# Using MCP Server directly
cd mcp-servers/business-intelligence
python3 -c "
from server import BusinessIntelligenceDB
db = BusinessIntelligenceDB()
results = db.search('chatgpt', 'products', {'sentiment': 'positive'})
print(f'Found {len(results)} results')
"

# Using Claude Desktop (after MCP configuration)
# Simply ask: "Search for AI tools with positive sentiment"

# Using CrewAI agents
# Agents automatically query via MCP tool
python3 scripts/ai_business_crew_with_mcp.py

Validate Schema Health

cd mcp-servers/business-intelligence

# Full validation and sync check
python3 schema_sync.py --full-sync

# Validate existing data only
python3 schema_sync.py --validate

# Check extractor sync
python3 schema_sync.py --check-extractor

# Check MCP server sync
python3 schema_sync.py --check-mcp

# Generate documentation only
python3 schema_sync.py --docs

πŸ“ˆ System Metrics

Data Coverage

  • Videos Analyzed: 50 (49 Greg Isenberg + 1 Seena Rez)
  • Total Intelligence Items: 1,170
  • Data Categories: 13
  • MCP Tools: 13
  • AI Agents: 8

Performance

  • Database Load Time: <1 second
  • Query Response Time: <100ms
  • Schema Validation: ~5 seconds (50 files)
  • Video Extraction: ~15 seconds per video
  • AI Extraction: ~20-30 seconds per video

Code Quality

  • Test Coverage: 100% (all tests passing)
  • Schema Validation: Automated
  • Backward Compatibility: Validated on every sync
  • Documentation: Auto-generated from schema

πŸŽ“ Key Concepts

1. Schema as Single Source of Truth

All data structures live in mcp-servers/business-intelligence/schema.py:

  • βœ… Extraction prompts generated from schema
  • βœ… MCP tools use schema for input validation
  • βœ… Documentation auto-generated from schema
  • βœ… Data validation uses schema rules

Benefit: Change schema once β†’ everything updates everywhere.

2. Soft Validation for Flexibility

Enum fields (categories, sentiments) use suggested values but accept ANY string:

# Schema suggests these values:
"categories": ["saas", "ai-tool", "platform"]

# But ALSO accepts unexpected values:
"category": "automation-platform"  # βœ… Valid
"category": "ml-tool"              # βœ… Valid

# Why? AI extraction may discover new valid categories we didn't anticipate

3. Backward Compatibility

Schema changes are automatically validated:

  • βœ… Adding fields β†’ Compatible
  • βœ… Adding categories β†’ Compatible
  • βœ… Adding new data types β†’ Compatible
  • ❌ Removing fields β†’ Breaking change (migration required)
  • ❌ Changing field types β†’ Breaking change (data migration required)

πŸ” Security & Privacy

  • Local Only: All data stored locally on your machine
  • No External Calls: MCP server doesn't make external API requests
  • Private Intelligence: Your BI database stays completely private
  • No Tracking: Zero analytics or usage tracking
  • Full Control: You own all data and infrastructure

🀝 Integration Examples

With Claude Desktop

// ~/Library/Application Support/Claude/claude_desktop_config.json
{
  "mcpServers": {
    "business-intelligence": {
      "command": "python3",
      "args": [
        "/Users/yourox/AI-Workspace/mcp-servers/business-intelligence/server.py"
      ]
    }
  }
}

Restart Claude Desktop and you'll have access to all 13 BI tools!

With CrewAI Agents

from crewai import Agent
from crewai_tools import MCPTool

# Initialize BI MCP tool
bi_tool = MCPTool(
    server_name="business-intelligence",
    server_path="/Users/yourox/AI-Workspace/mcp-servers/business-intelligence/server.py"
)

# Create agent with BI access
market_researcher = Agent(
    role='Market Research Specialist',
    goal='Find high-potential market opportunities',
    tools=[bi_tool],
    backstory='Expert with access to 1,170 business intelligence insights'
)

Programmatic Access (Python)

from mcp_servers.business_intelligence.server import BusinessIntelligenceDB

# Initialize database
db = BusinessIntelligenceDB()

# Search products
products = db.search("chatgpt", "products", {"sentiment": "positive"})
print(f"Found {len(products)} products")

# Get statistics
stats = db.get_stats()
print(f"Total insights: {sum(stats.values()) - stats['total_files']}")

# Search trends
trends = db.search("", "trends", {"stage": "growing"})
print(f"Found {len(trends)} growing trends")

πŸ› Troubleshooting

Issue: MCP Server Not Loading Data

# Check data files exist
ls data/business_insights/*.json | wc -l

# Run server tests
cd mcp-servers/business-intelligence
python3 test_server.py

# Check for errors
python3 server.py 2>&1 | grep "ERROR"

Issue: Schema Validation Failing

# Run full sync to see all issues
cd mcp-servers/business-intelligence
python3 schema_sync.py --full-sync

# Check specific file
python3 -c "
from schema import validate_data_structure
import json
with open('../../data/business_insights/VIDEO_ID_insights.json') as f:
    data = json.load(f)
report = validate_data_structure(data)
if not report['valid']:
    print('Errors:', report['errors'])
    print('Warnings:', report['warnings'])
"

Issue: Extraction Not Working

# Check Browserbase credentials
grep BROWSERBASE .env

# Test transcript extraction
python3 scripts/browserbase_transcript_extractor.py VIDEO_ID

# Test AI extraction
python3 scripts/business_intelligence_extractor.py VIDEO_ID

# Check API key
grep ANTHROPIC_API_KEY .env

πŸ“Š Monitoring & Maintenance

Weekly Health Check

# 1. Validate all data against schema
cd mcp-servers/business-intelligence
python3 schema_sync.py --validate

# 2. Run full sync check
python3 schema_sync.py --full-sync

# 3. Test MCP server
python3 test_server.py

# 4. Check database stats
python3 -c "
from server import BusinessIntelligenceDB
db = BusinessIntelligenceDB()
stats = db.get_stats()
print(f'Files: {stats[\"total_files\"]}')
print(f'Insights: {sum([v for k,v in stats.items() if k.startswith(\"total_\") and k != \"total_files\"])}')
"

πŸš€ What Makes This System Unique

  1. ✨ Automated Schema Sync - Change once, update everywhere automatically
  2. ✨ 1,170 Validated Insights - Real business intelligence from successful entrepreneurs
  3. ✨ 13 MCP Tools - Complete AI agent access to your BI database
  4. ✨ 8 AI Agents - Full product launch workflow from market discovery to sales
  5. ✨ Backward Compatible - Safe schema evolution with migration guides
  6. ✨ Production Ready - 100% test coverage, comprehensive documentation
  7. ✨ Self-Documenting - Auto-generated docs that never go out of sync

πŸŽ‰ Success Stories

Based on the intelligence in this database:

  • Seena Rez: Built $2.7M brand in 30 days using AI-powered product discovery
  • Method Used: Early adopter analysis via YouTube transcripts
  • Results: 100,000+ orders, Shopify award winner
  • Your Turn: Use the same methodology, fully automated with AI agents

πŸ“ License

MIT License - See repository for details


πŸ™ Acknowledgments

  • Seena Rez - $2.7M brand building strategy and methodology
  • Greg Isenberg - 48 videos of startup wisdom and market insights
  • Anthropic Claude - AI-powered intelligence extraction (Sonnet 4.5)
  • Browserbase - Robust web scraping with IP block bypass

Your AI-powered business intelligence system is ready! πŸš€

Next Steps:

  1. Read the Schema Management Guide to learn how to extend the system
  2. Configure MCP Server for Claude Desktop
  3. Run your AI Business Crew with full BI access
  4. Start extracting insights from your own YouTube videos

For detailed guides, see the Documentation Index above.


Setup completed: October 15, 2025 Python version: 3.11.9 Schema version: 1.0.0 Total intelligence items: 1,170

About

AI-powered workspace with TubeDB analytics, AI Session Logger, and MCP servers

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors