Complete AI-powered business intelligence system with automated schema synchronization
Version: 1.0.0 | Last Updated: October 15, 2025
Transform YouTube business content into actionable intelligence and automate market research through AI agents:
- Extract Business Intelligence from videos (50+ videos, 1,170+ insights)
- Expose via MCP Server for AI agent access
- Auto-sync Schema across all components
- Run AI Business Crew with full BI database access
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β INPUT SOURCES β
β YouTube Videos β Browserbase β Transcripts β AI Extraction β
ββββββββββββββββββββββββββββββ¬βββββββββββββββββββββββββββββββββββββ
β
βΌ
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β BUSINESS INTELLIGENCE DATABASE β
β 50 Videos | 1,170 Insights | 13 Categories | Auto-Validated β
ββββββββββββββββββββββββββββββ¬βββββββββββββββββββββββββββββββββββββ
β
ββββββββββββββββββΌβββββββββββββββββ
β β β
βΌ βΌ βΌ
ββββββββββββββ ββββββββββββββ ββββββββββββββ
β MCP Server β β AI Crew β β Schema β
β 13 Tools β β 8 Agents β β Manager β
ββββββββββββββ ββββββββββββββ ββββββββββββββ
# Extract from single video
python3 scripts/browserbase_transcript_extractor.py VIDEO_ID
python3 scripts/business_intelligence_extractor.py VIDEO_ID
# Extract from all videos
python3 scripts/business_intelligence_extractor.py all# Install MCP server
cd mcp-servers/business-intelligence
pip3 install -e .
# Test it
python3 test_server.py
# Configure for Claude Desktop (optional)
# See: docs/MCP_BUSINESS_INTELLIGENCE_SETUP.md# Run with BI database access
python3 scripts/ai_business_crew_with_mcp.py# Validate schema sync
cd mcp-servers/business-intelligence
python3 schema_sync.py --full-syncAI-Workspace/
βββ README.md # This file
β
βββ scripts/ # Extraction & automation scripts
β βββ browserbase_transcript_extractor.py # YouTube β Transcript
β βββ business_intelligence_extractor.py # Transcript β Insights
β βββ ai_business_crew.py # 8-agent crew (basic)
β βββ ai_business_crew_with_mcp.py # 8-agent crew (with BI)
β
βββ data/ # Data storage
β βββ transcripts/ # YouTube transcripts (50+)
β βββ business_insights/ # Extracted insights (50+)
β
βββ mcp-servers/business-intelligence/ # MCP Server (AI agent access)
β βββ server.py # MCP server (13 tools)
β βββ schema.py # Schema definition (single source of truth)
β βββ schema_sync.py # Schema synchronization system
β βββ test_server.py # Comprehensive tests
β βββ pyproject.toml # Package config
β βββ README.md # MCP server docs
β
βββ docs/ # Documentation
βββ AI_BUSINESS_AUTOMATION_WORKFLOW.md # 8-agent workflow design
βββ SESSION_SUMMARY_CREW_AI_SETUP.md # Implementation summary
βββ ENHANCED_EXTRACTION_SCHEMA.md # Enhanced data categories
βββ MCP_BUSINESS_INTELLIGENCE_SETUP.md # MCP setup guide
βββ MCP_IMPLEMENTATION_SUMMARY.md # MCP implementation details
βββ BUSINESS_INTELLIGENCE_SCHEMA.md # Schema documentation (auto-gen)
βββ SCHEMA_MIGRATION_GUIDE.md # Migration guide (auto-gen)
βββ SCHEMA_MANAGEMENT_GUIDE.md # Schema management (this doc)
- π MCP Server Setup - Configure MCP for Claude Desktop or CrewAI
- π AI Business Workflow - 8-agent workflow based on Seena Rez's $2.7M strategy
- π MCP Implementation Summary - Complete MCP server technical details
- π Session Summary - Full implementation journey
- π Enhanced Schema - 10 advanced extraction categories
- β Schema Management Guide - How to add new data types and keep everything in sync
- π Schema Documentation - Auto-generated schema reference
- π Migration Guide - Auto-generated migration steps
1,170 Intelligence Items Across 50 Videos:
| Category | Count | Description |
|---|---|---|
| Products & Tools | 214 | AI tools, SaaS platforms, physical products |
| Problems & Solutions | 84 | Validated problems with step-by-step solutions |
| Startup Ideas | 64 | Business concepts with validation data |
| Growth Tactics | 66 | Proven marketing strategies |
| AI Workflows | 71 | Automation workflows with implementation |
| Target Markets | 73 | Market intelligence with demographics |
| Trends & Signals | 107 | Market trends with opportunity analysis |
| Business Strategies | 103 | Proven strategies for branding, operations, marketing |
| Metrics & KPIs | 59 | Benchmarks and optimization tips |
| Actionable Quotes | 132 | High-value insights from successful entrepreneurs |
| Key Statistics | 136 | Revenue, conversion, and growth data points |
| Mistakes to Avoid | 61 | Common pitfalls with prevention strategies |
AI agents can query the BI database using:
search_products- Find products with sentiment/category filterssearch_problems- Find problems with solutions and difficulty levelssearch_startup_ideas- Discover startup concepts with validationsearch_growth_tactics- Get growth strategies by channelsearch_ai_workflows- Find AI automation workflowssearch_target_markets- Get market intelligence and demographicssearch_trends- Find market trends by stage (emerging/growing)search_business_strategies- Get proven strategies by typeget_market_opportunities- Analyze combined opportunitiesget_actionable_quotes- Get expert insights by categoryget_key_metrics- Retrieve KPIs and benchmarksget_mistakes_to_avoid- Learn from documented failuresget_database_stats- Get comprehensive database statistics
Complete product discovery β launch workflow:
| Phase | Agent | Goal |
|---|---|---|
| Phase 1: Market Intelligence | Market Trend Analyzer | Find markets with 10%+ CAGR and low saturation |
| Product Discovery Specialist | Identify early adopter products using YouTube analysis | |
| Phase 2: Audience & Brand | Audience Identity Researcher | Deep-dive into psychographics and aspirational identity |
| Brand Identity Creator | Create identity-based branding strategy | |
| Phase 3: Operations | Supplier Sourcing Agent | Contact 20-50 suppliers and negotiate using competition |
| Phase 4: Marketing | Photo Shoot Director | Create 4 content types matching brand aesthetic |
| Viral Video Creator | Apply 1-3 second transition science for virality | |
| Marketing Campaign Manager | Run retargeting campaigns with 5-7% conversion targets |
The system automatically keeps everything in sync:
# 1. Edit schema.py (single source of truth)
EXTRACTION_SCHEMA["new_category"] = {
"fields": ["field1", "field2"],
"description": "Description"
}
# 2. Run sync
python3 schema_sync.py --full-sync
# β
Extractor prompts updated automatically
# β
MCP server schemas updated automatically
# β
Documentation regenerated automatically
# β
Validation rules updated automatically
# β
Backward compatibility checked automaticallyKey Benefits:
- π No manual synchronization across components
- π Breaking changes detected automatically
- π Backward compatibility validated
- π Pre-commit hooks prevent inconsistencies
- π Migration guides auto-generated
# 1. Extract transcript (bypasses YouTube API limits)
python3 scripts/browserbase_transcript_extractor.py VIDEO_ID
# 2. Extract business intelligence using Claude Sonnet 4.5
python3 scripts/business_intelligence_extractor.py VIDEO_ID
# 3. Restart MCP server (auto-loads new data)
# MCP server automatically picks up new *_insights.json files# 1. Edit schema.py
cd mcp-servers/business-intelligence
vim schema.py
# 2. Add to EXTRACTION_SCHEMA dictionary
"competitive_intelligence": {
"fields": ["competitor", "strength", "weakness", "market_share"],
"description": "Competitive intelligence analysis"
}
# 3. Run full synchronization
python3 schema_sync.py --full-sync
# 4. Follow the auto-generated migration guide
cat ../../docs/SCHEMA_MIGRATION_GUIDE.md
# 5. Test everything
python3 test_server.py# Using MCP Server directly
cd mcp-servers/business-intelligence
python3 -c "
from server import BusinessIntelligenceDB
db = BusinessIntelligenceDB()
results = db.search('chatgpt', 'products', {'sentiment': 'positive'})
print(f'Found {len(results)} results')
"
# Using Claude Desktop (after MCP configuration)
# Simply ask: "Search for AI tools with positive sentiment"
# Using CrewAI agents
# Agents automatically query via MCP tool
python3 scripts/ai_business_crew_with_mcp.pycd mcp-servers/business-intelligence
# Full validation and sync check
python3 schema_sync.py --full-sync
# Validate existing data only
python3 schema_sync.py --validate
# Check extractor sync
python3 schema_sync.py --check-extractor
# Check MCP server sync
python3 schema_sync.py --check-mcp
# Generate documentation only
python3 schema_sync.py --docs- Videos Analyzed: 50 (49 Greg Isenberg + 1 Seena Rez)
- Total Intelligence Items: 1,170
- Data Categories: 13
- MCP Tools: 13
- AI Agents: 8
- Database Load Time: <1 second
- Query Response Time: <100ms
- Schema Validation: ~5 seconds (50 files)
- Video Extraction: ~15 seconds per video
- AI Extraction: ~20-30 seconds per video
- Test Coverage: 100% (all tests passing)
- Schema Validation: Automated
- Backward Compatibility: Validated on every sync
- Documentation: Auto-generated from schema
All data structures live in mcp-servers/business-intelligence/schema.py:
- β Extraction prompts generated from schema
- β MCP tools use schema for input validation
- β Documentation auto-generated from schema
- β Data validation uses schema rules
Benefit: Change schema once β everything updates everywhere.
Enum fields (categories, sentiments) use suggested values but accept ANY string:
# Schema suggests these values:
"categories": ["saas", "ai-tool", "platform"]
# But ALSO accepts unexpected values:
"category": "automation-platform" # β
Valid
"category": "ml-tool" # β
Valid
# Why? AI extraction may discover new valid categories we didn't anticipateSchema changes are automatically validated:
- β Adding fields β Compatible
- β Adding categories β Compatible
- β Adding new data types β Compatible
- β Removing fields β Breaking change (migration required)
- β Changing field types β Breaking change (data migration required)
- Local Only: All data stored locally on your machine
- No External Calls: MCP server doesn't make external API requests
- Private Intelligence: Your BI database stays completely private
- No Tracking: Zero analytics or usage tracking
- Full Control: You own all data and infrastructure
// ~/Library/Application Support/Claude/claude_desktop_config.json
{
"mcpServers": {
"business-intelligence": {
"command": "python3",
"args": [
"/Users/yourox/AI-Workspace/mcp-servers/business-intelligence/server.py"
]
}
}
}Restart Claude Desktop and you'll have access to all 13 BI tools!
from crewai import Agent
from crewai_tools import MCPTool
# Initialize BI MCP tool
bi_tool = MCPTool(
server_name="business-intelligence",
server_path="/Users/yourox/AI-Workspace/mcp-servers/business-intelligence/server.py"
)
# Create agent with BI access
market_researcher = Agent(
role='Market Research Specialist',
goal='Find high-potential market opportunities',
tools=[bi_tool],
backstory='Expert with access to 1,170 business intelligence insights'
)from mcp_servers.business_intelligence.server import BusinessIntelligenceDB
# Initialize database
db = BusinessIntelligenceDB()
# Search products
products = db.search("chatgpt", "products", {"sentiment": "positive"})
print(f"Found {len(products)} products")
# Get statistics
stats = db.get_stats()
print(f"Total insights: {sum(stats.values()) - stats['total_files']}")
# Search trends
trends = db.search("", "trends", {"stage": "growing"})
print(f"Found {len(trends)} growing trends")# Check data files exist
ls data/business_insights/*.json | wc -l
# Run server tests
cd mcp-servers/business-intelligence
python3 test_server.py
# Check for errors
python3 server.py 2>&1 | grep "ERROR"# Run full sync to see all issues
cd mcp-servers/business-intelligence
python3 schema_sync.py --full-sync
# Check specific file
python3 -c "
from schema import validate_data_structure
import json
with open('../../data/business_insights/VIDEO_ID_insights.json') as f:
data = json.load(f)
report = validate_data_structure(data)
if not report['valid']:
print('Errors:', report['errors'])
print('Warnings:', report['warnings'])
"# Check Browserbase credentials
grep BROWSERBASE .env
# Test transcript extraction
python3 scripts/browserbase_transcript_extractor.py VIDEO_ID
# Test AI extraction
python3 scripts/business_intelligence_extractor.py VIDEO_ID
# Check API key
grep ANTHROPIC_API_KEY .env# 1. Validate all data against schema
cd mcp-servers/business-intelligence
python3 schema_sync.py --validate
# 2. Run full sync check
python3 schema_sync.py --full-sync
# 3. Test MCP server
python3 test_server.py
# 4. Check database stats
python3 -c "
from server import BusinessIntelligenceDB
db = BusinessIntelligenceDB()
stats = db.get_stats()
print(f'Files: {stats[\"total_files\"]}')
print(f'Insights: {sum([v for k,v in stats.items() if k.startswith(\"total_\") and k != \"total_files\"])}')
"- β¨ Automated Schema Sync - Change once, update everywhere automatically
- β¨ 1,170 Validated Insights - Real business intelligence from successful entrepreneurs
- β¨ 13 MCP Tools - Complete AI agent access to your BI database
- β¨ 8 AI Agents - Full product launch workflow from market discovery to sales
- β¨ Backward Compatible - Safe schema evolution with migration guides
- β¨ Production Ready - 100% test coverage, comprehensive documentation
- β¨ Self-Documenting - Auto-generated docs that never go out of sync
Based on the intelligence in this database:
- Seena Rez: Built $2.7M brand in 30 days using AI-powered product discovery
- Method Used: Early adopter analysis via YouTube transcripts
- Results: 100,000+ orders, Shopify award winner
- Your Turn: Use the same methodology, fully automated with AI agents
MIT License - See repository for details
- Seena Rez - $2.7M brand building strategy and methodology
- Greg Isenberg - 48 videos of startup wisdom and market insights
- Anthropic Claude - AI-powered intelligence extraction (Sonnet 4.5)
- Browserbase - Robust web scraping with IP block bypass
Your AI-powered business intelligence system is ready! π
Next Steps:
- Read the Schema Management Guide to learn how to extend the system
- Configure MCP Server for Claude Desktop
- Run your AI Business Crew with full BI access
- Start extracting insights from your own YouTube videos
For detailed guides, see the Documentation Index above.
Setup completed: October 15, 2025 Python version: 3.11.9 Schema version: 1.0.0 Total intelligence items: 1,170