ShizenAI

ShizenAI is a local-first semantic context manager and adaptive LLM routing layer.

The platform ingests and organizes knowledge, retrieves relevant context, and routes requests to the cheapest/fastest/most useful available path.

Product Identity

ShizenAI is structured as two layers:

Core platform: ingestion, normalization, chunking, embeddings, retrieval, routing, context packaging, observability.
Application modules: training/tutor, study workflows, transcript memory, and operational knowledge tools.

This keeps the core durable while allowing product modules to evolve independently.

What the Platform Does

Ingests source material from documents and text-like inputs.
Builds semantic chunks with metadata and vector embeddings.
Stores semantic memory in PostgreSQL + pgvector.
Retrieves top-k relevant context by similarity.
Routes requests based on confidence, complexity, and runtime constraints.
Prepares reusable context bundles for downstream AI calls and integrations.

Tech Stack (Current)

Frontend: React, TypeScript, Vite
Backend: FastAPI, SQLAlchemy, Pydantic
Database: PostgreSQL 15 + pgvector (Vector(768))
AI provider: Perplexity API (sonar models)
Infra (optional): Terraform on GCP (Compute Engine + Cloud SQL private IP + custom VPC)
Container runtime: Docker Compose

Architecture (Current)

flowchart LR
  U[User Browser] --> FE[Frontend :5173]
  FE -->|REST| BE[FastAPI Backend :8000]
  BE -->|SQL + pgvector| DB[(PostgreSQL + pgvector)]
  BE -->|LLM calls| PPLX[Perplexity API]
  BE -->|Optional TTS| XI[ElevenLabs API]

flowchart TD
  A[Admin uploads source] --> B[Parse + chunk content]
  B --> C[Perplexity summary]
  C --> D[Deterministic 768-d embedding]
  D --> E[Store KnowledgeChunk in pgvector]
  E --> F[Generate flashcard question]
  F --> G[Assign to employee]
  G --> H[Employee submits answer]
  H --> I[Similarity + Perplexity judge]
  I --> J[Update SRS state in UserReview]

flowchart LR
  Internet --> FW[Firewall: 5173 / 8000]
  FW --> VM[Compute Engine VM]
  VM --> C1[frontend container]
  VM --> C2[backend container]
  VM --> PSA[Private Service Access]
  PSA --> SQL[(Cloud SQL PostgreSQL private IP)]

Local Development

Prerequisites

Docker Engine / Docker Desktop
Git
Perplexity API key

Environment

Create .env at repo root:

PERPLEXITY_API_KEY=your_key_here
ELEVENLABS_API_KEY=optional_for_tts
DATABASE_URL=postgresql://postgres:password@postgres:5432/shizenai
VITE_API_URL=http://localhost:8000

Start

git clone https://github.com/DontSpillTheTea/ShizenAI.git
cd ShizenAI
docker compose up --build -d

Local URLs

Frontend: http://localhost:5173
API docs: http://localhost:8000/docs

Optional GCP Deployment (Terraform)

Terraform is optional and only needed when you want to deploy outside local Docker.

Infra provisions:

Custom VPC + subnet
Private services access for Cloud SQL private IP
Cloud SQL PostgreSQL 15
Compute Engine VM with startup bootstrap
Firewall rules for 5173/8000 (and optional SSH)
Static external IP output

Deploy (optional)

cd terraform
terraform init
terraform apply -auto-approve \
  -var="db_password=<your_db_password>" \
  -var="enable_ssh=true" \
  -var="ssh_source_cidr=0.0.0.0/0"

Use Terraform outputs for:

app_public_ip
db_private_ip
db_connection_name

Backend Notes

PERPLEXITY_API_KEY is used for summary/judging/topic extraction.
Embeddings are generated via a deterministic 768-d fallback to keep pgvector schema stable.
Startup seeds default users:
- admin / admin
- employee / employee

Data Model (Current Core)

users
topics
knowledge_chunks (embedding Vector(768))
flashcards
user_reviews (SRS interval + ease factor)
user_assignments
progress_cache
omi_captures
knowledge_sources

Roadmap and Reframing Documents

Root execution checklist: SHIZENAI_ROADMAP.md
Foundation cleanup and architecture audit: docs/platform_reframing_audit.md
Historical phase docs and execution logs remain as implementation history.

Operational Commands

Pause stack while preserving state:

docker compose stop

Shut down containers/network while retaining external data volumes:

docker compose down

Hard reset (includes volume deletion):

docker compose down -v
docker volume rm shizen_pg_data shizen_ollama_models

Near-Term Priorities

Finalize platform-vs-application boundaries in code and API modules.
Harden knowledge source/chunk/embedding schema and provenance metadata.
Build a first-class routing engine module with explicit decision logging.
Standardize context bundle output format for downstream model calls.
Preserve training flows as a modular consumer rather than core identity.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
backend		backend
docs		docs
execution_history		execution_history
frontend		frontend
terraform		terraform
test		test
.gitignore		.gitignore
README.md		README.md
SHIZENAI_ROADMAP.md		SHIZENAI_ROADMAP.md
docker-compose.yml		docker-compose.yml
test_upload.sh		test_upload.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ShizenAI

Product Identity

What the Platform Does

Tech Stack (Current)

Architecture (Current)

Local Development

Prerequisites

Environment

Start

Local URLs

Optional GCP Deployment (Terraform)

Deploy (optional)

Backend Notes

Data Model (Current Core)

Roadmap and Reframing Documents

Operational Commands

Near-Term Priorities

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ShizenAI

Product Identity

What the Platform Does

Tech Stack (Current)

Architecture (Current)

Local Development

Prerequisites

Environment

Start

Local URLs

Optional GCP Deployment (Terraform)

Deploy (optional)

Backend Notes

Data Model (Current Core)

Roadmap and Reframing Documents

Operational Commands

Near-Term Priorities

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages