An AI-powered learning resource discovery system that maps homework questions to curated educational resources through an interactive visual network.
Built for HackHoya 2026 🏆
This app does NOT solve homework — it helps students find the right resources to learn from.
- ✅ Recommends external learning resources (Khan Academy, MIT OCW, 3Blue1Brown, etc.)
- ✅ Discovers prerequisite learning paths using vector geometry
- ✅ Automatically clusters related questions semantically
- ✅ Supports photo upload with OCR (handwriting supported!)
- ✅ Visualizes question-resource relationships as an interactive graph
- ✅ Saves search history for logged-in users
- ❌ Does NOT generate answers or step-by-step solutions
- ❌ Does NOT act as a tutor or chatbot
Uses vector geometry on concept embeddings to automatically infer prerequisite relationships. No manual knowledge engineering required - the system discovers that "algebra → functions → derivatives" by analyzing the geometric structure of embedding space.
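As a purely illustrative sketch of the idea (this is a simplified heuristic invented for this example, not the project's actual algorithm), a prerequisite chain can be built by hopping from a target concept to its nearest unvisited neighbor in embedding space:

```typescript
// Simplified sketch: infer a prerequisite chain by repeatedly hopping to the
// nearest unvisited concept in embedding space. Hypothetical helper only.
type ConceptEmbedding = { name: string; vector: number[] };

function cosineSimilarity(a: number[], b: number[]): number {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return dot / (Math.sqrt(na) * Math.sqrt(nb));
}

function prerequisiteChain(
  target: string,
  corpus: ConceptEmbedding[],
  hops: number
): string[] {
  const chain = [target];
  const visited = new Set([target]);
  // Assumes the target concept exists in the corpus.
  let current = corpus.find((c) => c.name === target)!;
  for (let i = 0; i < hops; i++) {
    const candidates = corpus.filter((c) => !visited.has(c.name));
    if (candidates.length === 0) break;
    // Hop to the most similar unvisited concept and prepend it.
    const next = candidates.reduce((best, c) =>
      cosineSimilarity(c.vector, current.vector) >
      cosineSimilarity(best.vector, current.vector) ? c : best
    );
    chain.unshift(next.name);
    visited.add(next.name);
    current = next;
  }
  return chain; // ordered from earliest prerequisite to target
}
```

With toy embeddings where "derivatives" sits near "functions", which in turn sits near "algebra", the chain comes out in prerequisite order.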
Upload a photo of handwritten or typed homework and watch as Gemini Vision extracts the text automatically. Works with equations, diagrams, and messy handwriting.
Automatically groups related questions into semantic clusters with AI-generated names and color-coded visual regions in the graph.
Unlike static resource databases, this searches the entire web in real-time. Works for ANY topic - from quantum physics to React hooks. The system finds resources, ranks them intelligently, then applies the smart organization features above.
See WEB_SEARCH_ARCHITECTURE.md for technical details.
- Ingestion - Parse and split homework into individual questions
- Concept Extraction - Use LLM to identify key concepts (not answers!)
- Question Embedding - Generate vector representations
- Learning Path Discovery - Find prerequisites using embedding geometry (NEW!)
- Resource Retrieval - Find candidate resources via vector similarity
- Ranking - Deterministic scoring based on:
- Embedding similarity (40%)
- Concept overlap (30%)
- Level appropriateness (20%)
- Format diversity (10%)
- Explanation - LLM generates 2-3 sentences on why each resource fits
- Clustering - Group related questions semantically (NEW!)
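The deterministic scorer in step 6 can be pictured as one pure function combining the four weighted signals; the field names below are illustrative assumptions, and the real implementation lives in `lib/pipeline/ranking.ts`:

```typescript
// Deterministic scoring sketch using the documented weights.
// Field names are illustrative; see lib/pipeline/ranking.ts for the real code.
interface ScoreInputs {
  similarityScore: number;      // cosine similarity, 0..1
  conceptOverlap: number;       // fraction of question concepts covered, 0..1
  levelMatch: number;           // 1 if the level fits, partial credit otherwise
  formatDiversityBonus: number; // 0..1, higher when the format is underrepresented
}

function scoreResource(s: ScoreInputs): number {
  return (
    s.similarityScore * 0.4 +      // embedding similarity (40%)
    s.conceptOverlap * 0.3 +       // concept overlap (30%)
    s.levelMatch * 0.2 +           // level appropriateness (20%)
    s.formatDiversityBonus * 0.1   // format diversity (10%)
  );
}
```

Because there is no LLM call in this step, the same inputs always produce the same ranking.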
- Framework: Next.js 14 (App Router)
- Language: TypeScript
- Styling: Tailwind CSS
- UI: Custom SpiderWeb graph visualization with cluster regions
- Vector Search: In-memory cosine similarity
- Embeddings: Google `text-embedding-004`
- LLM: Google Gemini `gemini-1.5-flash` (text + vision)
- Auth & Database: Supabase
- Knowledge Graph: Auto-constructed from resource corpus
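Since vector search is plain in-memory cosine similarity over a small corpus, retrieval reduces to a brute-force top-k scan. A minimal self-contained sketch (types and names are assumptions, not the actual code in `lib/pipeline/retrieval.ts`):

```typescript
// Brute-force top-k retrieval over in-memory embeddings (illustrative sketch).
type Embedded = { id: string; embedding: number[] };

function cosine(a: number[], b: number[]): number {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return dot / (Math.sqrt(na) * Math.sqrt(nb));
}

function topK(query: number[], corpus: Embedded[], k: number) {
  return corpus
    .map((r) => ({ id: r.id, score: cosine(query, r.embedding) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, k);
}
```

For a corpus of a few dozen resources, an O(n) scan per query is fast enough that a dedicated vector database would be overkill.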
- Node.js 18+
- Google Gemini API key (Get one here)
- Supabase project (optional, for auth & history)
1. Clone and navigate to the project:

```bash
cd HackHoya
```

2. Install dependencies:

```bash
npm install
```

3. Create `.env.local` from the template:

```bash
cp env.template .env.local
```

4. Fill in your API keys in `.env.local`:

```bash
GEMINI_API_KEY=your-gemini-api-key
NEXT_PUBLIC_SUPABASE_URL=https://your-project.supabase.co   # Optional
NEXT_PUBLIC_SUPABASE_ANON_KEY=your-anon-key                 # Optional
```

5. Run the development server:

```bash
npm run dev
```

See TESTING_GUIDE.md for detailed testing instructions.
On the first API call, the system will:
- Load 27 seed resources (Khan Academy, MIT OCW, 3Blue1Brown, etc.)
- Generate embeddings for all resources (~30 seconds)
- Cache everything in memory
Subsequent requests will be much faster.
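That warm-up behavior amounts to a module-level cache; a simplified sketch (names and shapes are assumptions, not the project's actual code):

```typescript
// Module-level cache so resource embeddings are generated once per process.
type Resource = { id: string; text: string };
type EmbeddedResource = { id: string; embedding: number[] };

let cache: EmbeddedResource[] | null = null;

async function getEmbeddedCorpus(
  resources: Resource[],
  embed: (text: string) => Promise<number[]>
): Promise<EmbeddedResource[]> {
  if (cache) return cache; // warm path: reuse previously generated embeddings
  cache = await Promise.all(
    resources.map(async (r) => ({ id: r.id, embedding: await embed(r.text) }))
  );
  return cache;
}
```

The first call pays the embedding cost; every later call returns the cached array immediately.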
```
homework-router/
├── app/
│   ├── api/process/route.ts      # Main API endpoint
│   ├── layout.tsx                # Root layout
│   ├── page.tsx                  # Home page with upload + results
│   └── globals.css               # Global styles
├── lib/
│   ├── types.ts                  # TypeScript types
│   ├── corpus.ts                 # Resource corpus manager
│   ├── llm/
│   │   └── client.ts             # Abstracted LLM client
│   ├── embeddings/
│   │   └── utils.ts              # Vector operations
│   ├── data/
│   │   └── resources.ts          # Seed resource corpus (27 items)
│   └── pipeline/
│       ├── index.ts              # Main orchestrator
│       ├── ingest.ts             # Question splitting
│       ├── concepts.ts           # LLM concept extraction
│       ├── retrieval.ts          # Vector search
│       ├── ranking.ts            # Deterministic scoring
│       └── explain.ts            # LLM explanation generation
├── public/
│   └── example-homework.txt      # Demo homework
└── package.json
```
Try pasting this example:
1. Find the derivative of f(x) = 3x² + 2x - 5
2. Explain the difference between kinetic and potential energy
3. What is the time complexity of binary search?
Expected output:
- 3 questions detected
- Concepts extracted for each (e.g., "derivatives", "power rule", "kinetic energy")
- Top 5 resources per question with relevance scores
- Explanations focus on what the resource teaches, NOT how to solve
The current UI is minimal and functional. For your spiderweb navigation concept:
- Question results are currently linear cards
- Consider: radial/graph layout with questions as nodes
- Shared concepts could link questions visually
- Resources could branch out from each question node
Suggested libraries for spiderweb UI:
- `react-force-graph` for force-directed graphs
- `vis-network` for interactive network visualization
- `d3.js` for custom graph rendering
Edit `lib/data/resources.ts` and add entries:

```typescript
{
  id: "unique-id",
  title: "Resource Title",
  url: "https://...",
  format: "video" | "text" | "textbook",
  level: "intro" | "intermediate" | "advanced",
  concepts: ["concept1", "concept2"],
  description: "Brief description",
}
```

Restart the server to regenerate embeddings.
Edit `lib/pipeline/ranking.ts`:

```typescript
// Current weights
score += candidate.similarityScore * 0.4; // Embedding similarity
score += overlap * 0.3;                   // Concept overlap
score += levelScore * 0.2;                // Level match
// + 0.1 format diversity bonus
```

The `LLMClient` class in `lib/llm/client.ts` is abstracted and currently uses Gemini. To add OpenAI:

1. Install the `openai` package
2. Implement `openaiChat()` and `openaiEmbed()` methods
3. Update the constructor to handle `provider: "openai"`
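A rough sketch of what that constructor/dispatch change could look like. The synchronous stubs and their return values are invented purely to show the shape; only the `openaiChat()` method name comes from the steps above:

```typescript
// Illustrative sketch of provider dispatch in an abstracted LLM client.
// Stubs return labeled strings so the dispatch is visible; real methods
// would call the Gemini / OpenAI SDKs asynchronously.
type Provider = "gemini" | "openai";

class LLMClient {
  constructor(private provider: Provider = "gemini") {}

  chat(prompt: string): string {
    switch (this.provider) {
      case "gemini":
        return this.geminiChat(prompt);
      case "openai":
        return this.openaiChat(prompt);
    }
  }

  private geminiChat(prompt: string): string {
    return "gemini:" + prompt; // stand-in for the existing Gemini call
  }

  private openaiChat(prompt: string): string {
    return "openai:" + prompt; // stand-in for a new OpenAI call
  }
}
```

Keeping the dispatch in one place means the pipeline code never needs to know which provider is active.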
Request:

```json
{
  "content": "1. What is...?\n2. Explain...",
  "contentType": "text"
}
```

Response:

```json
{
  "questions": [
    {
      "questionText": "What is...?",
      "detectedConcepts": ["concept1", "concept2"],
      "questionType": "conceptual",
      "resources": [
        {
          "resource": { "id": "...", "title": "...", ... },
          "score": 0.92,
          "reason": "This resource covers..."
        }
      ]
    }
  ]
}
```

| Constraint | How Enforced |
|---|---|
| No homework solutions | LLM system prompts explicitly forbid solutions |
| Ranking before explanation | Pipeline order is hardcoded: retrieve → rank → explain |
| Deterministic ranking | Pure function with explicit weights (no LLM) |
| Resource-focused output | UI shows only resources, no answer input fields |
MIT License - feel free to use for your hackathon!
This is a hackathon-scoped project. For production use, consider:
- Adding a vector database (Pinecone, pgvector) for larger corpora
- Caching LLM responses to reduce costs
- Rate limiting and error handling
- User feedback on resource quality
- Mobile-responsive improvements
- Share/export functionality