A self-evolving Neuro-Symbolic Swarm Agent for macOS. Moving beyond Hermes' text-based skills, Verantyx fuses local Gemma 4 and Stealth Web Gemini via "JCross" spatial memory to autonomously orchestrate native OS operations.
To those who have stumbled upon this repository and are taken aback by its chaos:
I am a student. This project originally began with the goal of building a "Symbolic Reasoning Engine" to challenge the ARC-AGI. However, I was immediately confronted with the harsh reality of data scarcity and, most importantly, astronomical API costs that a student simply cannot afford.
The reason I am currently employing browser automation (manipulating Web Gemini) is that it serves as an "experimental mode" to bypass API fees, allowing me to run hundreds of reasoning loops. It is a functional mockup designed to visualize how the system orchestrates its logic.
You might also notice that the repository's directory is a mess, containing a chaotic mix of old Python code and seemingly unrelated iOS projects, such as a "Facial Tracking App for Cerebral Palsy Patients using ARKit."
This is because this repository is not a cleanly packaged software product. It is my Labโa raw, messy accumulation of every theme, experiment, and research remnant I have ever poured my soul into.
Perhaps this project perfectly embodies the Duality of a single human being: the burning ambition of wanting "geeks all over the world to give this a Star," coexisting alongside the quiet resignation that "this is ultimately just my personal experimental log."
It is by no means a beautiful codebase. But embedded within it are all my thoughts and my relentless drive to break through the limits of "AI Structural Reasoning" using the absolute minimum of resources.
The Verantyx Browser subsystem introduces a revolutionary 4-node agentic architecture designed to bypass modern BotGuard and CAPTCHA mechanisms (like those aggressively employed by platforms such as Google Gemini & Claude) while maintaining robust, long-term memory and autonomous task execution capabilities.
By combining local Small Language Models (SLMs) with a robust "Supervisor/Worker" hierarchy, Verantyx acts as a sentient spatial router. Crucially, we enforce a Carbon Paper UI (Human-in-The-Loop) mechanism to manually mediate clipboard interactions, definitively solving the bot-detection crisis.
Verantyx has evolved beyond a simple CLI into a full 3D cybernetic workstation.
- ๐ Vera Memory (3D Cyber Space): Type
verato launch a stunning D3/Three.js integrated 3D neural map of your codebase's spatial index. - ๐ฎ Multi-Node Crucible Synthesis: Drag and drop up to 10 distinct architecture nodes into the Crucible Reaction Zone. Click the glowing
[SYNTHESIZE]button to physically fuse their JCross logic into brand new architectural intermediate representations (IR). - ๐๏ธ Cyberpunk Finder Mode: Click any node in the 3D map to slide out the sleek Finder Panel. Instantly read the raw source code and JCross semantic tags cleanly inside the browser via our new
/catlocal bridge API. - ๐ฏ Zero-DOM Safari Injection Tracking: The system autonomously tracks your active Safari windows, dynamically locating the
gemini.google.comtab via raw AppleScript geometry matching. It guarantees prompts are securely injected without relying on immediately-flagged Headless DOM automation. - ๐ฆ Nightwatch Observer: A local daemon that silently monitors and losslessly compresses your code repository into spatial
JCrossmemory structures overnight.
The system is split into four distinct cognitive nodes, completely isolating planning logic from raw web execution.
graph TD
User([๐ฑโโ๏ธ User]) -->|Task/Prompt| SLM
subgraph "Local Cognitive Layer (Secure)"
SLM[๐ค Local SLM Router<br>Qwen2.5:1.5b via Ollama]
end
subgraph "Verification Layer (API/Native)"
Senior[๐ง Senior Supervisor<br>Gemini 2.5 Pro]
Apprentice[๐ต๏ธ Apprentice Supervisor<br>Gemini 2.5 Pro]
end
subgraph "Execution Layer (Browser)"
Worker[๐ฆพ Stealth Web Actor<br>Gemini UI on Safari]
end
SLM -->|Sub-task Dispatch| Senior
SLM -->|Sub-task Dispatch| Apprentice
SLM -->|Execution Dispatch| Worker
Senior -->|Intent Validation & Memories| User
Apprentice -->|Action Auditing| User
Worker -->|DOM Interaction| Target[Web Application]
The brain of the operation runs on local hardware using Ollama. It maintains long-term memory, parses user intent, avoids context dilution, and breaks down complex prompts into specific sub-tasks to be dispatched to the remote models. It serves as an impermeable wall protecting the core agent logic from the massive context window destruction prevalent in long-running cloud instances.
The Senior Supervisor receives payloads from the SLM, analyzing them to ensure the output aligns exactly with what the User intended. It injects additional memory and refines prompts without actually executing them on the target machine.
The Apprentice operates on a 5-turn promotion cycle, shadowing the Senior and ensuring spatial state is synced accurately within the .ronin/experience.jcross database.
The "Hands and Feet". This node controls the actual Web UI. Because BotGuard instantly detects headless Chrome automation, puppeteer, or injected JavaScript events, the Worker operates entirely via human-mediated native OS actions.
To defeat advanced anti-bot systems, we implemented the Carbon Paper UIโa secure "Human-in-The-Loop" (HITL) manual handoff protocol.
Instead of writing scripts to click buttons (which are instantly blocked), the system automatically formats the perfectly optimized prompt and securely injects it into the macOS Clipboard. It then prompts the user via a terminal Dialoguer.
- Prompt Generation: SLM + Supervisors construct the optimal prompt.
- Clipboard Hydration: The OS Clipboard is silently loaded via
arboard. - Target Acquisition: The user is prompted in the CLI. The system programmatically brings
Safarito the foreground via macOS Native APIs, specifically hunting for the Gemini tab. - Human Actuation / Auto-Stealth: Depending on your mode, the system either gracefully pastes the content or prompts you to hit
Cmd+V + Enter. This completely circumvents BotGuard. - Tamper Verification: The agent polls the clipboard post-submission using geometric extraction to ensure flawless extraction.
Say goodbye to the chaotic pip install, npm run, and dependency hell.
Verantyx unifies everything under a Single Rust Entrypoint using cargo. Any auxiliary Python scripts or JavaScript UI dependencies are seamlessly executed and managed by the Rust orchestration layer.
Ensure you have your environment set up and Ollama running with your desired parameter models.
Start the primary Verantyx interactive chat repl.
# Simply run cargo. It automatically coordinates any auxiliary scripts.
cd verantyx-browser
cargo run -p ronin-hive --example interactive_chatOnce inside the REPL, you can utilize the following spatial intelligence commands:
veraLaunch the Vera Memory 3D Visualizer. This opens a browser-based neural map of your codebase's JCross state. You can click files to read them or drag them to the center to synthesize them.time-machine <path>Force a spatial indexing scan of a directory. It compresses code into.jcrossformat and links dependencies. (e.g.,time-machine .)crucible <file_1> <file_2> ... <file_N>Trigger a multi-node synthesis. Takes up to 10 file identifiers, fusing them in memory and outputting a cutting-edge architectural concept and execution snippet.clearClear the terminal repl history for a clean view.
- N-Node Crucible Expansion: Synthesis limits lifted. Safely fuse massive architectures across up to 10 nodes simultaneously.
- Finder Sidebar UI: Read codebase structure gracefully without exiting the Visualizer.
- Multi-byte Panic Elimination: Japanese character boundary index panics
[0..80]have been comprehensively replaced with safe.chars().take(N)iterators across all nodes.
Proprietary. Belongs to the Verantyx spatial intelligence framework.