Hush AI

Infrastructure

How the Sovereign Prism runs: agentic node orchestration, automated backups, LLM model fleet, and multi-layer hardening. Everything described here runs on hardware we physically own in the United Kingdom.

System Architecture Live

Internet → Cloudflare (TLS/DDoS) → Nginx (reverse proxy + rate limits)
     ↓
Sovereign Gateway (port 7777) — auth, routing, quota, think-token stripping
     ↓             ↓           ↓         ↓
LLM Proxy    Wiki App    Dashboard   Services
     ↓
LLM Server (llama.cpp) — local inference, no cloud
     ↓
Agentic Nodes: Iris → Researcher → Archivist → Analyst → Guardian
     ↓
Sovereign Refinery (SQLite + FTS + ChromaDB) — local NVMe storage

Agentic Node Orchestration

The Sovereign Prism runs as a multi-agent pipeline. Each node is a specialised Python service with its own systemd unit, watchdog supervision, and sandboxed filesystem access.

Iris
Node 1 · Telegram Bot
Receives user queries via Telegram, routes to the LLM proxy, and streams responses back. Handles inline commands, document forwarding, and multi-modal inputs.
Researcher
Node 2 · Web Intelligence
Performs deep web research using the Elite Scraper and SearXNG private search. Extracts, summarises, and archives source material for the knowledge base.
Archivist
Node 3 · Filing & Ingestion
Classifies incoming content, generates embeddings, and files articles into the Sovereign Refinery wiki with full-text search indexing.
Analyst
Node 4 · Content Analysis
Runs deeper analysis on filed content: cross-referencing, trend detection, and quality scoring across the knowledge base.
Guardian
Node 5 · Security Monitor
Monitors system health, detects anomalies, and enforces content policies. Acts as the internal security watchdog.
Wiki App
SaaS · Port 8300
Public wiki and archival application. JWT authentication, team management, RSS feed ingestion, and full-text search across the entire knowledge base.

LLM Model Fleet

All models run locally on AMD Strix Halo APU hardware with ROCm/HIP acceleration. No inference leaves the building.

ModelRoleContextQuantisation
Qwen 3.6 35B A3BPrimary (Sovereign Prism)262KQ6_K_XL
Gemma 4 E4BGateway chatbot32KQ8_K_XL
Qwen 3.6 27BFast inference65KQ8_K_XL
Qwen3-VL-32BVision & multimodal128KQ8_K_XL
VL-Rethinker 72BDeep reasoning262KQ4_K_M
Qwen3 Coder NextCode generation65KQ6_K_XL

Automated Backup System Hardened

Schedule: Daily at 03:00 UTC via cron

Code backups — all source code, configs, and scripts (excluding secrets, models, and runtime data). Each backup is verified by extracting and checking for core files, then signed with a SHA-256 checksum.

Data backups — the entire Sovereign Refinery (wiki database, FTS index, embeddings). SQLite databases get a safe .backup snapshot before archival, guaranteeing consistency even during writes.

Rotation — backups older than 30 days are automatically purged. Checksums are rotated alongside their archives.

Verification — every code backup is automatically extracted to a temporary directory and checked for the presence of config.py, wiki_app.py, and utils.py before the backup is considered valid.

What the backup excludes (by design)

Service Management

Process supervision — each node runs as a sandboxed systemd service with Restart=always and 10-second restart delay. A dedicated watchdog process polls every 30 seconds and restarts any crashed node automatically.

Systemd sandboxing (applied to all services):

Startup sequence — pre-flight health checks verify the LLM server, SearXNG, ChromaDB, bot token, and browser tool extractor before launching any nodes.

Security Hardening Layers

Layer 1: Network (UFW Firewall)

Layer 2: SSH

Layer 3: Nginx Reverse Proxy

Layer 4: Application

Layer 5: Startup Scripts

Proxy Lane Architecture

Internal services are exposed through dedicated Nginx proxy lanes, each locked to LAN traffic only.

LaneServicePortBackend
AlphaLlama 4 Scout 109B8001DGX Spark (Alicia)
Omega120B Expert8002Workstation (James)
Prism27B Multimodal8003Local proxy
MCPSovereign MCP Server8004SSE-enabled
EchoAudio Transcription8005100MB upload limit
SearXNGPrivate Search8006Docker container
CyclopsDocument OCR & Vision800755MB upload limit

Audit Trail

The Oracle Prism codebase undergoes regular security audits. Hardening is applied in phases with full rollback procedures documented for every change.

Last audit: 31 May 2026

Hardening phases completed: 8

Items addressed in latest pass:

← Back to Hush AI