Document Intelligence

Sovereign Cyclops: deterministic document intelligence where your documents never leave the building.

Extract text from any document on UK hardware. No LLM touches your data during text extraction. Deterministic, auditable, sovereign.

Available via REST API, MCP protocol, and right here.
UK-sovereign hardware
CLOUD Act immune
Full audit trail
ICO C1912355
Try It

Use Cyclops right here.

Upload a document. Choose a mode. Get results instantly.

Drop a file here or click to browse
Max 50MB
PDF DOCX JPEG PNG WebP TIFF
Processing...

Sign in to use Cyclops.

Sign In
How It Works

Two engines. Completely isolated.

Text extraction and visual description are separate paths. OCR output never touches the vision model.

OCR Engine

pdfplumber for text-based PDFs. PaddleOCR for scanned documents and images. python-docx for Word documents. Every character comes directly from the source. No LLM involved.

Deterministic text extraction

Vision Engine

Gemma 4 vision model for describing diagrams, photos, screenshots, and charts. The only path that uses an LLM. Completely separate from OCR.

Visual understanding
Key point: Text extraction and visual description are completely isolated. OCR output never touches the vision model.
Supported Formats

Every document format your organisation uses.

From single-page receipts to 200-page contracts.

PDF (up to 200 pages) DOCX JPEG PNG WebP TIFF
Data Sovereignty

Cloud OCR sends your documents abroad. Ours does not.

AWS Textract, Google Vision, and Azure Document Intelligence are operated by US companies. The CLOUD Act gives US authorities the legal power to compel access to your documents.

Cloud OCR Providers

  • Data leaves UK to US-owned infrastructure
  • Subject to CLOUD Act (H.R.4943)
  • Per-page fees that scale unpredictably
  • Vendor lock-in with proprietary formats
  • No visibility into how documents are processed

Sovereign Cyclops

  • All processing on UK hardware we own
  • Structurally immune to CLOUD Act
  • Flat-rate pricing, no per-page fees
  • Open standards, no vendor lock-in
  • Full audit trail for every document processed
Use Cases

Document intelligence for every industry.

From invoices to blueprints. Every document processed on sovereign infrastructure.

Invoice and receipt processing

Extract line items, totals, VAT, and supplier details from invoices and receipts for automated accounting workflows.

Contract text extraction

Pull clauses, dates, parties, and obligations from contracts for legal review. Handle multi-column layouts and annexes.

Medical record digitisation

Convert paper medical records, prescriptions, and clinical notes into structured digital text for NHS systems.

Insurance claim processing

Extract claim details, policy numbers, and supporting evidence from scanned claim documents and photographs.

Academic paper analysis

Process research papers, extract citations, tables, figures, and structured data from complex academic layouts.

Government form processing

Digitise and extract data from government forms, applications, and correspondence at scale.

Architectural drawing analysis

Interpret blueprints, technical drawings, and architectural plans. Extract annotations, dimensions, and labels.

Handwritten note transcription

Convert handwritten notes, letters, and annotations into searchable, editable digital text using vision AI.

Developer Integration

REST API. Bearer token auth. Two modes.

Integrate Cyclops into your existing workflows in minutes.

Extract Text (OCR)
curl -X POST https://api.hush-ai.uk/v1/cyclops/process \
  -H "Authorization: Bearer YOUR_TOKEN" \
  -F "[email protected]" \
  -F "mode=ocr"

# Response
{
  "text": "extracted content...",
  "mode": "ocr",
  "filename": "invoice.pdf",
  "file_type": "application/pdf",
  "chars": 4820
}
Describe Image (Vision)
curl -X POST https://api.hush-ai.uk/v1/cyclops/process \
  -H "Authorization: Bearer YOUR_TOKEN" \
  -F "[email protected]" \
  -F "mode=eye"
Security & Compliance

Enterprise-grade security. No compromises.

Every layer hardened. Every request audited.

🔐

Bearer Token Auth

Every request authenticated with scoped bearer tokens. No anonymous access.

🛡️

Path Guards

Strict path validation prevents directory traversal and unauthorised file access.

⏱️

Rate Limiting

Per-token rate limiting prevents abuse and ensures fair resource allocation.

📜

Audit Logging

Every document processed is logged with timestamp, token, and metadata. Exportable.

🇬🇧

UK Hardware

Documents processed on hardware we physically own in the United Kingdom.

🔒

ICO Registered

Data controller C1912355. UK GDPR compliant. DPIA available on request.