Semantic caching, inference acceleration, and connected memory — the full AI memory stack in one platform.
WorldFlow AI sits between your application and your LLM providers, accelerating every request through a unified memory layer.
Every inbound request passes through a semantic gateway that embeds, compares, and routes prompts in real time. Identical or near-identical queries are served from cache in under 50 ms, while novel requests are forwarded to the model with full context metadata attached.
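The matching step can be pictured as a nearest-neighbor lookup over prompt embeddings with a similarity threshold. The sketch below is illustrative, not WorldFlow's implementation: `SemanticCache`, its 0.9 default threshold, and the pre-computed embeddings are all assumptions, and a real gateway would obtain embeddings from an embedding model rather than receive them directly.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

class SemanticCache:
    """Threshold-based semantic lookup: serve the closest cached
    response if it is similar enough, otherwise signal a miss."""

    def __init__(self, threshold=0.9):
        self.threshold = threshold  # configurable sensitivity
        self.entries = []           # list of (embedding, response)

    def lookup(self, embedding):
        best_resp, best_sim = None, 0.0
        for emb, resp in self.entries:
            sim = cosine(embedding, emb)
            if sim > best_sim:
                best_resp, best_sim = resp, sim
        # Below the threshold the request is routed to the model instead.
        return best_resp if best_sim >= self.threshold else None

    def store(self, embedding, response):
        self.entries.append((embedding, response))
```

A near-duplicate prompt (high cosine similarity) returns the cached response; an unrelated prompt falls through to the model and its response is stored for future hits.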
SemBlend reuses GPU KV-cache blocks from semantically similar prior prompts, slashing time-to-first-token by 2-12x. RoPE-corrected attention ensures positional integrity, so quality stays within 1-5% of cold prefill while throughput multiplies with concurrency.
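Conceptually, RoPE correction works because rotary embeddings compose additively in position: a key cached at position p can be moved to position q by rotating it through the delta q - p. A minimal sketch under that standard RoPE formulation; `rope_rotate` and `reposition_key` are hypothetical names, not WorldFlow's actual kernel.

```python
import math

def rope_rotate(vec, pos, base=10000.0):
    """Rotate successive (even, odd) pairs of vec by pos * theta_i,
    following the standard RoPE angle schedule."""
    d = len(vec)
    out = []
    for i in range(d // 2):
        theta = pos * base ** (-2.0 * i / d)
        x, y = vec[2 * i], vec[2 * i + 1]
        out.append(x * math.cos(theta) - y * math.sin(theta))
        out.append(x * math.sin(theta) + y * math.cos(theta))
    return out

def reposition_key(cached_key, old_pos, new_pos):
    """Move an already-rotated cached key from old_pos to new_pos.
    Rotations add in position, so rotating by the delta is exact."""
    return rope_rotate(cached_key, new_pos - old_pos)
```

Because the correction is a pure rotation, reusing a donor prompt's KV blocks at a different offset preserves positional integrity without re-running prefill.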
Multi-turn conversations, RAG pipelines, and agentic workflows all benefit from a shared memory store that persists across sessions. WorldFlow indexes interactions by semantic similarity, so returning users and recurring topics trigger automatic KV reuse without any application-level changes.
By serving cached responses at the gateway and reusing KV-cache at the GPU layer, WorldFlow eliminates redundant computation at two levels. Enterprises typically see a 40-70% reduction in inference spend, with savings growing as request volume and prompt overlap increase.
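To see how the two levels compound, consider an illustrative cost model. All numbers and parameter names here are assumptions for the sketch, not measured WorldFlow figures: gateway hits cost nothing at the model, a share of misses is KV-accelerated at a reduced per-request cost, and the remainder run cold at full price.

```python
def inference_cost(requests, cost_per_request, hit_rate,
                   accel_share, accel_cost_factor):
    """Two-level savings model:
    - gateway cache hits never reach the model (zero inference cost)
    - accel_share of the misses reuse KV-cache at a fraction of cold cost
    - the remaining misses pay full price."""
    misses = requests * (1.0 - hit_rate)
    accelerated = misses * accel_share
    cold = misses - accelerated
    return (cold * cost_per_request
            + accelerated * cost_per_request * accel_cost_factor)
```

For example, with 1M requests at $0.01 each, a 40% hit rate, and half of the misses KV-accelerated at half the cold cost, the model yields $4,500 versus a $10,000 baseline, a 55% saving, which falls inside the 40-70% band quoted above.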
WorldFlow exposes an OpenAI-compatible API. Point your base URL at WorldFlow and start accelerating immediately.
```python
from openai import OpenAI

# Just change the base URL — that's it
client = OpenAI(
    base_url="https://gateway.worldflowai.com/v1",
    api_key="wf-...",
)

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[
        {"role": "user", "content": "Summarize this quarter's report"}
    ],
)

# Cache hits return in <50 ms
# KV-accelerated responses: 2-12x faster TTFT
print(response.choices[0].message.content)
```
Real-time embedding, cosine similarity matching, and threshold-based routing with configurable sensitivity.
GPU-level KV-cache reuse with RoPE correction for positionally accurate attention across donor and target prompts.
Personally identifiable information is detected and redacted at the edge before prompts reach the cache or model.
Persistent, session-spanning memory for multi-turn conversations, tool-use chains, and autonomous agent workflows.
Optimized for NVIDIA GPUs and validated through the NVIDIA Inception program for accelerated AI startups.
A unified platform that replaces fragmented caching tools and provider-specific optimizations.
| Feature | WorldFlow AI | Other Caching Solutions | Provider Caching |
|---|---|---|---|
| Semantic Matching | | | |
| Multi-modal Support | | | |
| Global Distribution | | | |
| PII Detection | | | |
| Integration Effort | | | |
| Enterprise Security | | | |
Real benchmarks from production workloads across summarization, RAG, multi-turn dialogue, and code generation.
2-12x
Time-to-first-token reduction across context lengths.
30-80%
Semantic hit rates across enterprise workloads, from diverse conversations to focused domain queries.
40-70%
Inference cost savings from combined gateway caching and GPU-level KV-cache reuse at scale.
Every deployment is different. Tell us about your workload and we'll build a plan that fits your scale, latency targets, and compliance requirements.
Request Demo