ORILink — threat coverage

What gets blocked.
Before it reaches your agent.

ORILink is middleware. It sits between your data sources and your AI system, and between your AI system and everything it can act on — across AI agents, local models, and RAG pipelines. Every input is inspected before the model sees it. Every output is inspected before it executes. No model changes. No framework lock-in.

Works alongside your existing security stack. ORILink does not replace firewalls, authentication, or network security. It adds a layer those systems cannot provide — enforcement at the point where language becomes action. Enforcement decisions export as structured JSON, alerts stream via webhook to your configured notification channel, and the audit registry supports CSV and JSON export for compliance reporting.

Individual + Business

Core enforcement — all deployments

Every ORILink deployment enforces the following, unconditionally, regardless of model, framework, or operator configuration.

Threat	Status	How it works
Prompt injection Hidden instructions in documents, web pages, or tool responses	Blocked	Malicious instructions embedded in content your agent reads are intercepted and annotated with untrusted provenance before they reach the model. Applies to text, structured data, and encoded payloads.
Structured data injection Attack payloads hidden in JSON fields, API responses, NFT metadata, or object attributes	Blocked	Fields containing permission escalation language, identity claims, capability unlocking strings, or behavioral directives are detected and quarantined before processing. Schema deviations trigger trust weight downgrade.
Encoded injection Attacks hidden inside Base64, URL-encoded, or hex-encoded content	Blocked	Content is safely decoded and rescanned against the full injection signature library. Nested encoding is handled recursively up to a configurable depth ceiling. Malformed encoding is flagged and downgraded, not silently passed.
System prompt disclosure Agent reveals its own instructions when probed by an external party	Blocked	Outbound text is scanned for verbatim and paraphrased reproduction of the agent's system prompt before it exits the output channel. Disclosure to operator channels is flagged, not blocked.
Governance rule leakage Agent discloses its constraints, tools, or operator configuration to external parties	Blocked	Output containing constraint language, capability boundaries, tool enumeration, or operator configuration details is blocked before reaching external recipients. 26 governance disclosure patterns across four categories.
Unauthorized reconnaissance Agent attempts to scan, probe, or map systems outside its authorized scope	Blocked	Actions targeting hosts, paths, or endpoints outside the operator-defined scope are blocked pre-execution, regardless of how the instruction is framed. Framing attacks — reconnaissance described as "connectivity checks" or "system diagnostics" — are caught by execution graph analysis.
Data exfiltration Agent reads sensitive data then transmits it to an external endpoint	Blocked	Multi-step exfiltration chains are tracked across a 50-action session window with a 30-minute time window. Read followed by external transmission is intercepted even when the steps are spread across many intervening actions.
Agent impersonation Agent presents itself as a different agent, system, or authority	Blocked	Output claiming a different identity than the agent's configured ID is blocked at the output layer. Honest self-identification passes. Authority fabrication — claiming permissions or roles not in agent configuration — is blocked independently.
Social engineering of humans Agent generates deceptive content designed to manipulate a human recipient	Blocked	Deception signals in outbound content — fake urgency, false authority claims, impersonation language, provenance laundering — are detected and blocked before delivery. A single high-weight signal is sufficient to block.
Sub-agent spawning Agent attempts to initialize or launch unauthorized agent instances	Blocked	Spawning intent is detected across 19 semantic variants — from explicit commands to indirect operational language including bootstrap, provision, fork, and spin up. Operator-authorized spawning passes cleanly.
Lateral movement Agent accesses systems or data stores outside its authorized scope	Blocked	Scope boundary enforcement evaluates the action target before evaluating instruction framing. Lateral movement described as "data access," "adjacent system access," or "efficiency optimization" is blocked because the target is out of scope, regardless of language.
MCP server injection Malicious MCP servers inject hidden instructions into agent context via tool responses	Blocked	MCP server identity is verified before registration using proof-of-possession. Tool responses are sanitized before reaching agent context. Unverified servers are rejected at the enforcement layer.
ClickFix / binary execution Social engineering attacks that trick agents into downloading and executing malicious binaries	Blocked	Download-then-execute chains detected across 5 attack patterns before execution. Confidence-scored 0.72–0.95. Tier-aware: Individual SDK blocks at threshold, Business tier adds operator alerts.
Tool tampering Tool definitions or responses modified after initial verification	Blocked	SHA-256 hash verification with TOCTOU mitigation detects tool modification between verification and execution. Atomic checks prevent race condition exploits. Business tier adds Telegram operator alerts on detection.
Behavioral drift Agent's actions gradually diverge from its authorized objective across a session	Detected	Each action is scored against the agent's declared goal at instantiation. Cumulative drift above a configurable threshold triggers escalation — catching sophisticated multi-step attacks where no individual action triggers a block but the trajectory reveals the true objective.

Individual + Business

Structured input validation — encoding and obfuscation detection

Before your AI system reads anything, ORILink also validates the structure and encoding of incoming data. Injection payloads hidden inside JSON fields, base64-encoded instructions, Unicode substitution tricks, zero-width character insertion — all caught and normalized before the model sees them.

—Injection payloads hidden in JSON fields, API responses, and object attributes
—Base64-, URL-, and hex-encoded instructions — decoded and rescanned, including nested encodings
—Unicode homograph substitution, zero-width character insertion, and comment injection — normalized to canonical form before pattern matching
—Every detection logged with a SHA-256 hash of the decoded content — no plaintext payloads stored

Individual + Business

Output leakage detection — before your AI system sends anything

Before your AI system sends anything, ORILink scans the output for sensitive data that shouldn't leave. API keys, passwords, private keys, credentials, and configuration secrets — caught before they reach the recipient, whether the leakage was accidental or intentional.

—API keys and tokens — OpenAI, Anthropic, AWS, GitHub, Google, and generic high-entropy patterns
—Credentials and passwords
—Private keys and certificates
—System prompt reconstruction and leakage
—Sensitive configuration data
—Tool output anomalies — type, content, and exfiltration patterns detected before agent reasoning

Multi-agent deployments

Multi-agent trust enforcement

Agent-to-agent trust enforcement (ORIMesh) is included in every tier, including the Individual SDK. The Business SDK adds a tamper-evident audit registry and real-time monitoring across your full agent deployment.

Threat	Status	How it works
A2A contagion A compromised agent passes malicious instructions to peer agents through trusted channels	Blocked	Every agent-to-agent message carries a tamper-evident provenance envelope. Trust weights are immutable through forwarding chains — a compromised agent cannot elevate its authority to influence peers. Messages without a valid envelope are rejected at the receiving agent's enforcement layer.
Trust elevation via forwarding Agent attempts to increase its authority by relaying messages through A2A channels	Blocked	Trust weights assigned at message origin are immutable. No forwarding agent can elevate the trust weight of content it relays. The receiving agent enforces the original annotation assigned at first ingestion.
Circular relay loops Agents relay messages to each other indefinitely, consuming resources without bound	Blocked	Circular forwarding chains are detected by origin recurrence tracking in the provenance envelope. Non-owner-initiated loops are blocked. Owner-authorized loops are flagged for operator review. Volume-based loop detection catches runaway patterns that don't form explicit circles.
Unverified content forwarding Agent forwards content from external or unverified sources to peer agents without provenance annotation	Blocked	External data source inference automatically classifies forwarded content from third-party services, external URLs, and ambiguous origins as unverified. Ambiguous origin defaults to unverified — fail-secure by design. ORIMesh provenance or explicit operator authorization provides the verified escape.
Audit trail tampering Enforcement records are modified or deleted to conceal an attack	Detected	All enforcement decisions are stored in a tamper-evident SHA-256 hash chain. Any modification to the registry — including selective deletion of entries — is detectable on integrity verification. The hash chain is verified on demand and on export.
Cross-agent anomaly patterns Attack patterns that span multiple agents or sessions and are invisible at the individual action level	Monitored	ORIMark correlates enforcement decisions across all enforcement layers and all agents in a deployment. Cross-layer anomaly patterns — clean session history followed by sudden drift escalation and output leakage on the same agent — trigger alerts. Block rate thresholds, repeated block patterns, and integrity failures all surface as operator alerts.

Integrates with your existing security infrastructure. ORILink exports enforcement decisions as structured JSON, streams alerts via webhook to your configured notification channel, and maintains a queryable audit registry with CSV and JSON export for compliance reporting. It is designed to sit alongside your SIEM, SOC tooling, or existing agent orchestration framework without replacement or reconfiguration of either.

See these defenses running live — 43 days, 2,536 blocks, 42 enforcement categories. View the production report →

What gets blocked. Before it reaches your agent.

Core enforcement — all deployments

Structured input validation — encoding and obfuscation detection

Output leakage detection — before your AI system sends anything

Multi-agent trust enforcement

What gets blocked.
Before it reaches your agent.