Skip to content

5 AI Problems Bounded Structurally — at the Protocol Level

The hardest open problems in deployable AI today are not problems of intelligence. They are problems of trust. The Modular Proactive AI Cloud OS protocol bounds them at the protocol level — by architectural constraint where the layer is structural, by normative MUSTs and runtime checks where the boundary is contractual. The scope of each guarantee is explicit (see specification §7.7); the contract reduces, it does not eliminate in the formal mathematical sense.


1. Why "structural" matters

The five problems treated here are problems every serious deployment of AI has had to confront: fabrication of facts that were never retrieved, unauthorized execution of consequential actions, loss of context across sessions and channels, brittle multi-step coordination, and the inability to guarantee that private data did not transit a path it should not have. Each has been the subject of serious work at the foundation-model layer — RLHF, Constitutional AI, refusal training, longer context windows, retrieval augmentation, content policy. The work continues and matters.

But there is a property all of those approaches share. They are probabilistic. They reduce frequency. They cannot establish guarantees. A model trained to refuse jailbreaks can be jailbroken by a new technique. A model trained to ground its claims in retrieved facts can still hallucinate one request in a thousand. A longer context window pushes the forgetting boundary further out — it does not abolish it. Probabilistic improvements at the model layer are valuable. They are not architectural guarantees.

ICNLI is the Modular Proactive AI Cloud OS standard — three load-bearing words. Modular because the kernel is finite and extensions plug in. Proactive because the system watches the domain and surfaces concerns before being asked. AI Cloud OS because AI is the primary operational user of the system (the human remains the authorizing principal), not a feature inside an application. The protocol solves the five problems at a different layer of the stack — outside the model, in the protocol around the model — using mechanisms that are deterministic by construction. The fact ledger is enforced by the kernel. The confirmation gate is enforced by the kernel. The context model is structured by the kernel. Chain semantics are contracted by the protocol. The privacy boundary is a property of the data path.

The defence against a hostile reviewer is the same in every case. The model can still be wrong; the system cannot fabricate ungrounded claims. The model can still be persuaded; the kernel cannot be persuaded. The model can still forget; the protocol does not forget. Probabilistic methods reduce frequency. Structural methods establish guarantees. The two are complementary layers.


2. Problem 1: Hallucination → Anti-Fabrication Contract

The hook: the model can be wrong; the system cannot fabricate ungrounded claims.

The industry has spent years reducing hallucination inside the model. RLHF reduces it. Constitutional AI reduces it. Grounding prompts and retrieval augmentation reduce it. None of them eliminate hallucination, and none can — a probabilistic generator can in principle produce a well-formed, plausible, false sentence on any forward pass. That property is intrinsic to the model layer; reducing frequency is the best the layer can do.

The structural insight is that the problem does not need to be solved inside the model. It can be bounded outside it. ICNLI's Anti-Fabrication Contract has four normative components. A per-session fact ledger records every tool's structured output verbatim and makes it available to the model on subsequent turns. A grounded narration requirement binds user-facing prose to facts present in the current turn's results or in the ledger: the system MUST NOT claim a tool produced a value not in that tool's output, MUST NOT quantify a result not enumerated in the underlying facts, and MUST NOT attribute a status to a resource whose status was not retrieved. Intent-routed tool selection restricts the model's tool surface to the classifier-selected subset — tools outside that surface are structurally unreachable, and a tool the model never sees cannot be invented. A PII masking gate masks personally identifiable information in context delivered to the model by default.

What makes this defensible is that the contract does not depend on the model being well-behaved. The fact ledger is enforced by the kernel. The grounded-narration check is enforced outside the model. The tool surface is determined by the router before the model ever sees it. The model can still be wrong; the system cannot fabricate ungrounded claims. The system's output is bounded by construction.

The implication for deployment is that the foundation model becomes a swappable component. A compliant implementation MAY use any model — Anthropic Claude, OpenAI GPT, Google Gemini, an open-weights model, a future model — and inherit the same structural fabrication boundary, because the boundary is not in the model. It is in the protocol around the model.

Cross-link: Anti-Fabrication Requirements.


3. Problem 2: AI Control & Safety → TWO-STEP Architectural Human-in-the-Loop

The hook: no prompt can talk the kernel out of asking.

Foundation-model providers have invested heavily in alignment and refusal training. Every safety improvement at the model layer raises the floor for the industry. It is also, by construction, statistical. Refusal training reduces the probability that the model produces a harmful output; it cannot reduce that probability to zero. Every published jailbreak is a proof of that property, and the next jailbreak is a question of when, not whether.

For consequential operations, statistical refusal is the wrong primitive. The right primitive is architectural. ICNLI mandates TWO-STEP confirmation for every state-changing operation: the system proposes with impact analysis, the actor confirms, and only then does the system execute. The confirmation gate is enforced by the kernel and is outside the model's authority. No prompt can talk the kernel out of asking, because the kernel is not what answered. The model never had the authority to bypass.

Beyond TWO-STEP, the protocol classifies every tool by safety level 0 through 4. Confirmation escalates with level: optional at L1, TWO-STEP at L2, TWO-STEP with full impact analysis at L3, danger-phrase-plus-cooling-period at L4. Impact analysis is mandatory from L2 upward and enumerates direct targets, cascade targets derived from the context model, reversibility, and backup availability. CRITICAL operations require a danger phrase demonstrating the actor understands what is about to occur and a cooling period that makes irreversible operations take longer than reversible ones. Role-based access is enforced on every dispatch.

The distinction is the whole point: architectural human-in-the-loop is deterministic, while refusal training is probabilistic. Both layers have value. They are not in opposition. ICNLI mandates the architectural layer because that is the layer at which deterministic guarantees are possible.

Cross-link: Safety Layer.


4. Problem 3: Memory & Context Persistence → 9-Level Context + User Intelligence Profile

The hook: the memory was never inside the model.

A context window is not memory. Long-context windows are an engineering achievement, but a context window is a buffer the model can see right now — not a persistent, structured, per-actor awareness model that survives the session and a model upgrade. RAG and vector stores are valuable retrieval tools, but retrieval is approximate by nature, and approximate memory is not the same primitive as architectural memory. Memory plugins bolted on top of stateless model APIs inherit the statelessness of the underlying layer.

ICNLI's structural answer is the 9-Level Context Model as a normative substrate, plus the User Intelligence Profile from the proactive layer. The nine levels resolve actor identity, role and permissions, recent activity, environmental state, domain inventory, dependency graphs, ongoing operations, alerts and anomalies, and cross-channel continuity. Each level is the kernel's responsibility, each level is resolved structurally on every turn, and each level survives the session. The User Intelligence Profile is a continuously maintained kernel resource — refreshed in the background, independent of any inbound request, available to the classifier and the model layer on every turn.

The defence against a hostile reviewer is that this is architecture, not retrieval. The memory lives in the protocol layer, in structures the kernel maintains and the model consumes. A model can be swapped underneath without losing the actor's memory, because the memory was never inside the model. A compliant implementation that upgrades from one foundation model to another preserves every actor's awareness state through the transition. That property is impossible at the model layer alone.

Cross-link: Context Model.


5. Problem 4: Multi-step Agent Reliability → Chain Orchestration with Read-Before-Write

The hook: the protocol promises the semantics any orchestrator must satisfy.

Every agent framework worth taking seriously has invented its own answer to multi-step coordination. LangChain, function-calling loops, ReAct-style agents, and the other patterns competing in this space are doing real engineering. Each invention re-implements safety, dependency handling, retry semantics, and confirmation differently. Each framework's reliability is bounded by the brittleness of its loop. None of them is a standard — they are libraries, and a library is a different kind of artifact than a contract.

ICNLI defines the answer once, at the protocol layer. Chain Orchestration is normative at Conformance Level 3, with six components: a single orchestrator (no per-tool freelancing in the model), explicit dependency declaration on every step, topological execution ordering so reads precede dependent writes, fail-fast on dropped destructive plans (silent fall-through is forbidden for write or destructive intents), per-step audit, and TWO-STEP composition (the confirmation gate composes with multi-step chains rather than being bypassed by them).

Step-Output References complete the picture. A reference is a vendor-neutral mechanism by which the verbatim structured output of an earlier step flows into the input of a later step. References resolve stably under topological reordering. The data does not pass through the model. Paraphrase is not a failure mode — the bytes are the bytes.

The defence against a hostile reviewer is that this is a contract, not a library. The protocol promises the semantics any orchestrator must satisfy: read-before-write, topological ordering, verbatim plumbing, fail-fast on dropped destructive plans, composition with confirmation. A compliant implementation MAY implement the orchestrator in any language, with any concurrency model, against any task queue — provided the contract is satisfied. That is the right shape for a standard. Libraries compete on engineering; standards define what compliance means.

Cross-link: Intent Routing and Chain Orchestration.


6. Problem 5: Privacy → Architectural Data Boundary, Not Policy

The hook: the model cannot un-see what was sent to it; the protocol controls what is sent.

Privacy at the model layer is a promise. The provider promises not to log, not to train on the inputs, not to retain beyond a window. Promises can be honored, audited, and regulated — and have value when they are. None of those mechanisms produce a deterministic guarantee that private data did not transit a path it should not have transited. The model cannot un-see what was sent to it. Whatever was sent has been seen, and the only question afterwards is the disposition of the trace.

ICNLI's structural answer is that privacy is a property of the data path, and the protocol controls the data path. The PII masking gate in the Anti-Fabrication Contract mandates that personally identifiable information MUST be masked in the context delivered to the model by default; exposure MUST be opt-in, configurable at deployment time, and auditable. The 9-level context model is permission-filtered before it reaches the model — the AI receives a structured, role-bounded, masked view of the domain, not the raw underlying data. The audit substrate records what was sent, what was masked, and who authorized any unmasking. On-premises and air-gap deployment are supported architecturally for domains where the data cannot leave the environment.

The defence against a hostile reviewer is that this is the only layer at which privacy is a property rather than a policy. The model cannot un-see what was sent to it; the protocol controls what is sent; the protocol mandates the boundary be enforced. Compliance is auditable at the protocol layer in a way it cannot be at the model layer alone.

Cross-link: Security Considerations.


7. Why this matters now

The bottleneck of useful AI in 2026 is no longer the model. Capability has crossed the threshold of usefulness across most domains; the next foundation model release will not be the unlock that decides which industries can ship AI into their workflow. The bottleneck is trustworthiness — the structural guarantees a real domain needs before AI can be allowed to touch its state. Hospitals, banks, utilities, municipalities, and regulated enterprises do not lack interest in AI. They lack the architecture that would make AI deployable inside their constraints.

ICNLI is the layer at which that architecture lives. The five problems above are not the only problems in deployable AI, but they are the load-bearing ones, and they are all solved at the same layer — outside the model, in the protocol around the model. A serious deployment in 2026 inherits the structural guarantees of the protocol it is built on. ICNLI is published so those guarantees do not have to be reinvented per deployment.


8. How to verify these claims

Every claim on this page is anchored to a specific normative requirement in the ICNLI Specification v2.0. The Anti-Fabrication Contract lives in Section 7. TWO-STEP and the Safety Layer live in Section 8. The 9-Level Context Model lives in Section 4. Chain Orchestration and Step-Output References live in Section 6. The privacy boundary is enforced by the PII masking gate in the Anti-Fabrication Contract and by the data-path requirements in Section 14. Audit your compliant implementation against the Conformance Levels — the checklist is normative.


9. Next reading

Read the ICNLI Whitepaper for the full architectural argument and the narrative form of the five-problems chapter. Read the ICNLI Specification v2.0 for the normative MUSTs that make every claim on this page auditable. Both documents are open under CC BY-SA 4.0.


About the Author

Valentin Scerbacov, founder of Imperal, Inc., is the creator of ICNLI. This page distills the five-problems chapter of the v2.0 whitepaper into a punch-list form. The flagship implementation of ICNLI is Imperal Cloud — the first ICNLI AI Cloud OS — with Webbee as the reference agent of the first ICNLI AI Cloud OS; WebHostMost is Imperal's first enterprise client, running an ICNLI-compliant deployment across multi-region hosting infrastructure.


License

© 2026 Valentin Scerbacov / Imperal, Inc.

This page is licensed under the Creative Commons Attribution-ShareAlike 4.0 International License (CC BY-SA 4.0). You are free to share and adapt this material for any purpose, including commercial, under the terms of attribution and share-alike.

Full license: https://creativecommons.org/licenses/by-sa/4.0/

ICNLI™ is a trademark of Imperal, Inc. The CC BY-SA 4.0 license applies to the content of this page but does not grant rights to use the ICNLI trademark for branding or commercial purposes without written permission. Webbee™ is a trademark of WebHostMost.

  • Creator: Valentin Scerbacov
  • Trademark Owner: Imperal, Inc.
  • Website: icnli.org
  • Trademark inquiries: license@icnli.org
  • Flagship implementation: Imperal Cloud — the first ICNLI AI Cloud OS
  • Reference agent: Webbee, the agent of the first ICNLI AI Cloud OS
  • First enterprise production deployment: WebHostMost