Model Blindness

Ask First described what the user experiences: a two-phase model where suggestions appear outside the conversation, the chatbot doesn’t generate them, and no ad fires until the user opts in. This post describes the architecture that makes model independence provable.

The Industry Is Going the Other Direction

The dominant trend in chatbot advertising is deeper integration of ads into the model.

Alibaba’s research lab published LLM-Auction, which post-trains the model via RLHF to balance ad revenue and user experience. The model’s weights encode commercial incentives directly. Kontext, the SSP bridging PubMatic to chatbot inventory, uses the same LLM powering the chatbot to generate contextual ads. Google Research’s RAG-based ad auction puts ads into the model’s context window via retrieval, so the model explicitly sees the ad content during inference.

The integration approach has a concrete problem beyond trust. Fine-tuning is too slow: ad campaigns change hourly, training takes weeks. And too expensive: retraining a frontier model per campaign rotation is economically absurd. Whoever controls the training data controls what the model recommends, and you can’t audit learned bias in weights the way you can audit a scoring function. An auction clears in milliseconds and updates in real time.

OpenAI claims their ads run on “separate systems,” what they call the Answer Independence Principle. But it’s a policy claim with no attestation. No one outside OpenAI can verify the separation. Simon Willison noted that ChatGPT includes an option to “Ask ChatGPT” about a specific ad, a user-initiated bridge across the supposed architectural boundary. The separation has a door in it.

Why Users Can’t Self-Police

A team at Michigan fine-tuned a 14B model to serve targeted ads (Phi-4-Ads) and ran it past human evaluators. The results: users couldn’t detect the embedded ads. Worse, they preferred responses with hidden advertisements. Ad injection degraded model performance by at most 3%.

This is the strongest argument for architectural enforcement. If users can’t tell when a response is commercially influenced, no amount of labeling or transparency reports fixes the problem. The system has to enforce separation because humans can’t detect violations.

The Architecture

The chatbot cannot see the ad system. This is by construction, enforced by enclave boundaries.

The chatbot runs in its own enclave. The ad system runs in a separate one. There is no communication channel from the ad system to the chatbot. It cannot know whether a suggestion appeared, whether the user opted in, or who the advertiser was.

The only data flow: conversation → embedding → ad system → UI. One-directional.

Perplexity killed ads because first-party selection destroyed trust. OpenAI kept ads and ate the trust damage. Model blindness is the alternative: run ads and preserve trust, because the model generating your answer provably doesn’t know the ads exist.

This requires the ad system to be operated by a third party, not Perplexity, not the chatbot provider. If the same company runs the conversation and the auction, the separation is only a policy again. Nothing enforces it. A third-party exchange inside a TEE enclave is the only arrangement where no single entity controls both the answers and the ads.

The Infrastructure Exists

The components for this are already in production individually.

TEE-attested auctions. CloudX runs ad auctions inside AWS Nitro Enclaves with open-source clearing code. PCR measurements prove the exact code running inside the enclave. Neither the platform nor the cloud provider can tamper with the auction.

GPU-isolated inference. The NVIDIA H100 supports confidential computing with a hardware TEE at under 5% performance overhead.

Confidential inferencing platforms. Azure confidential inferencing provides end-to-end prompt protection with AMD SEV-SNP enclaves. Neither the service operator nor the cloud provider can access prompts.

Browser-side TEE auctions. Google’s Protected Audience API already runs ad auctions inside TEEs at scale with open-source, externally verifiable binaries.

Information flow control for ML pipelines provides the theoretical framework: treat ad content as a restricted information class and enforce one-directional flow at the architecture level.

Assembling the trust layer is integration work, though the auction mechanism itself still has open research questions around equilibrium convergence and parameter calibration.

The Trust Chain

Four links. Every one verifiable.

Verifiable intent matching. Open-weight embedding models with published hashes, so anyone can reproduce the embedding and verify the auction scored it correctly.
Verifiable auction execution. A sealed TEE enclave running attested code. The scoring function is published and cryptographically proven to be what executed.
Model independence. Separate enclaves, one-directional data flow, attested separation. The chatbot doesn’t know advertising exists.
User-initiated impressions. The user asks first.

Break any link and you’re back to the same extractive ad layer in a new interface. Hold all four and you have advertising that the user can honestly consent to and the chatbot can honestly ignore.

Nothing in this architecture is specific to chatbots. If the embedding is computed on-device and the proximity check runs against a cached advertiser index, the same indicator could work in any conversation, including between humans. A user who opts into the recommender gets the dot in their messaging app. The conversation never leaves the device for ad purposes. The person on the other end doesn’t know it’s on. This is the strongest version of “ask first”: you asked before any conversation even started.

Written with Claude Opus 4.6 via Claude Code. I directed the argument; Claude researched prior art and drafted prose.

Part of the Vector Space series.