Blog

Page 131

12 articles

Tokenizer Arithmetic: The Hidden Layer That Bites You in Production
BPE tokenization creates predictable failure modes that break structured output parsers, corrupt caching strategies, and cause cost estimates to collapse under real traffic — before you blame the model, check the tokenizer.
insiderllm
Apr 1410 min
The Trust Calibration Gap: Why AI Features Get Ignored or Blindly Followed
Most AI product failures aren't model failures — they're trust failures. Either users ignore the AI entirely or they follow it without scrutiny. Here's how to design for calibrated trust.
insiderai-engineering
Apr 149 min
Trust Transfer in AI Products: Why the Same Feature Ships at One Company and Dies at Another
Identical AI features succeed in one company and fail in another. The gap isn't model quality — it's trust architecture. How brand credibility, organizational culture, and institutional endorsement determine whether an AI product earns a chance to prove itself.
ai-producttrust
Apr 149 min
When the Prompt Engineer Leaves: The AI Knowledge Transfer Problem
Prompts accumulate invisible business logic, tacit decisions, and undocumented edge-case fixes. When the author leaves, the institutional knowledge goes with them — and the costs are real.
insiderprompt-engineering
Apr 149 min
Why A/B Tests Fail for AI Features (And What to Use Instead)
Standard A/B tests break when applied to AI features. Non-deterministic outputs, novelty bias, and covariate drift invalidate results — here's which measurement methodologies actually work.
aiexperimentation
Apr 149 min
Zero-Downtime AI Deployments: It's a Distributed Systems Problem
Most teams treat prompt updates as config changes. They're not — they're production deployments with four independent migration surfaces. Here's the distributed systems framework that keeps AI systems reliable during model upgrades, prompt bumps, and tool schema changes.
deploymentai-engineering
Apr 1410 min
The Adapter Compatibility Cliff: When Your Fine-Tune Meets the New Base Model
LoRA and PEFT adapters are dimensionally locked to the base model they were trained on. When providers update the underlying model — silently or otherwise — your fine-tune can fail loudly with shape errors or, worse, degrade without raising any alarms. Here is what breaks, why it breaks, and how to protect production fine-tunes against base model updates.
llmfine-tuning
Apr 1311 min
Agent Memory Garbage Collection: Engineering Strategic Forgetting at Scale
Production agent memory systems degrade silently as stale facts and contradictions accumulate. Generational decay tiers, semantic deduplication, contradiction detection, and adaptive compression form a GC pipeline that keeps long-running agents reliable — with concrete algorithms borrowed from runtime garbage collection.
agent-memoryai-engineering
Apr 1310 min
The AI Code Review Trap: Why Faster Reviews Are Making Your Codebase Worse
AI tools make engineers faster at writing and approving code — but defect escape rates are climbing. Here's the data on automation bias, silent logic failures, and the review protocols that actually catch AI bugs.
aicode-review
Apr 1310 min
The CAP Theorem for AI Agents: Why Your Agent Fails Completely When It Should Degrade Gracefully
Most AI agents fail completely when a single tool goes down — the same consistency-vs-availability tradeoff distributed databases solved decades ago. Here is how to design the partial-availability path.
ai-agentsdistributed-systems
Apr 139 min
Cascading Context Corruption: Why One Wrong Fact Derails Your Entire Agent Run
A single hallucinated fact in step 3 of a 25-step agent run can silently corrupt every subsequent conclusion. Learn the three propagation vectors, checkpoint-and-verify patterns, and architectural strategies that prevent cascading context corruption in production agent systems.
ai-agentsreliability
Apr 138 min
Your Code Review Process Is Optimized for the Wrong Failure Mode
AI-generated code shifts defects from typos to architectural drift, hallucinated APIs, and cargo-culted patterns — yet reviewers rubber-stamp it faster. A practical checklist and metrics framework for adapting your review process.
code-reviewai-engineering
Apr 138 min

About Tian Pan

I'm Tian Pan, an engineer-founder focused on agentic engineering — building autonomous AI systems and scaling engineering teams. I write practical guides on system design, technical leadership, and shipping with AI agents. Previously an early engineer at Uber, Brex, and IoTeX.

Page 131

Tokenizer Arithmetic: The Hidden Layer That Bites You in Production

The Trust Calibration Gap: Why AI Features Get Ignored or Blindly Followed

Trust Transfer in AI Products: Why the Same Feature Ships at One Company and Dies at Another

When the Prompt Engineer Leaves: The AI Knowledge Transfer Problem

Why A/B Tests Fail for AI Features (And What to Use Instead)

Zero-Downtime AI Deployments: It's a Distributed Systems Problem

The Adapter Compatibility Cliff: When Your Fine-Tune Meets the New Base Model

Agent Memory Garbage Collection: Engineering Strategic Forgetting at Scale

The AI Code Review Trap: Why Faster Reviews Are Making Your Codebase Worse

The CAP Theorem for AI Agents: Why Your Agent Fails Completely When It Should Degrade Gracefully

Cascading Context Corruption: Why One Wrong Fact Derails Your Entire Agent Run

Your Code Review Process Is Optimized for the Wrong Failure Mode

About Tian Pan