Blog

Page 55

12 articles

The Acknowledgment-Action Gap: Your Agent's 'Got It' Is Not a Commitment
Production agents confidently confirm actions that never happened because teams treat chat text as a contract instead of the tool call. A pattern for separating narration from commitment.
insiderai-agents
Apr 2211 min
The Agent Backfill Problem: Your Model Upgrade Is a Trial of the Last 90 Days
When a smarter model disagrees with the one you shipped, every durable agent decision becomes a contested record. A framework for eval, decision, and action replay — plus the architectural prerequisites and policy matrix you need before the next upgrade.
insiderai-agents
Apr 2212 min
The Agent Capability Cliff: Why Your Model Upgrade Made the Easy 95% Perfect and the Hard 5% Your Worst Quarter
Model upgrades raise your aggregate pass rate while concentrating the residual failures on the hardest 5% of traffic — here's how stratified evals and capability-frontier probing expose the cliff before it lands in your on-call rotation.
insiderai-engineering
Apr 2211 min
Agent Idempotency Is an Orchestration Contract, Not a Tool Property
Tool-level idempotency keys are not enough when a non-deterministic planner can re-emit the same action. The contract has to live at the orchestration boundary, keyed by structural run state — not by model-authored arguments.
ai-agentsidempotency
Apr 2210 min
Agent Latency Budgets Are Trees, Not Lines — You Have Been Debugging the Wrong Axis
Agent latency is a nested tree of planning calls, tool fan-outs, and sub-agents — flame graphs sorted by duration hide the critical path, so local optimizations miss the real budget violation. Here's how to budget, propagate deadlines, and observe slack the tree way.
agent-latencyobservability
Apr 2212 min
Agent Memory Schema Evolution Is Protobuf on Hard Mode
Agent memory has two schemas — the store and the model's context — and only one of them migrates with a SQL script. Why protobuf's additive-only discipline is the right starting point, and what the shadow-write playbook needs on top.
insiderai-agents
Apr 2211 min
Silent Success: When Your Agent Says Done and Nothing Actually Happened
Agents fail by continuing to talk. Confident prose papers over tool errors while writes never commit. The fix: demote the model's claim to a hypothesis, promote tool responses and post-action probes to authoritative signals, and measure effect landing instead of turn success.
insiderai-agents
Apr 2210 min
The Agent Paged Me at 3 AM: Blast-Radius Policy for Tools That Reach Humans
Granting an agent PagerDuty access is an infra decision with product-team consequences. A control plane for human-facing tools — rate limits, dry-run, off-ramps — that prompts can't enforce.
insiderai-agents
Apr 2212 min
Your AI Chat Transcripts Are Evidence: Retention Design for LLM Products Under Legal Hold
Chat logs are ESI. Design retention in four tiers, build a hold registry before you need it, and tag provenance at ingestion — or pay for the same architecture in the middle of discovery.
insiderai-infrastructure
Apr 2211 min
The AI Interview Collapse: Engineering Hiring Has Lost Its Signal
Technical roles saw 48% AI-assisted cheating across 19,368 interviews and 61% of cheaters cleared the bar. A look at why detection cannot win, why no-AI policies punish honest candidates, and the interview formats replacing the broken ones.
hiringengineering-management
Apr 2211 min
The AI Observability Leak: Your Tracing Stack Is a Data Exfiltration Surface
Hosted tracing SDKs quietly ship full prompts and responses past your trust boundary. A compliance playbook for LLM teams: classify fields, scrub before egress, audit the SDK as policy.
ai-observabilitydata-privacy
Apr 2211 min
Your AI Product Needs an SRE Before It Needs Another Model
Most struggling AI teams run frontier models on 2012-era operations. The next hire that fixes it is usually an SRE, not another applied scientist.
sreai-engineering
Apr 229 min

About Tian Pan

I'm Tian Pan, an engineer-founder focused on agentic engineering — building autonomous AI systems and scaling engineering teams. I write practical guides on system design, technical leadership, and shipping with AI agents. Previously an early engineer at Uber, Brex, and IoTeX.

Page 55

The Acknowledgment-Action Gap: Your Agent's 'Got It' Is Not a Commitment

The Agent Backfill Problem: Your Model Upgrade Is a Trial of the Last 90 Days

The Agent Capability Cliff: Why Your Model Upgrade Made the Easy 95% Perfect and the Hard 5% Your Worst Quarter

Agent Idempotency Is an Orchestration Contract, Not a Tool Property

Agent Latency Budgets Are Trees, Not Lines — You Have Been Debugging the Wrong Axis

Agent Memory Schema Evolution Is Protobuf on Hard Mode

Silent Success: When Your Agent Says Done and Nothing Actually Happened

The Agent Paged Me at 3 AM: Blast-Radius Policy for Tools That Reach Humans

Your AI Chat Transcripts Are Evidence: Retention Design for LLM Products Under Legal Hold

The AI Interview Collapse: Engineering Hiring Has Lost Its Signal

The AI Observability Leak: Your Tracing Stack Is a Data Exfiltration Surface

Your AI Product Needs an SRE Before It Needs Another Model

About Tian Pan