Blog

Page 142

12 articles

The Warranty Problem: Who Pays When Your AI Feature Is Wrong?
Software warranties assumed deterministic behavior — AI features break that assumption. A practical guide to the liability, insurance, and contract gaps engineering teams face when shipping non-deterministic systems.
insiderai-liability
Apr 119 min
When Your Agents Disagree: Consensus and Arbitration in Multi-Agent Systems
How to resolve conflicting outputs from peer AI agents when there's no ground truth — covering majority voting, confidence weighting, judge models, and when to surface disagreement to users rather than hide it.
insidermulti-agent
Apr 1111 min
Write-Ahead Logging for AI Agents: Borrowing Database Recovery Patterns for Crash-Safe Execution
Database WAL patterns map directly to AI agent workflows — an execution journal that logs intent before action and outcome before advancing enables skip-replay recovery, exactly-once side effects, and deterministic resumption after mid-workflow crashes.
insiderai-agents
Apr 1110 min
Capability Probing: How to Map Your Model's Limitations Before Users Do
Map your LLM's failure boundaries before deployment using probe suites, capability matrices, canary prompts, and a probe-to-regression pipeline that catches silent regressions across model upgrades.
llm-testingcapability-probing
Apr 1010 min
CZ's 'Freedom of Money': from a Jiangsu Boy to a Crypto Empire - Chapter-by-Chapter Summary
A comprehensive chapter-by-chapter breakdown of Changpeng Zhao's autobiography, 'Freedom of Money.' From a rural Jiangsu village to a Canadian immigrant, from a Wall Street coder to founding the world's largest crypto exchange, and his journey through a guilty plea, prison, and newfound freedom—all 25 chapters detailed.
cryptobiography
Apr 1039 min
The Hidden Token Tax: How Overhead Silently Drains Your LLM Context Window
System prompts, tool schemas, and chat history silently consume 30-60% of your LLM context window before user content arrives — here's how to audit and cut the overhead.
insiderllm-optimization
Apr 108 min
The Infinity Machine: How Demis Hassabis Built DeepMind and Chased AGI
From chess prodigy to Nobel Prize co-winner, Demis Hassabis built DeepMind into the world's most ambitious AI research lab. Sebastian Mallaby's biography traces the scientific breakthroughs, corporate battles, and existential dilemmas behind the quest for artificial general intelligence.
insiderai
Apr 10160 min
The Accuracy Threshold Problem: When Your AI Feature Is Too Good to Ignore and Too Bad to Trust
Deploying an AI feature at 70–85% accuracy creates a uniquely dangerous zone: good enough to attract habitual use, bad enough to cause visible failures that collapse user trust. Here's what the research says about why this zone is so treacherous and how to design your way out of it.
ai-engineeringreliability
Apr 910 min
Adversarial Agent Monitoring: Building Oversight That Can't Be Gamed
Single-layer LLM-as-judge monitoring fails over 52% of the time against sophisticated agents. The four-layer defense stack — behavioral fingerprinting, action auditing, multi-monitor consensus, and tool-layer constraints — that holds up in production.
insiderai-agents
Apr 910 min
Why Agent Cost Forecasting Is Broken — And What to Do Instead
Traditional cost forecasting fails for AI agents because execution paths are stochastic, not deterministic. Learn decision-loop cost modeling, Monte Carlo simulation, and the guardrail patterns that make agent spend predictable.
ai-agentsfinops
Apr 910 min
Agent-Friendly APIs: What Backend Engineers Get Wrong When AI Becomes the Client
Most REST APIs silently break when AI agents become the client — ambiguous errors cause retry loops, offset pagination corrupts traversals, and request-count rate limits collapse under multi-agent coordination. Here's what to fix and why it matters.
insiderapi-design
Apr 911 min
Agent Idempotency: Why Your AI Agent Sends That Email Twice
Production AI agents retry failed tool calls — and duplicate payments, emails, and real-world actions. Four battle-tested patterns from distributed systems make agent side effects safely retryable.
ai-agentidempotency
Apr 99 min

About Tian Pan

I'm Tian Pan, an engineer-founder focused on agentic engineering — building autonomous AI systems and scaling engineering teams. I write practical guides on system design, technical leadership, and shipping with AI agents. Previously an early engineer at Uber, Brex, and IoTeX.

Page 142

The Warranty Problem: Who Pays When Your AI Feature Is Wrong?

When Your Agents Disagree: Consensus and Arbitration in Multi-Agent Systems

Write-Ahead Logging for AI Agents: Borrowing Database Recovery Patterns for Crash-Safe Execution

Capability Probing: How to Map Your Model's Limitations Before Users Do

CZ's 'Freedom of Money': from a Jiangsu Boy to a Crypto Empire - Chapter-by-Chapter Summary

The Hidden Token Tax: How Overhead Silently Drains Your LLM Context Window

The Infinity Machine: How Demis Hassabis Built DeepMind and Chased AGI

The Accuracy Threshold Problem: When Your AI Feature Is Too Good to Ignore and Too Bad to Trust

Adversarial Agent Monitoring: Building Oversight That Can't Be Gamed

Why Agent Cost Forecasting Is Broken — And What to Do Instead

Agent-Friendly APIs: What Backend Engineers Get Wrong When AI Becomes the Client

Agent Idempotency: Why Your AI Agent Sends That Email Twice

About Tian Pan