Blog

Page 121

12 articles

Zero-Shot vs. Few-Shot in Production: When Examples Help and When They Hurt
The empirical case for when to use zero-shot vs. few-shot prompting — and why static examples at scale often make things worse.
llm, prompting
Apr 16 · 10 min
Agent Fleet Observability: Monitoring 1,000 Concurrent Agent Runs Without Dashboard Blindness
Individual span trees per agent run collapse at fleet scale. Here are the fleet-level signals, sampling strategies, and behavioral fingerprinting techniques that actually work when you're running hundreds of concurrent agents.
insider, observability
Apr 15 · 12 min
Agent Identity and Least-Privilege Authorization: The Security Footgun Your AI Team Is Ignoring
When your AI agent calls internal APIs, whose identity does it present? Most teams give agents a broad service account token and move on. Here's why that's a security footgun and what production-grade agent authorization actually looks like.
security, ai-agents
Apr 15 · 9 min
The Agent Loading State Problem: Designing for the 45-Second UX Abyss
Users abandon silent UIs at ten seconds, but modern agents run for thirty to one hundred twenty seconds. The gap is a design surface most teams still fill with a spinner; here is what to ship instead.
ai-agents, ux
Apr 15 · 11 min
Your Agent Traces Are Lying: Cardinality, Sampling, and Span Hierarchies for LLM Agents
Distributed tracing was designed for ~10 spans per request. A single agent run can produce hundreds, and default OpenTelemetry configurations systematically undercount the work. Here's the span hierarchy, tail sampling policy, and payload handling that survive production agent workloads.
insider, observability
Apr 15 · 11 min
Agentic Task Complexity Estimation: Budget Tokens Before You Execute
LLM agents commit resources before knowing how deep a task runs. Here's the complexity estimation layer — tiered routing, budget-tracker injection, plan template caching, and DAG-based decomposition — that prevents irreversible early mistakes and makes agent costs predictable.
insider, agent-architecture
Apr 15 · 10 min
When Your AI Agent Consumes from Kafka: The Design Assumptions That Break
Running AI agents on message queues breaks the assumptions baked into queue semantics. Here's how idempotency, ordering, and backpressure work differently when your consumer is stochastic.
insider, ai-agents
Apr 15 · 11 min
AI-Assisted Incident Response: How LLMs Change the SRE Playbook Without Replacing It
AI copilots in on-call workflows can surface correlated signals and draft runbook actions—but they introduce failure modes traditional SREs aren't trained to catch. A practical guide to integrating LLMs into incident response without making outages worse.
sre, incident-response
Apr 15 · 11 min
The AI Capability Ratchet: How One Smart Feature Breaks Your Entire Product
Shipping one impressive AI feature permanently raises user expectations for every other feature in your product — including ones you haven't touched. Here's the mechanism, real examples, and how to manage the expectation debt before it hits your support queue.
ai-product, product-strategy
Apr 15 · 10 min
The AI Dependency Footprint: When Every Feature Adds a New Infrastructure Owner
Every AI feature you ship introduces new infrastructure dependencies — vector databases, embedding models, eval frameworks, GPU serving layers. The problem isn't the dependencies themselves. It's that nobody owns them.
ai engineering, infrastructure
Apr 15 · 9 min
AI Feature Decommissioning Forensics: What Dead Features Teach That Successful Ones Cannot
The AI features your company quietly killed contain the failure patterns your next launch will hit. A forensic template, a leading-indicator catalog, and how to read the evidence dead features leave behind.
insider, ai-engineering
Apr 15 · 11 min
The AI Incident Severity Taxonomy: When Is a Hallucination a Sev-0?
Traditional severity classification breaks for probabilistic AI systems. A multidimensional framework for classifying AI incidents — beyond binary broken/working to capture scope, reversibility, and compounding damage.
insider, ai-engineering
Apr 15 · 11 min

About Tian Pan

I'm Tian Pan, an engineer-founder focused on agentic engineering — building autonomous AI systems and scaling engineering teams. I write practical guides on system design, technical leadership, and shipping with AI agents. Previously an early engineer at Uber, Brex, and IoTeX.
