How AI dev tools actually work.

No hand-waving. Written, searchable breakdowns of MCP, Claude Code, coding agents, and the internals behind the tools you use every day — each one paired with a video.

Read the tutorials →

Latest tutorials

How to Write Acceptance Criteria for LLM Output (Not Just 'Be Accurate')

Stop writing vague prompts. Learn how to define testable acceptance criteria for LLM output — covering format, edge cases, missing data, and ambiguity.

prompt engineeringLLM outputacceptance criteriaClaude
Agent Escalation: When to Hand Off to a Human vs. Keep Handling

Learn the four exact signals that should trigger agent-to-human escalation, why sentiment is never one of them, and how to structure a clean handoff summary card.

agentsescalationhuman-in-the-loopClaude
Building a Customer Support Agent with Claude: Tools, Policy Hooks, and Escalation

Learn how to build a production Claude agent with scoped tools, code-enforced policy caps, structured errors, clean escalation, and a logging plane.

Claudeagentstool useescalation
CCA-F Exam Study Guide: Claude Certified Architect Foundations

Map every CCA-F exam domain to the Claude stack layers, understand score weights, and build a focused study path using official episode blocks.

CCA-FClaude APIMCPClaude Code
Anthropic Agent SDK: Use Claude Code's Engine in Your App

Learn how the Anthropic Agent SDK exposes Claude Code's agent loop, built-in tools, and MCP support as a library you call from Python or TypeScript.

ClaudeAgent SDKClaude Codeagents
Claude Code Hooks Explained: Deterministic Guards for the Agent Loop

Learn how Claude Code hooks enforce hard rules at every lifecycle event — pre-tool, post-tool, session start/stop — that the model cannot override.

Claude Codehooksagent looptool use
Claude Code Extensions: Skills, Subagents, Hooks, and Plugins

Learn the four extension points in Claude Code — skills, subagents, hooks, and plugins — and when to reach for each one.

Claude CodeSkillsSubagentsHooks
How CLAUDE.md File Hierarchy Works: User, Project, Subtree, Local

Learn how Claude's four-tier CLAUDE.md hierarchy (user, project, subtree, local) controls which rules apply, and how precedence is resolved.

ClaudeCLAUDE.mdconfigurationprecedence
How to Run Claude Code Headlessly in CI/CD Pipelines

Learn how to use claude -p to run Claude Code headlessly in CI/CD: reviewing PRs, generating changelogs, and scripting AI tasks without an interactive shell.

Claude CodeCI/CDheadlessautomation
Claude's 5-Layer Stack: MCP, Hooks, Skills, and Subagents Explained

A structured map of every Claude feature across 5 layers and 2 cross-cutting planes, so you can place any new Anthropic release instantly.

ClaudeMCPagent loophooks
Confidence Fields and Human-in-the-Loop Routing for LLM Extraction Pipelines

Learn how to add a confidence field to LLM tool schemas and route low-confidence extractions to a human review queue automatically.

structured extractionconfidence scoringhuman-in-the-looptool calling
Context Engineering: Pin, Summarize, Prune, and Compact

Learn four techniques to keep long Claude sessions coherent and affordable: pin stable facts, summarize resolved turns, prune tool output, and compact under pressure.

context windowtokensClaudecontext engineering
How to Guarantee JSON Output from Claude with Structured Outputs

Learn how to use tool choice, Pydantic schemas, and Claude's native structured output to get reliable JSON from LLMs every time — no parser explosions.

structured outputClaudePydantictool use
How Claude Code Edits Your Repo: Inside the Agentic Edit Loop

Claude Code doesn't regenerate whole files — it reads, locates, and makes surgical string edits in a verify loop. Here's exactly how an agentic code editor changes your repo, step by step.

Claude CodeAI coding agentsClaudeagentic editing
How Claude Code's Agent Loop Works (and Why It Breaks)

Learn how Claude's agent loop works, what causes infinite loops and premature exits, and the four controls that keep agents on track.

Claude Codeagent looptool usestop reason
How Claude Tool Calling Actually Works: The Request-Execute Model

Learn how Claude's tool calling works under the hood: the model requests, your code executes. Covers tool use blocks, the three executor lanes, and schema tips.

Claudetool callingMCPAnthropic API
How LLM Tokens Work — And Why They Explain Your AI Bill

Claude never sees your words — it sees tokens. Here's what tokenization actually is, why it drives every dollar of your AI bill, and how to reason about token cost.

tokenstokenizationLLMClaude
How Claude's Context Window Works: Limits, Costs, and Overflow

Learn what fills Claude's context window, which models support 1M tokens, when pricing changes, and what happens when you overflow it.

Claudecontext windowtokenspricing
How to Write LLM Evals: Testing AI Apps with Real Data

Learn how to replace gut-feel LLM testing with a real eval harness: datasets, graders, CI score gates, and LLM-as-judge caveats.

evalsLLM testingCIClaude
How MCP Apps Work: Tools That Return Interactive UI

MCP tools can return real interactive UI — not just text. Here's how MCP Apps render forms, dashboards, and widgets inside Claude and ChatGPT, with a minimal working example.

MCPModel Context ProtocolClaudeMCP Apps
Anthropic Managed Agents: Claude Runs the Loop for You

Learn how Anthropic's managed agents API offloads the agent loop, tool sandbox, and persistent state to Anthropic's cloud, replacing hand-rolled loops.

Claudeagentsmanaged agentssessions
The 6 Multi-Agent Patterns That Actually Work in 2026

Orchestrator-worker, routing, pipeline, parallel fan-out, reflection, and debate — the six multi-agent patterns worth using, when each one pays off, and when a single agent still wins.

AI agentsmulti-agent systemsorchestrationLLM
How to Structure a Production Claude Agent: All Layers Explained

Learn the full layered architecture of a production Claude agent: gateway, model router, agent loop, scoped tools, policy hooks, prompt caching, and clean escalation.

Claudeagent architectureMCPprompt caching
How Prompt Caching Cuts Your AI Bill ~90% (and the Floor Trap)

Prompt caching reuses a stable prompt prefix so you stop paying full price to resend the same context. Here's how it works, the real numbers, and the floor trap that quietly costs you money.

prompt cachingClaude APIAI costAnthropic
Prompt Injection Attacks Explained: How to Defend Your AI Agent

Learn how prompt injection turns untrusted text into commands, why indirect injection is the dangerous case, and three layered defenses every agent needs.

prompt injectionClaudeAI securityagents
How to Build a Structured Data Extraction Pipeline with Claude

Learn how to wire forced schemas, few-shot examples, validation retry, confidence routing, and prompt caching into one production extraction flow.

structured outputtool useprompt cachingdata extraction
Temperature, Top-P, and Top-K Explained: Controlling LLM Randomness

Learn how temperature, top-p, and top-k sampling parameters shape an LLM's output distribution and when to tune each for your use case.

temperaturesamplingtop-ptop-k
How to Pin Model Output Format Using Few-Shot Examples

Learn why few-shot examples in the messages array beat temperature tweaks for fixing format, locale, and edge-case failures in LLM output.

few-shot promptingClaudeprompt engineeringoutput format
How Claude Works: A 5-Layer Mental Model for Developers

Learn the 5-layer Claude Stack — model, Messages API, MCP tools, agent loop, and surfaces — and how they compose into real AI-powered apps.

ClaudeMCPAgentsClaude Code
Building a Multi-Agent Research System with Isolated Contexts

Learn how to build a multi-agent research pipeline with a coordinator, scoped sub-agents, and provenance-preserving synthesis — without context overflow.

multi-agentClaudecontext managementresearch pipeline
Claude Architecture Explained: 5 Layers and 2 Cross-Cutting Planes

A complete recap of the Claude stack: model, protocol, reach, orchestration, and surfaces — plus the two planes that cut through every layer.

Claudeagent architectureMCPorchestration
How Claude Token Billing Works: Input, Output, and Cache Costs

Learn how Claude breaks API costs into input, output, and cached tokens, why output tokens cost 5x more, and how to fix the three most common cost leaks in agents.

tokenscost optimizationClaude APIcaching
Understanding stop_reason in the Claude Messages API

Learn how stop_reason controls agent loop branching in the Claude API — end_turn, tool_use, max_tokens, pause_turn, and more explained.

ClaudeMessages APIstop_reasonagent loop
What Is MCP (Model Context Protocol) and How It Works

Learn how MCP reduces AI tool integrations from N×M to N+M by giving every client and tool a single shared protocol to speak.

MCPClaudeAI toolstool use