Blog

Practical writing on AI engineering, infrastructure, backend systems, and production lessons learned.

Reading Series

View series →

Series

Claude Code: Source Reading Series

A working engineer's read through every subsystem in the Claude Code source.

18 posts

Explore →

Curated migration review batch

Spotlighted legacy posts, rewritten with the new MDX component system

This batch highlights a few older notes that were worth preserving and upgrading, not just importing. The rest of the archive remains available below.

Curated

April 1, 2026

4 min read

Claude Code: Source Reading Series — Start Here

The entry point for the Claude Code source-reading series. Five layers, sixteen posts. Start here to navigate the series in order.

ai-agents claude-code architecture series-index runtime

Read curated article →

Curated

March 31, 2026

4 min read

Building an Agent Client from Claude Code Patterns

What I extracted from the learning-claude-code codebase about sessions, permissions, plans, subagents, and remote transport when building a serious agent client.

ai-agents agent-client tooling architecture systems-design

Read curated article →

Curated

March 6, 2025

4 min read

OIDC for CI/CD: Replacing Long-Lived Cloud Credentials with Workload Identity

A practical migration guide for using OIDC in CI/CD so pipelines can assume cloud roles without storing long-lived secrets.

authentication oidc ci-cd aws security

Read curated article →

Curated

February 18, 2025

9 min read

How You Actually Get a Google OAuth Refresh Token for Chrome Web Store Automation

A precise walkthrough of the Google OAuth path to a refresh token: why you must start with a client_id, why Google gives you an authorization code first, why localhost redirects are common, and how that turns into Chrome Web Store automation.

oauth chrome-extension ci-cd infrastructure api-design

Read curated article →

Curated

October 31, 2024

4 min read

Redis Distributed Locks: What They Solve, Where They Break, and How to Use Them Safely

A pragmatic guide to Redis-based distributed locks for high-concurrency systems, including ownership, expiry, contention, and when a lock should be replaced by a better architecture.

caching redis distributed-lock high-concurrency system-design

Read curated article →

Curated

April 3, 2024

4 min read

Kubernetes vs ECS: Choosing the Right Control Plane for Real Teams

A practical comparison of Kubernetes and Amazon ECS focused on platform ownership, operational complexity, and when each option is the better bet.

kubernetes ecs orchestration eks platform-engineering

Read curated article →

Latest from the archive

Freshly published writing and the broader imported archive continue to live here.

Latest

Claude Code Observability: Dual-Sink Analytics, OTEL Spans, Perfetto Traces

April 2, 2026 • 3 min read

How dual-sink event routing, compile-time PII enforcement, full-turn OTEL instrumentation, and Perfetto/Chrome Trace export give Claude Code production-grade observability from a local-first runtime.

ai-agents claude-code observability analytics opentelemetry tracing

Read article →

Archive

Claude Code Architecture: Five Principles for Building Agent Runtimes

Claude Code Input Pipeline: Three Paths Before the Model Runs

Claude Code LSP Integration: Persistent Servers, Push-Based Diagnostics

Claude Code MCP Assembly: Seven Config Sources, Three Capability Surfaces

Claude Code Memory System: Five Layers from Injection to Consolidation

Claude Code REPL: 5000 Lines Binding Every Subsystem

Claude Code Tool Execution: Scheduling, Streaming, and Governance

Claude Code Multi-Agent Runtime: Coordinator, Three Backends, Disjoint Ownership

Claude Code Task Runtime: Background Work as First-Class Objects

Claude Code Subagent Runtime: Context Isolation and State Sharing

Claude Code Tool Internals: BashTool, FileEditTool, AgentTool, Compact

Claude Code UI Runtime: External Store, Custom Ink Pipeline, Session Control Plane

Claude Code Command System: Six Sources, One Registry

Claude Code Permission Engine: Seven Steps, Bypass-Immune Gates

Claude Code Query Loop: The Recoverable Turn Engine

Claude Code Boot Sequence: How a CLI Becomes a Runtime Host

Claude Code Tool System: 30+ Methods, Fail-Closed Defaults

EKS in Production: Operator Patterns and the Path to Writing Your Own

GPU Inference Pipeline: A Visual Guide to Serving LLMs at Scale

High-Concurrency System Design: The Tradeoffs Nobody Warns You About

Building Production RAG + MCP Agents: A Practical Architecture Guide

Hello World

Bearer Tokens: What 'Bearer' Means and What It Costs You

Terraform jsonencode: Fixing List Interpolation Errors in IAM Policies