Blog
Engineering notes on AI agents, automation, and the infrastructure behind them.

June 7, 2026 · 4 min read
The voice AI that pays back for a small business is narrow and boring: after-hours triage, booking, order status. Here is what ships and what to skip.

May 30, 2026 · 5 min read
Turn your task manager into agent memory: an adapter-not-migration CLI and MCP server with hybrid RRF retrieval over TickTick or an Obsidian vault.

May 9, 2026 · 10 min read
Pinecone vs RunPod for vector search: managed serverless against self-hosted Qdrant on rented GPU. Break-even math, latency, and the hybrid most teams ship.

May 8, 2026 · 5 min read
Pinecone and RunPod live at different RAG-stack layers. Vector DB vs GPU inference, real cost math, when self-hosting Qdrant beats Pinecone.

May 7, 2026 · 7 min read
Automate YouTube Shorts end-to-end: pick segments, write hooks with Claude, build CapCut drafts via CLI. Open-source pipeline + my paid blueprint.

April 29, 2026 · 16 min read
Run Claude Code on a local LLM via ANTHROPIC_BASE_URL. Native Anthropic endpoints for Ollama, LM Studio, llama.cpp, vLLM. 32K context floor.

April 17, 2026 · 13 min read
How to choose an LLM for production workloads. 7 selection criteria, a decision tree, an evaluation process, and a requirements checklist from real deployments. Download the free AI Automation Checklist.

April 16, 2026 · 15 min read
Self-hosted LLM vs API cost analysis with break-even math. When to self-host, when to stay on Claude, and the hybrid pattern most production teams actually use.

April 15, 2026 · 18 min read
An opinionated LLM API comparison for production. Claude vs GPT vs Gemini vs Mistral vs DeepSeek on features, developer experience, reliability, and fit.

April 14, 2026 · 11 min read
Zapier (per-task), Make (per-operation), n8n (per-execution): the same workflow priced at 10k / 100k / 1M runs. Where each wins, where each breaks.

April 12, 2026 · 17 min read
Claude vs ChatGPT for developers in 2026. Chat, CLI, IDE, and API compared by a practitioner running ten agents in production.

April 11, 2026 · 12 min read
LLM API cost comparison for 2026. Model your real workload costs with prompt caching, output tokens, reasoning, and batch API factored in.