Tagged: Ai

21 posts

AI Agent Best Practices: Trust Your Own Results Before Google

AI Agent Best Practices: Trust Your Own Results Before Google

June 16, 2026 · 3 min read · blog
Your AI agent reaches for googled best practices before your own proven fixes. Wire a trust order into your CLAUDE.md and agent loop instead.
Why AI Coding Agents Skip Your Definition of Done

Why AI Coding Agents Skip Your Definition of Done

June 16, 2026 · 4 min read · blog
AI coding agents agree to your process, then skip it. Why review can't catch it, and the one fix that works: a deterministic finish-line gate.
Your Task Manager Is the Best Agent Memory You're Not Using

Your Task Manager Is the Best Agent Memory You're Not Using

June 13, 2026 · 4 min read · blog
Agent memory without a new vector DB. Your task app is years of curated, ranked context. ATS gives your agent a hybrid-retrieval channel into it.
Build the Harness Once With Your Best Model. Run It on a Cheap One.

Build the Harness Once With Your Best Model. Run It on a Cheap One.

June 3, 2026 · 4 min read · blog
Agents forget and good ones cost. The fix is not a better model. Put the goal in deterministic scripts and run a cheap model against them.
Most of Your AI Skills Will Rot. Here's Which Ones Compound.

Most of Your AI Skills Will Rot. Here's Which Ones Compound.

June 3, 2026 · 4 min read · blog
A skill's lifespan is set by what it couples to, not how good the prompt is. Why most AI skills rot, which parts compound, and how to tell.
Claude Code Stops Following Your CLAUDE.md: Read-Once Rules and Hooks

Claude Code Stops Following Your CLAUDE.md: Read-Once Rules and Hooks

June 2, 2026 · 4 min read · blog
Claude Code reads your CLAUDE.md once at startup, so rules decay as the session fills up. Move the ones that must never break into hooks.
Claude Opus 4.8 Is Out. The Number I Care About Isn't on the Benchmark Chart.

Claude Opus 4.8 Is Out. The Number I Care About Isn't on the Benchmark Chart.

May 29, 2026 · 3 min read · blog
Opus 4.8 shipped May 28. For unattended cron agents, the upgrades that matter are not the benchmark scores. A use-case breakdown from real builds.
Splitting Grounding from Reasoning in Browser-Agent Stacks

Splitting Grounding from Reasoning in Browser-Agent Stacks

May 19, 2026 · 4 min read · blog
Browser-agent stacks bundle grounding and reasoning. A local 2B parser splits them, beats GPT-4o on ScreenSpot-v2 by 2.5x, costs $4 to train.
Context Engineering Is Just File Naming

Context Engineering Is Just File Naming

May 12, 2026 · 4 min read · blog
Context engineering sounds new. It is the file-naming hygiene developers always had, load-bearing now because LLMs read what you point them at.
Your AI Workflow Doesn't Need Better Prompts. It Needs Less AI.

Your AI Workflow Doesn't Need Better Prompts. It Needs Less AI.

May 5, 2026 · 9 min read · blog
Prompting is discovery. Skills are repetition. Gates are how AI workflows become reliable.
What Anthropic's April 23 Postmortem Reveals About Your Agent Harness

What Anthropic's April 23 Postmortem Reveals About Your Agent Harness

April 30, 2026 · 3 min read · blog
Three bugs over two months, one usage-limit reset for every Pro subscriber. The postmortem reads like a free audit checklist for any production agent harness.
95% of PII Redaction Doesn't Need an LLM. The Other 5% Does.

95% of PII Redaction Doesn't Need an LLM. The Other 5% Does.

April 21, 2026 · 4 min read · blog
When to use deterministic masking and when a fine-tuned LLM earns its compute on SAP production data copies. A hybrid architecture.
What llama.cpp's Pace Tells You About On-Prem LLM Readiness

What llama.cpp's Pace Tells You About On-Prem LLM Readiness

April 14, 2026 · 4 min read · blog
Your team asked for GPU budget for self-hosted inference. You said not yet. The tooling moved, the org didn't, and the delay is costing you leverage you don't know you're losing.
Claude vs OpenAI for Business Automation: A 2026 Operator Verdict

Claude vs OpenAI for Business Automation: A 2026 Operator Verdict

April 11, 2026 · 6 min read · guides
Claude API vs OpenAI API on the work that matters: tool calling, reliability, pricing at volume, integration effort. A clear pick per use case.
Your AI Content Tool Knows Your Strategy. Do You?

Your AI Content Tool Knows Your Strategy. Do You?

April 7, 2026 · 5 min read · blog
Every prompt you send contains business context. Brand voice docs, competitive positioning, internal strategy. Most AI tools promise not to look at it. There is already technology that replaces trust with proof. Download the free AI Automation Checklist.
Spend Your Human Thinking Tokens Where They Compound

Spend Your Human Thinking Tokens Where They Compound

March 31, 2026 · 4 min read · blog
More automations running. More agents deployed. More pipelines humming. But the biggest improvement wasn't adding more automation — it was getting honest about where my thinking actually matters. Download the free AI Automation Checklist.
AI Skills Are the New Boilerplate: They Fix Nothing

AI Skills Are the New Boilerplate: They Fix Nothing

March 24, 2026 · 4 min read · blog
Everyone shares skill libraries now. Why the 20-slash-command pitch and the 2-hours-saved claim miss what actually ships value in production. Book a free discovery call.
How I Built a Business Email Agent with Compliance Controls in Go

How I Built a Business Email Agent with Compliance Controls in Go

March 21, 2026 · 6 min read · blog
How to build a production email agent in Go with compliance controls. Why off-the-shelf AI email tools fail for business operations, and what to do instead. Download the free AI Automation Checklist.
Your Vector Database Decision Is Simpler Than You Think

Your Vector Database Decision Is Simpler Than You Think

March 17, 2026 · 4 min read · blog
Choosing a vector database is simpler than benchmarks suggest. The three constraints that actually decide it, from someone running semantic search on a single VPS. Download the free AI Automation Checklist.
I Run 10 AI Agents in Production. They're All Bash Scripts.

I Run 10 AI Agents in Production. They're All Bash Scripts.

March 12, 2026 · 5 min read · blog
A week ago I wrote about shipping AI agents the right way. That piece was about the harness: quality... Download the free AI Automation Checklist.
Lots Of People Are Demoing AI Agents. Almost Nobody's Shipping Them The Right Way.

Lots Of People Are Demoing AI Agents. Almost Nobody's Shipping Them The Right Way.

March 4, 2026 · 7 min read · blog
Lots of people are demoing AI agents. Almost nobody's shipping them the right way. Conference stages... Download the free AI Automation Checklist.