<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Production-Ai on René Zander | AI Automation Consultant</title><link>https://renezander.com/tags/production-ai/</link><description>Recent content in Production-Ai on René Zander | AI Automation Consultant</description><generator>Hugo</generator><language>en</language><lastBuildDate>Thu, 11 Jun 2026 10:00:00 +0000</lastBuildDate><atom:link href="https://renezander.com/tags/production-ai/index.xml" rel="self" type="application/rss+xml"/><item><title>AI Agent Model Evaluation: 5 Tests Before the Night Shift</title><link>https://renezander.com/blog/ai-agent-model-evaluation/</link><pubDate>Thu, 11 Jun 2026 10:00:00 +0000</pubDate><guid>https://renezander.com/blog/ai-agent-model-evaluation/</guid><description>&lt;p>A model upgrade used to be good news for anyone running agents overnight.&lt;/p>
&lt;p>Now the next model arrives before the last one has finished probation.&lt;/p>
&lt;p>Anthropic released &lt;a href="https://www.anthropic.com/news/claude-opus-4-8">Opus 4.8 on May 28&lt;/a>. Twelve days later, &lt;a href="https://www.anthropic.com/news/claude-fable-5-mythos-5">Fable 5 arrived&lt;/a> with longer autonomous runs and another page of benchmark wins.&lt;/p>
&lt;p>In between, GitHub made cloud agents &lt;a href="https://github.blog/changelog/2026-06-02-schedule-and-automate-tasks-with-copilot-cloud-agent/">wake up on schedules and repository events&lt;/a>, then exposed &lt;a href="https://github.blog/changelog/2026-06-04-agent-tasks-rest-api-now-available-for-copilot-pro-pro-and-max/">agent tasks through a REST API&lt;/a>.&lt;/p>
&lt;p>The night shift is getting easier to hire.&lt;/p></description></item><item><title>Your 50th Skill Makes the First 49 Less Reliable</title><link>https://renezander.com/blog/skill-library-discovery-ceiling/</link><pubDate>Wed, 27 May 2026 07:00:00 +0000</pubDate><guid>https://renezander.com/blog/skill-library-discovery-ceiling/</guid><description>&lt;p>You added a skill last Tuesday. The agent hasn&amp;rsquo;t called it once. Each new skill silently weakens the discovery odds of the ones you already have.&lt;/p>
&lt;p>You assume it&amp;rsquo;s a description problem. It isn&amp;rsquo;t.&lt;/p>
&lt;p>Everyone&amp;rsquo;s pushing past 50 skills now. Vercel ships a plugin with 40. Google&amp;rsquo;s &lt;code>gws&lt;/code> brings 95. I do it too. My local registry is at 38.&lt;/p>
&lt;p>Each one I added lowered the odds that the others get reached when I need them.&lt;/p></description></item></channel></rss>