
April 29, 2026 · 16 min read · guides
Run Claude Code on a local LLM via ANTHROPIC_BASE_URL. Native Anthropic endpoints for Ollama, LM Studio, llama.cpp, vLLM. 32K context floor.

April 5, 2026 · 16 min read · blog
Complete self-hosted LLM Kubernetes guide. Deploy vLLM on GPU nodes with manifests, HPA, monitoring, and cost modeling. Practitioner notes included. Download the free AI Automation Checklist.