Tagged: Llm

7 posts

German PII Redactor: Covering the 5% Blind Spot in SAP Data Masking

German PII Redactor: Covering the 5% Blind Spot in SAP Data Masking

April 21, 2026 · 5 min read · case-studies
Self-hosted German PII redactor for SAP prod→dev copies. Plugs in after TDMS/Delphix/Informatica to cover free-text NOTES columns, unclassified Z-tables, and OCR'd attachments. DSGVO-konform, Apache 2.0, runs on a single consumer GPU.
How to Choose an LLM for Production: 7 Criteria That Matter

How to Choose an LLM for Production: 7 Criteria That Matter

April 17, 2026 · 13 min read · guides
How to choose an LLM for production workloads. 7 selection criteria, a decision tree, an evaluation process, and a requirements checklist from real deployments. Download the free AI Automation Checklist.
Self-Hosted LLM vs API Cost: Break-Even Analysis (2026)

Self-Hosted LLM vs API Cost: Break-Even Analysis (2026)

April 16, 2026 · 15 min read · guides
Self-hosted LLM vs API cost analysis with break-even math. When to self-host, when to stay on Claude, and the hybrid pattern most production teams actually use. Download the free AI Automation Checklist.
LLM API Comparison 2026: Best API for Production

LLM API Comparison 2026: Best API for Production

April 15, 2026 · 18 min read · guides
An opinionated LLM API comparison for production. Claude vs GPT vs Gemini vs Mistral vs DeepSeek on features, developer experience, reliability, and fit. Download the free AI Automation Checklist.
LLM API Cost Comparison 2026: Framework, Not a Stale Table

LLM API Cost Comparison 2026: Framework, Not a Stale Table

April 11, 2026 · 12 min read · guides
LLM API cost comparison for 2026. Model your real workload costs with prompt caching, output tokens, reasoning, and batch API factored in. Download the free AI Automation Checklist.
Self-Hosted LLM on Kubernetes: A Production vLLM Deployment

Self-Hosted LLM on Kubernetes: A Production vLLM Deployment

April 5, 2026 · 16 min read · blog
Complete self-hosted LLM Kubernetes guide. Deploy vLLM on GPU nodes with manifests, HPA, monitoring, and cost modeling. Practitioner notes included. Download the free AI Automation Checklist.
RAG Pipeline Tutorial: Build a Production Document Q&A System with Qdrant and Claude

RAG Pipeline Tutorial: Build a Production Document Q&A System with Qdrant and Claude

April 1, 2026 · 16 min read · blog
End-to-end RAG pipeline tutorial. Qdrant + Claude Sonnet 4.6 + local embeddings. Real code for chunking, retrieval, augmentation, and citation-grounded answers. Download the free AI Automation Checklist.