Archive

ai-agents · llm · production-systems

My AI Agent Passed Every Check. 67% of It Was Wrong.

My AI agent's report passed every validation check. 67% of its facts were wrong. Here are 3 detection layers I built ...

March 28, 2026

ai-agents · llm · production-systems

My AI agent got 10 of 15 financial claims wrong using web search. Switching to structured APIs fixed every one. Here'...

March 21, 2026

ai-agents · llm · production-systems

Local models got 10 of 15 financial claims wrong, even with web search. The root cause: search returns headlines, not...

March 14, 2026

ai-agents · llm · production-systems

A practical playbook for building AI agents in production. Architecture, error handling, monitoring, security, and co...

March 11, 2026

ai-agents · llm · observability

Four monitoring layers I added to my AI agent after a 709K-character runaway slipped past every health check. Here's ...

March 07, 2026

ai-agents · llm · production-systems · python

Unit tests aren't enough for AI agents. Learn the three testing layers that prevent hallucinations and tool failures ...

February 28, 2026

mcp · security · python · ai-agents

SSRF, prompt injection, path traversal: I audited my MCP server (3,000+ downloads) and found 8 vulnerabilities. Every...

February 25, 2026

aws · serverless · cost-optimization

10 practical tips to cut your AWS Lambda bill, from hidden CloudWatch costs to memory right-sizing, Graviton2, and kn...

February 22, 2026

mcp · python · ai-agents

Notes MCP server in Python with FastMCP 3.0: tools, resources, prompts, SQLite persistence, and in-process testing. U...

February 22, 2026