Ivan Dimov

Write

The RAG Triad in 2026: Testing with LLM & DeepEval

Feb 8, 20265 min read

The $47k Loop: Why Your AI Agent Needs a Circuit Breaker

Feb 6, 20264 min read

The Death of the Flaky Test: Why I Stopped Writing Scripts and Started Architecting Agents

It’s 2026. If you are still manually updating CSS selectors because a div moved three pixels to the left, you are doing it wrong. For the last decade, we’ve been stuck in a loop of "write, break, fix,

Feb 13, 20266 min read

Why Your RAG App Is Slow (and how to prove it)

Feb 5, 20264 min read

The Evaluation Bottleneck: Building a "Golden Dataset" Without Losing Your Mind

Feb 4, 20264 min read

Stop Counting Words: The "Token" Mindset in LLM Engineering

If you are coming from traditional software engineering, your first month working with Large Language Models (LLMs) probably involved a few rude awakenings. Maybe you tried to paste a 50-page PDF into a prompt and watched the API request fail. Maybe ...

Jan 30, 20264 min read

QA’s New Frontier - Trust as a Quality Metric

6. Beyond Assert True: Why Trust is the Only Metric That Matters in LLM QA

Jan 21, 20264 min read

Contain the Damage

5. Prevention & Mitigation Strategies

Jan 14, 20265 min read

The Automated Confidentiality Tripwire

4. Integrating Leakage Detection into the CI/CD Pipeline

Dec 19, 20257 min read

Inside the LLM Leak

3. The Hidden Ways LLMs Accidentally Expose Your Data

Nov 24, 20254 min read

Adversarial Prompt Testing

2. How to Think Like an Attacker and Find These Flaws Before They Find You

Nov 19, 20257 min read

Command Palette

Latest articles