Published in Towards AI: Fat Context of RAG Drives Inference Cost Sky-high. Here’s How to Save Big on API Calls. Try this before you dump your RAG prototype. 1d ago
Published in AI Advances: How to Evaluate AI Summaries? The easiest and cheapest solution to test AI-generated summaries at scale. 3d ago
Published in Level Up Coding: Powerful Alternatives You Can’t Replace With LLM-As-A-Judge. AI evaluating an AI is fast and possibly the only solution for large-scale apps, but these aren’t going anywhere. 4d ago, 1 response
Published in AI Advances: How AI Engineers Write Evaluation Prompts. Why are built-in prompts of evaluation frameworks off the mark & how to write your own. Jun 15
Published in Level Up Coding: How to Get ChatGPT Cut the Fluff and Respond Clearly. I no longer have my answers buried under excessive text. Jun 11, 1 response
Published in Level Up Coding: Why AI-Written SQLs Are (Mostly) Disasters. How to get around? And when to use them? Jun 4, 1 response
Published in AI Advances: How Do I Evaluate Chunking Strategies For RAGs. Don’t guess; here’s how to systematically approach it. May 27, 1 response
Published in Level Up Coding: The Need For Speed In LLMs. Have more GPU memory but want more speed? Try this. No need to downgrade to a smaller or quantized model. May 21
Published in Level Up Coding: Build With Qwen 3, MCP, and a Free GPU. You can have them all in a single Notebook. May 19