Vectorless RAG
If you’ve built anything with LLMs in the past couple of years, you’ve probably wired up a Retrieval-Augmented Generation (RAG) pipeline. The playbook is burned into our brains: take a PDF, smash it i
Search for a command to run...
If you’ve built anything with LLMs in the past couple of years, you’ve probably wired up a Retrieval-Augmented Generation (RAG) pipeline. The playbook is burned into our brains: take a PDF, smash it i
I’ve been thinking a lot recently about the "chunking problem" in Retrieval-Augmented Generation. If you've played around with the llm CLI tool or built anything with vector embeddings, you've probabl
If you’ve spent any time building Retrieval-Augmented Generation (RAG) prototypes, you inevitably hit the exact same wall. You wire up a great embedding model, point it at an excellent local LLM, and
In an article I recently co-authored, we argued that a fundamental shift is underway in product design. The traditional principles of User Experience (UX), which rest on direct user control and manipulation, are becoming obsolete with the rise of tru...
For years, evaluating traditional machine learning models, while never simple, followed a well-trodden path. Your team knew the drill: assemble a labeled dataset, define success with metrics like precision and recall, and track performance. The core ...
For the past few years, the rise of large language models (LLMs) has fueled a growing industry of so-called "prompt experts"—people who claim to have mastered the art of crafting precise instructions to extract the best results from AI. But as LLMs b...
Machine learning engineers transitioning from experimental models to production systems can significantly benefit from adopting principles established in Site Reliability Engineering (SRE). By integrating SRE practices, ML engineers can build systems...
February 2 marks the first compliance deadline for the EU’s AI Act, the groundbreaking regulatory framework that officially took effect last August. This legislation sets a global precedent in defining clear boundaries for the development and deploym...
In the fast-paced world of digital advertising, it’s tempting to focus on the metric that’s easiest to measure: Click-Through Rate (CTR). After all, clicks provide immediate feedback, making it seem like a straightforward indicator of campaign perfor...