Posts

Showing posts from January, 2026

LLM Observability

Image
  In the era of Generative AI, the old adage "if you can’t measure it, you can’t manage it" has taken on a complex new meaning. Traditional software monitoring-tracking if a server is up or if a database is slow-is no longer enough. When your application's core logic is a non-deterministic Large Language Model (LLM), "up" doesn't necessarily mean "working." LLM Model Observability is the practice of capturing the "why" behind model behavior, moving beyond surface-level health to understand the nuance of every prompt, retrieval, and generation. Beyond Monitoring: Why Observability? Traditional monitoring answers: Is the system broken? (e.g., 500 errors, high CPU). LLM observability answers: Is the system hallucinating? Why did it choose this tool? Why did costs spike yesterday? Because LLMs are "black boxes" that produce different outputs for the same input, you need a high-fidelity record of the internal state....

Seeing What’s Missing: Survivorship Bias in Data Science

Image
  What Is Survivorship Bias? Survivorship bias occurs when analyses only focus on entities that have "survived" a selection filter-such as successful companies, models, or observations-while neglecting failures or dropouts. This selective visibility leads to inflated conclusions and overly optimistic inferences, because the non-survivors (e.g. failed companies, unrecovered planes, dismissed data points) are missing from the data.                                           Historical Roots: The WWII Aircraft Example During WWII, analysts inspected returning bombers and saw bullet holes clustered in certain areas. Their instinct was to reinforce those damaged spots. But statistician Abraham Wald illustrated the fallacy: the undamaged areas on surviving planes were actually the most critical-planes hit there didn’t return. Reinforcing the undamaged zones improved aircra...