Why it matters
AI Engineer session on Engineering Better Evals: Scalable LLM Evaluation Pipelines That Work, presented by Dat Ngo, Aman Khan, Arize. It adds practical context for how teams are building and operating AI systems in production.
My takeaway: Useful for AI engineering because it connects model behavior to developer workflow, prompting patterns, and application architecture.