Testing LLM Agents Like Software – Behaviour Driven Evals of AI Systems 19 points by PranoyP 4 hours ago 9 comments story