Your AI Application Needs Evals: Evaluation-driven development in the era of prompts

Presentation byUras Mutlu

This talk introduces a crucial but often overlooked aspect of AI application development: evaluation-driven development (EDD). Using a simple LangGraph agent as a practical example, we'll demonstrate why and how to build a robust evaluation framework that goes beyond simple unit tests. We'll explore the importance of continuous evaluation during the development cycle and how this practice directly translates to the need for comprehensive observability in production, ensuring your AI application remains accurate, reliable, and effective in the real world.

Presented with these Guilds
Cover Photo for JavaScript Monthly London Meetup
Primary Photo for JavaScript Monthly London Meetup

JavaScript Monthly London Meetup

Javascript evolution has sped up (a lot) in recent years and event the most veterans developers find it hard to keep up with the latest trends. This meetup group aims to bring you monthly bite-sized updates on the world of Javascript along with a healthy dose of nice people, beer and pizza.

  • Please use your full name when registering, as some of our venues require a full list of attendees beforehand. You have an idea and you want to be a speaker?

We are always looking for more speakers - submit your talk here (https://docs.google.com/forms/d/e/1FAIpQLSdFaatfveOUbrmer47jYb5J4J4ttxAFc1CgTjUDltBXmDOJmg/viewform)

1.1K Members
Similar Presentations

Your AI Application Needs Evals: Evaluation-driven development in the era of prompts

Presentation byUras Mutlu

This talk introduces a crucial but often overlooked aspect of AI application development: evaluation-driven development (EDD). Using a simple LangGraph agent as a practical example, we'll demonstrate why and how to build a robust evaluation framework that goes beyond simple unit tests. We'll explore the importance of continuous evaluation during the development cycle and how this practice directly translates to the need for comprehensive observability in production, ensuring your AI application remains accurate, reliable, and effective in the real world.

Presented with these Guilds
Cover Photo for JavaScript Monthly London Meetup
Primary Photo for JavaScript Monthly London Meetup

JavaScript Monthly London Meetup

Javascript evolution has sped up (a lot) in recent years and event the most veterans developers find it hard to keep up with the latest trends. This meetup group aims to bring you monthly bite-sized updates on the world of Javascript along with a healthy dose of nice people, beer and pizza.

  • Please use your full name when registering, as some of our venues require a full list of attendees beforehand. You have an idea and you want to be a speaker?

We are always looking for more speakers - submit your talk here (https://docs.google.com/forms/d/e/1FAIpQLSdFaatfveOUbrmer47jYb5J4J4ttxAFc1CgTjUDltBXmDOJmg/viewform)

1.1K Members
Similar Presentations