Roman Grebennikov
A principal ML engineer and an ex startup CTO working on modern search and recommendations problems. A pragmatic fan of open-source software, functional programming, LLMs and performance engineering.
Session
06-17
12:00
40min
How [not] to evaluate your RAG
Roman Grebennikov
How do you know if your RAG system is actually working? We’ll share a real-world case study on evaluating RAG in production—tackling messy data, chunking fails, and unexpected chatbot behavior—so you can measure quality with confidence.
Palais Atelier