

Evaluating LLMs with Leva
12 snips Aug 26, 2025
Kieran Klaassen, a Ruby developer and the mind behind the AI tools Cora and Leva, shares his passion for AI and Rails. He dives into the creation of the Leva gem, designed for evaluating large language models, and discusses his journey in AI product development. The conversation covers best practices in AI tool creation, the importance of continuous evaluation, and effective workflow management. Kieran highlights the collaborative spirit of the Ruby community and the joy found in integrating AI into Ruby on Rails projects.
AI Snips
Chapters
Transcript
Episode notes
Built Leva From A Real Need
- Kieran built Leva because he needed evaluations for his email product and found nothing suitable in Ruby.
- He often creates gems to solve his own needs quickly and then shares them publicly.
Design Gems Starting With The README
- Start gem design by writing your ideal README to clarify the API and use cases before coding.
- Choose abstractions you like and build for yourself so the gem stays modular and practical.
Keep Evals Close To Production Data
- Running evaluations inside your stack solves privacy and audit constraints that external services can't.
- Proximity to production data reduces friction for debugging and prompt iteration.