The Power of Community in AI Development with Oumi

Mar 16, 2025

Emmanouil (Manos) Koukoumidis, CEO of Oumi and former Google Cloud AI tech lead, talks about fostering community in AI development. He stresses the need for open-source models to promote collaboration and accessibility, likening Oumi's vision to 'the Linux of AI.' Manos shares insights on navigating the overwhelming choices in AI models and the importance of engaging a community for innovation. He also addresses gaps in AI accessibility and the need for standardization to empower both researchers and enterprises in their AI journeys.

Ask episode

AI Snips

Chapters

Transcript

Episode notes

ANECDOTE

Oumi's Origin

Manos Koukoumidis left Google Cloud, where he led Palm (later Gemini), to found Oumi.
He saw the limitations of closed AI models and the need for open, accessible AI for broader use.

INSIGHT

Open Source AI Defined

Open-source AI requires open data, code, models, and weights, enabling reproduction and extension.
Oumi emphasizes open collaboration, making it easy and inclusive for community contributions.

INSIGHT

Community-Driven AI Development

Open-source AI thrives on contributions across the ecosystem, from data preprocessing to RL techniques.
Community members can test improvements at smaller scales before applying them to larger models.

Get the Snipd Podcast app to discover more snips from this episode

Get the app

Summary
In this episode of the AI Engineering Podcast Emmanouil (Manos) Koukoumidis, CEO of Oumi, about his vision for an open platform for building, evaluating, and deploying AI foundation models. Manos shares his journey from working on natural language AI services at Google Cloud to founding Oumi with a mission to advance open-source AI, emphasizing the importance of community collaboration and accessibility. He discusses the need for open-source models that are not constrained by proprietary APIs, highlights the role of Oumi in facilitating open collaboration, and touches on the complexities of model development, open data, and community-driven advancements in AI. He also explains how Oumi can be used throughout the entire lifecycle of AI model development, post-training, and deployment.

Announcements

Hello and welcome to the AI Engineering Podcast, your guide to the fast-moving world of building scalable and maintainable AI systems
Your host is Tobias Macey and today I'm interviewing Manos Koukoumidis about Oumi, an all-in-one production-ready open platform to build, evaluate, and deploy AI models

Interview

Introduction
How did you get involved in machine learning?
Can you describe what Oumi is and the story behind it?
There are numerous projects, both full suites and point solutions, focused on every aspect of "AI" development. What is the unique value that Oumi provides in this ecosystem?
You have stated the desire for Oumi to become the Linux of AI development. That is an ambitious goal and one that Linux itself didn't start with. What do you see as the biggest challenges that need addressing to reach a critical mass of adoption?
In the vein of "open source" AI, the most notable project that I'm aware of that fits the proper definition is the OLMO models from AI2. What lessons have you learned from their efforts that influence the ways that you think about your work on Oumi?
On the community building front, HuggingFace has been the main player. What do you see as the benefits and shortcomings of that platform in the context of your vision for open and collaborative AI?
Can you describe the overall design and architecture of Oumi?
- How did you approach the selection process for the different components that you are building on top of?
- What are the extension points that you have incorporated to allow for customization/evolution?
Some of the biggest barriers to entry for building foundation models are the cost and availability of hardware used for training, and the ability to collect and curate the data needed. How does Oumi help with addressing those challenges?
For someone who wants to build or contribute to an open source model, what does that process look like?
- How do you envision the community building/collaboration process?
Your overall goal is to build a foundation for the growth and well-being of truly open AI. How are you thinking about the sustainability of the project and the funding needed to grow and support the community?
What are the most interesting, innovative, or unexpected ways that you have seen Oumi used?
What are the most interesting, unexpected, or challenging lessons that you have learned while working on Oumi?
When is Oumi the wrong choice?
What do you have planned for the future of Oumi?

Contact Info

Parting Question

From your perspective, what are the biggest gaps in tooling, technology, or training for AI systems today?

Closing Announcements

Thank you for listening! Don't forget to check out our other shows. The Data Engineering Podcast covers the latest on modern data management. Podcast.__init__ covers the Python language, its community, and the innovative ways it is being used.
Visit the site to subscribe to the show, sign up for the mailing list, and read the show notes.
If you've learned something or tried out a project from the show then tell us about it! Email hosts@aiengineeringpodcast.com with your story.
To help other people find the show please leave a review on iTunes and tell your friends and co-workers.

Links

The intro and outro music is from Hitman's Lovesong feat. Paola Graziano by The Freak Fandango Orchestra/CC BY-SA 3.0