Software Engineering Daily

Scaling Large ML Models to Small Devices with Atila Orhon

May 7, 2024

00:00

Snipd AI

Atila Orhon, expert in scaling large ML models for small devices, discusses challenges running models on phones and laptops. Argmax, his startup, focuses on solutions for this. Orhon shares insights from Apple and NVIDIA, optimizing ML models, and more.

AI Summary

Highlights

AI Chapters

Episode notes

Podcast summary created with Snipd AI

Quick takeaways

Argmax focuses on optimizing large ML models for inference on phones and laptops.

Attila Orhon transitioned from academia to industry, emphasizing computer vision and ML technologies.

Techniques like compression and precomputation are crucial for efficient on-device ML model deployment.

Deep dives

ARG Max: Innovating Large Model Deployment on Commodity Hardware

ARG Max, a startup founded in 2023 by Attila Orhan, focuses on developing methods to run large ML models on non-dedicated hardware like phones and laptops. They observed that the largest models are growing, while commercially relevant smaller models are shrinking. The company received funding from General Catalyst and industry leaders, aiming to optimize ML models for efficient inference on devices.

Adopting New Technology: Understanding the Evolution of Commercial Models

02:00

Introduction

2min

Machine Learning at Apple: Privacy, Open-Source, and Commercial Potential

10min

Exploring the Use of On-Device Machine Learning Models for Code Autocomplete

5min

Challenges of Implementing ML Models on Small Devices vs. the Cloud

2min

Challenges in Adapting ML Models to Various Devices and Platforms

2min

Deploying Large Machine Learning Models on Small Devices

26min

Engaging Developers Through Open Source Projects

9min

The size of ML models is growing into the many billions of parameters. This poses a challenge for running inference on non-dedicated hardware like phones and laptops.

Argmax is a startup focused on developing methods to run large models on commodity hardware. A key observation behind their strategy is that the largest models are getting larger, but the smallest models that are commercially relevant are getting smaller. The company was started in 2023 and has raised money from General Catalyst and other industry leaders.

Atila Orhon is the founder of Argmax and he previously worked at Apple and NVIDIA. He joins the show to talk about working in computer vision, building ML tooling at Apple, optimizing ML models, and more.

Sean’s been an academic, startup founder, and Googler. He has published works covering a wide range of topics from information visualization to quantum computing. Currently, Sean is Head of Marketing and Developer Relations at Skyflow and host of the podcast Partially Redacted, a podcast about privacy and security engineering. You can connect with Sean on Twitter @seanfalconer.

Please click here to see the transcript of this episode.

Sponsorship inquiries: sponsor@softwareengineeringdaily.com

The post Scaling Large ML Models to Small Devices with Atila Orhon appeared first on Software Engineering Daily.

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Software Engineering Daily

Scaling Large ML Models to Small Devices with Atila Orhon

Podcast summary created with Snipd AI

Quick takeaways

Deep dives

ARG Max: Innovating Large Model Deployment on Commodity Hardware

Attila Orhan's Transition from Apple to Founding ARG Max

Challenges in Deploying Models on Device and Strategies for Efficiency

Whisper Kit: Enabling Real-Time Transcription Applications

Community Engagement and Future Opportunities in ML Development

Adopting New Technology: Understanding the Evolution of Commercial Models

Get the Snipd
podcast app

AI-powered
podcast player

Discover
highlights

Save any
moment

Share
& Export

AI-powered
podcast player

Discover
highlights

Software Engineering Daily

Scaling Large ML Models to Small Devices with Atila Orhon

Podcast summary created with Snipd AI

Quick takeaways

Deep dives

ARG Max: Innovating Large Model Deployment on Commodity Hardware

Attila Orhan's Transition from Apple to Founding ARG Max

Challenges in Deploying Models on Device and Strategies for Efficiency

Whisper Kit: Enabling Real-Time Transcription Applications

Community Engagement and Future Opportunities in ML Development

Adopting New Technology: Understanding the Evolution of Commercial Models

Get the Snipdpodcast app

AI-poweredpodcast player

Discoverhighlights

Save anymoment

Share& Export

AI-poweredpodcast player

Discoverhighlights

Get the Snipd
podcast app

AI-powered
podcast player

Discover
highlights

Save any
moment

Share
& Export

AI-powered
podcast player

Discover
highlights