MLOps.community

Large Language Models in Production Round-table Conversation

Mar 23, 2023

57:47

Creator website

Highlights

AI Chapters

Episode notes

How to Make Probabilistic Workflows Feel More Deterministic

02:14

The Benefits of Off-the-Shelf Models

02:36

How to Integrate Large Language Models Into Your Product

02:34

Introduction

2min

Introduction to Large Language Models

2min

What Is a Large Language Model?

2min

The History of Transfer Learning

3min

The Importance of Large Language Models

2min

How HANA Can Help You Make Better Decisions

2min

How to Integrate Large Language Models Into Your Product

3min

The Open Source Movement

2min

The Benefits of Off-the-Shelf Models

3min

10.

How to Use OpenAI in Production

4min

11.

The Engineering Gap in ML

2min

12.

The Trade-Offs of Learning ML to Do ML

2min

13.

Building Machine Learning Powered Applications

2min

14.

The Future of LLM in Production

2min

15.

The Cost, Quantity and Latency Triangle in Software Development

2min

16.

The Importance of Re-Architecting Production

2min

17.

The Importance of Engineering Skills in MLP

3min

18.

The Moore's Law Approach to Large Models

2min

19.

The Divergence of Real-Time Use Cases

2min

20.

The Cost of Latency in a Condensed Version Event

3min

21.

How to Optimize for Cost in Meta Scale

2min

22.

How to Avert Cost in ML Projects

2min

23.

The Importance of Trust in Language Models

2min

24.

How to Make Probabilistic Workflows Feel More Deterministic

2min

25.

The Benefits of a Constant Look Up for Databases

3min

LLM in Production Round Table with Demetrios Brinkmann, Diego Oppenheimer, David Hershey, Hannes Hapke, James Richards, and Rebecca Qian. // Abstract Using LLM in production. That's right. Hype or here to stay? The conversation answers some of the questions that have been asked by our community members like; performance & cost of production, the difference in architectures, Reliability issues, and a bunch of random tangents. We have some heavy hitters for this event! // MLOps Jobs board https://mlops.pallet.xyz/jobs // MLOps Swag/Merch https://mlops-community.myshopify.com/ // Related Links LLM in Production survey: https://docs.google.com/forms/d/e/1FAIpQLSerEryK4xHEZTq0hSu-sVmBHilOzaT71BfCQgXe_uIRgIah-g/viewform Virtual LLMs in Production Conference registration: https://home.mlops.community/public/events/llms-in-production-conference-2023-04-13 Chinchilla papers: https://paperswithcode.com/method/chinchilla, https://arxiv.org/abs/2203.15556 --------------- ✌️Connect With Us ✌️ ------------- Join our slack community: https://go.mlops.community/slack Follow us on Twitter: @mlopscommunity Sign up for the next meetup: https://go.mlops.community/register Catch all episodes, blogs, newsletters, and more: https://mlops.community/ Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/ Connect with Diego on LinkedIn: https://www.linkedin.com/in/diego/ Connect with David on LinkedIn: https://www.linkedin.com/in/david-hershey-458ab081/ Connect with Hannes on LinkedIn: https://www.linkedin.com/in/hanneshapke/ Connect with James on LinkedIn: https://www.linkedin.com/in/james-richards-4baa73a7/ Connect with Rebecca on LinkedIn: https://www.linkedin.com/in/rebeccaqian/ Timestamps: [00:00] Round table success to Virtual LLM in Production Conference on April 13th! [00:18] Register for the Virtual LLM in Production Conference now! [00:44] LLM in Production survey [01:40] Lightning round of introduction of speakers [04:34] Large Language Models definition [09:17] What do we consider large? [10:35] Thought process in use cases production [14:30] LLM open source huge movements [16:50] Problems qualifications [19:25] Production use cases frameworks directions [25:25] Open-source language models tokenizer [26:25] Language models democratization [29:25] Three categories for LLMs in Production [31:22] Latency at 2 levels [33:27] Defining production [34:57] Hitting the latency problems [38:20] Fundamental latency barrier [40:39] Latency use case requirement [44:25] Costs and the use cases [48:12] Product management involvement in costing [49:38] LLMs Hallucination definition [52:05] Building deterministic systems trust [55:21] Wrap up