713: Llama 2, Toolformer and BLOOM: Open-Source LLMs with Meta's Dr. Thomas Scialom

Super Data Science: ML & AI Podcast with Jon Krohn

CHAPTER

Evolution of Pre-trained Models and Reinforcement Learning from Human Feedback

The chapter explores the development of pre-trained models like GPT-3 and emphasizes that diverse instructions and datasets are important for model performance. It also covers the two-stage Reinforcement Learning from Human Feedback (RLHF) process and how fine-tuning on human preferences can achieve superhuman performance in creative tasks.
