Latent Space: The AI Engineer Podcast

Mapping the future of *truly* Open Models and Training Dolly for $30 — with Mike Conover of Databricks

Apr 29, 2023
Mike Conover, a Staff Software Engineer at Databricks with a PhD in Complex Systems, leads the charge for truly open-source AI models. He reveals how his team developed Dolly, a customizable LLM that can be trained for just $30. The conversation dives deep into the evolution of AI, comparing it to biological concepts, and explores innovative training methods. Conover also addresses the practical applications of generative AI, emphasizing the importance of blending creativity with business utility. Don't miss insights into AI's potential across various domains!
AI Snips
INSIGHT

Response Length Correlation

  • The length of the responses in the training data influences the length of the generated output.
  • Longer, narrative answers in the training data produce longer, more detailed model responses.
INSIGHT

Emergent Behavior

  • Instruction following was not a designed feature of the base model.
  • It emerged through perturbation, which suggests that other capabilities remain undiscovered.
ADVICE

Targeted Instruction Tuning

  • Businesses should focus on instruction tuning data relevant to their needs.
  • Avoid generic data, such as prompts for writing love letters, and prioritize data with actionable business value.