
03: The Next Generation of LLMs with Jonathan Frankle of MosaicML
Replit AI Podcast
The Importance of Formatting Your Data
I love doing this to compare and contrast data sets like Wikipedia is a great place to start. The pile was amazing, Wouldn't have been cool if the pile were always up to date. I'm very excited to be joining Databricks soon because one of the hardest things is building good tools to iterate over these massive data sets at scale. Now we've got your own non-edible open source project called Replit it's going to be available for everyone. You can claim that we got it done before we've started but as long as anybody who works on data knows it takes five times as longBecause every single LOM has been delayed by sheer mountain to climb when it comes
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.