Episode 48: How do the latest updates to large language models stack up against each other? Matt Wolfe (https://x.com/mreflow) and Nathan Lands (https://x.com/NathanLands) are joined by Matthew Berman (https://x.com/MatthewBerman), who specializes in deep dives into large language models and in testing their nuances.
In this episode, the trio discusses the recent releases of Grok 3, Claude 3.7, and GPT-4.5, analyzing their strengths, weaknesses, and unique features. Tune in to learn which model might be best for your needs, from coding and real-time information to creative writing and unbiased truth-seeking.
Check out The Next Wave YouTube Channel if you want to see Matt and Nathan on screen: https://lnk.to/thenextwavepd
—
Show Notes:
- (00:00) Exploring New AI Models
- (05:35) Inconsistent AI Code Performance
- (06:26) Redesigning Benchmarks for Modern Models
- (11:33) AI Bias Amplification on Social Media
- (15:11) AI Bias and Human Oversight
- (17:49) Claude 3.7: Improved Coding Abilities
- (20:30) Claude Update: Better Code, Worse Chat
- (23:19) Resistance to Switching IDE from VS Code
- (28:05) Video Producer App Preview
- (29:55) Showcasing Nvidia Digits Prototype
- (34:00) Grok Model's Distributed Training
- (36:31) Optimistic Perspective on Future Upgrades
- (40:59) Excited for GPT-5 Launch
- (42:08) Claude 3.7 Excels in Coding
—
Mentions:
Check out this episode on YouTube: https://www.youtube.com/watch?v=pWXT8NZFG_Y
Get the guide to build your own Custom GPT: https://clickhubspot.com/tnw
—
Check Out Matt’s Stuff:
• Future Tools - https://futuretools.beehiiv.com/
• Blog - https://www.mattwolfe.com/
• YouTube - https://www.youtube.com/@mreflow
—
Check Out Nathan's Stuff:
—
The Next Wave is a HubSpot Original Podcast // Brought to you by The HubSpot Podcast Network // Production by Darren Clarke // Editing by Ezra Bakker Trupiano