Skepticism Over Language Model Benchmarking and Open Source Initiatives

This chapter examines the doubts surrounding benchmarking methods for large language models, particularly criticizing the MMLU for its superficial insights. It also emphasizes the importance of collaboration in the AI community, highlighting contributions from companies like Accubits Technology and funding initiatives from a16z.

Play episode from 35:30

chevron_right

Transcript

chevron_right

Transcript

Episode notes

Our 137th episode with a summary and discussion of last week's big AI news!

With guest host Jessica Dai. Check out her Reboot publication!

Read out our text newsletter and comment on the podcast at https://lastweekin.ai/

Email us your questions and feedback at contact@lastweekin.ai

Check out our sponsor, the SuperDataScience podcast. You can listen to SDS across all major podcasting platforms (e.g., Spotify, Apple Podcasts, Google Podcasts) plus there’s a video version on YouTube.

Timestamps + Links:

(00:00) Intro
(01:37) SuperDataScience podcast ad
Tools & Apps
Applications & Business
Projects & Open Source
Research & Advancements
Policy & Safety
- (56:18) Tech leaders including Musk, Zuckerberg call for government action on AI
- (01:01:32) US court rules that artificial intelligence generated artwork cannot be copyrighted
- (01:04:00) 2 Senators Propose Bipartisan Framework for A.I. Laws
- (01:05:47) Transcript: US Senate Judiciary Hearing on Oversight of A.I.
- (01:06:02) U.S. Copyright Office Invites Public To Comment On AI
Synthetic Media & Art
(01:17:23) Outro

See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app

Home Top podcasts Popular guests Top books