Release of Partial Data Set and Future Data Planning

Get the app

#161 - Claude 3 beats GPT-4, Stability CEO resigns, DBRX, TacticAI, UN resolution on AI

Last Week in AI

chevron_right

notes

NOTE

Release of Partial Data Set and Future Data Planning

The organization has not released the full data set due to the lengthy copyright duration verification process. However, in the upcoming weeks and months, they plan to publish more additional data sets from various sources. The data set includes 180 billion words and a major collection of 21 million digitized newspapers in multiple languages like German, French, Spanish, Dutch, and Italian, with significant portions of data in German and French.

00:00

Transcript

chevron_right

Play full episode

chevron_right

Transcript

Episode notes

Our 161st episode with a summary and discussion of last week's big AI news!

Check out our sponsor, the SuperDataScience podcast. You can listen to SDS across all major podcasting platforms (e.g., Spotify, Apple Podcasts, Google Podcasts) plus there’s a video version on YouTube.

Read out our text newsletter and comment on the podcast at https://lastweekin.ai/

Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai

Note - one extra story we didn't get to but worth knowing from this week: ‘Totally surreal’: OpenAI shares first short films created with new AI tool Sora

Timestamps + links:

(00:00:00) Intro / Banter
Tools & Apps
- (00:05:20) Google starts testing AI overviews from SGE in main Google search interface
- (00:10:00) Adobe’s new GenStudio platform is an AI factory for advertisers
- (00:13:53) Claude-3 Haiku has reached GPT-4 level by our user preference
- (00:15:26) Microsoft Teams is getting smarter Copilot AI features
- (00:17:16) Samsung is beating Apple in the race to bring AI to smartphones
- (00:19:18) Elon Musk says all premium subscribers on X will gain access to AI chatbot Grok this week
Applications & Business
Projects & Open Source
- (00:38:40) Introducing DBRX: A New State-of-the-Art Open LLM
- (00:44:06) Common Corpus: A Large Public Domain Dataset for Training LLMs
- (00:46:07) DROID: A Large-Scale In-the-Wild Robot Manipulation Dataset
- (00:48:15) InternLM2 Technical Report
Research & Advancements
- (00:51:49) TacticAI: an AI assistant for football tactics
- (00:55:44) On the Conversational Persuasiveness of Large Language Models: A Randomized Controlled Trial
- (00:59:26) AutoDev: Automated AI-Driven Development
- (01:02:48) Reverse Training to Nurse the Reversal Curse
- (01:07:43) Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference
- (01:09:48) Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction
Policy & Safety
- (01:11:12) General Assembly adopts landmark resolution on artificial intelligence
- (01:16:24) Israel Deploys Expansive Facial Recognition Program in Gaza
- (01:19:49) LINEARITY OF RELATION DECODING IN TRANSFORMER LANGUAGE MODELS
- (01:24:52) US Weighs Sanctioning Huawei’s Secretive Chinese Chip Network
- (01:27:51) New York City welcomes robotaxis — but only with safety drivers
- (01:28:40) The White House Puts New Guardrails on Government Use of AI
Synthetic Media & Art
- (01:31:17) BBC Will Stop Using AI For ‘Doctor Who’ Promotion After Receiving Complaints

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.

Home Top podcasts Popular guests