The Role of Generalization in Agent Behavior

They can take a llama to a base model and fine tune it to achieve agent performance similar to GPD 3.5. Their system is impressive because it reaches a level that previous models couldn't. They can perform tasks like online shopping, web browsing, and general computer tasks. When fine tuning, they retain the general world knowledge to help the agents perform better on general reasoning tasks. This approach is different from traditional fine tuning for dialogue and instruction following.

Play episode from 01:07:30

chevron_right

Transcript

chevron_right

Transcript

Episode notes

Our 142nd episode with a summary and discussion of last week's big AI new.

Apologies for this one coming out after a pause, episodes will resume being released regularly as of this week.

Read out our text newsletter and comment on the podcast at https://lastweekin.ai/

Email us your questions and feedback at contact@lastweekin.ai

Timestamps + Links:

(00:00) Intro / Banter
Tools & Apps
Applications & Business
Projects & Open Source
Research & Advancements
- (53:22) Eliciting Human Preferences with Language Models
- (57:23) New Nvidia AI agent, powered by GPT-4, can train robots
- (01:01:38) Unveiling the General Intelligence Factor in Language Models: A Psychometric Approach
- (01:04:48) AgentTuning: Enabling Generalized Agent Abilities for LLMs
- (01:09:51) Contrastive Prefence Learning: Learning from Human Feedback without RL
- (01:11:25) ‘Mind-blowing’ IBM chip speeds up AI

Policy & Safety
- (01:14:57) GM Cruise unit suspends all driverless operations after California ban
- (01:18:52) AI researchers uncover ethical, legal risks to using popular data sets
- (01:22:22) AI Safety Summit: day 1 and 2 programme
- (01:25:23) Anthropic's AI chatbot Claude is posting lyrics to popular songs, lawsuit claims
- (01:26:38) Mike Huckabee says Microsoft and Meta stole his books to train AI
- (01:27:10) Clearview AI Successfully Appeals $9 Million Fine in the U.K.
- (01:28:11) North Korea experiments with AI in cyber warfare: US official
- (01:30:17) OpenAI forms new team to assess ‘catastrophic risks’ of AI
- UK poised to establish global advisory group on AI
Synthetic Media & Art

See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app

Home Top podcasts Popular guests Top books