Train with AI Feedback vs Human Feedback: The Difference

AI feedback can have gameable preferences and tend to prefer longer answers over shorter ones, which may not be favorable to human beings. It is important to consider the feedback from human reviewers, as it is more expensive but provides a different perspective and evaluation of the generated outputs.

Snipped by

Dr. Daniel Bender

Play episode from 01:04:54

chevron_right

Transcript

chevron_right

Transcript

Episode notes

Our 145th episode with a summary and discussion of last week's big AI news, this time around with guest co-hosts Kevin and Gavin from AI For Humans podcast

Check out the AI For Humans episode on which Andrey and Jeremie guest co-host here.

Also check out our sponsor, the SuperDataScience podcast. You can listen to SDS across all major podcasting platforms (e.g., Spotify, Apple Podcasts, Google Podcasts) plus there’s a video version on YouTube.

Read out our text newsletter and comment on the podcast at https://lastweekin.ai/

Email us your questions and feedback at contact@lastweekin.ai

Timestamps + links:

Tools & Apps
Applications & Business
Projects & Open Source
- (01:03:15) Starling-7B: Increasing LLM Helpfulness & Harmlessness with RLAIF
- (01:12:10) Defending your voice against deepfakes
Research & Advancements
- (01:16:07) Orca 2: Teaching Small Language Models How to Reason
- (01:22:16) New technique can accelerate language models by 300x
- (01:23:35) DeepMind Says New Multi-Game AI Is a Step Toward More General Intelligence
- (01:27:00) GAIA: A Benchmark for General AI Assistants
- (01:30:45) Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch
Policy & Safety
Synthetic Media & Art
- (01:48:18) Sarah Silverman Hits Stumbling Block In AI Copyright Infringement Lawsuit Against Meta
(01:52:23) Outro

See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app

Home Top podcasts Popular guests Top books