Multimodal RLHF Fine-Tuning: LLaVA, Factually Augmented RLHF, and MMHal-Bench

This chapter explores the challenges of aligning multiple modalities during instruction tuning and RLHF, highlighting how misalignment between modalities can lead a model to generate ungrounded textual outputs.


