How to Train a Model to Not Generate Toxic Content

The goal of the ALMO project is to keep academia and open source developers in the game. Right now our approach is on the pre-training data side: we filter out a significant percentage of the toxic content. This is another reason open source language models matter: they give people a place to start trying simpler methods that would do the same job. We see a lot of formerly public research disappearing behind closed doors, and we're worried about it.
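To make the idea of filtering on the pre-training data side concrete, here is a minimal sketch of threshold-based filtering. It is not the project's actual pipeline: the names `BLOCKLIST`, `toy_toxicity_score`, and `filter_corpus`, the placeholder tokens, and the 0.1 threshold are all assumptions made for illustration, and the word-list scorer stands in for whatever trained classifier a real pipeline would use.

```python
from typing import Callable, Iterable, Iterator

# Hypothetical placeholder lexicon. A real pipeline would more likely
# score documents with a trained toxicity classifier.
BLOCKLIST = {"badword1", "badword2"}


def toy_toxicity_score(text: str) -> float:
    """Return the fraction of whitespace-separated tokens found in BLOCKLIST."""
    tokens = text.lower().split()
    if not tokens:
        return 0.0
    return sum(token in BLOCKLIST for token in tokens) / len(tokens)


def filter_corpus(
    docs: Iterable[str],
    score: Callable[[str], float] = toy_toxicity_score,
    threshold: float = 0.1,  # assumed cutoff, chosen only for this example
) -> Iterator[str]:
    """Yield only the documents whose score is below the threshold."""
    for doc in docs:
        if score(doc) < threshold:
            yield doc


docs = [
    "a clean sentence about language models",
    "badword1 badword2 here",
]
kept = list(filter_corpus(docs))
```

Because the scorer is passed in as a function, a stronger classifier can replace the toy one without changing the filtering loop, which is the kind of simple, swappable method the speaker suggests open models make possible.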

