AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
RL and Language Queue Learning
There has been research on sequential decision-making with language and various kinds of language models. It hasn't been at quite the same scale as the recent human preferences work that everyone's more familiar with. But for example, we could go back to some work that I actually found very inspiring a few years ago by Natasha Jakes when she was at MIT. She built a chatbot that would optimize not human preferences, but actually human sentiment.