TalkRL: The Reinforcement Learning Podcast cover image

Aravind Srinivas 2

TalkRL: The Reinforcement Learning Podcast

00:00

Is the Decision Transformer Really Relevant in the Big Data Regime?

It's not there yet in terms of really beating the best human engineered algorithms on these benchmarks. It would be nice if it's made to be a very reliable algorithm that works out of something like scikit learn logistic regression. Using it with language or code where you can ask a agent to iteratively change the code based on your feedback that you gave, that would be really awesome. There are so many more variations of this model that people haven't really explored yet and I'm hoping to explore.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app