Training From Scratch With Pruned Model Heads?

Based on these results, what do you think we can change about the way we build these models? Is there anything else you think we should do differently?

I see several possible directions of research. First, I haven't mentioned it before, but I've also looked at whether we can train a model from scratch with the same configuration of heads as the pruned ones. We found that, no, we cannot. It's better to prune a much bigger model than to train a model of the same size from scratch. And there may be connections to the lottery ticket hypothesis.

