
What are generalist agents and why they can change the AI game (Ep. 199)
Data Science at Home
00:00
Using a General Sequence Model to Predict the Next Token
One point two billion perameter model is not something that you can deal with your regular lap up here. The training process has been performed on a 16 by 16 tpu, or tensor processing unit v three,. And the alttext anotation of each mage l t ip stands for long tacks and image pairs which consists of 312 million images with captions. Are also captioning dato sets with about thre million and hundred 20 thousand image text payers respectively.
Transcript
Play full episode