Release of Partial Data Set and Future Data Planning
The organization has not released the full data set because verifying copyright duration is a lengthy process. Over the coming weeks and months, however, it plans to publish additional data sets from various sources. The data set includes 180 billion words and a major collection of 21 million digitized newspapers in multiple languages, including German, French, Spanish, Dutch, and Italian, with German and French accounting for significant portions of the data.