ThursdAI - The top AI news from the past week cover image

πŸ“– ThursdAI - Sunday special on datasets classification & alternative transformer architectures

ThursdAI - The top AI news from the past week

00:00

Text Wrangling and Dataset Visualization

The host discusses the process of wrangling large datasets of text and mentions examples of datasets ranging from 16,000 to millions of rows. They also talk about the LIDAC tool and its usage in categorizing and visualizing large datasets like the Hermes dataset and the Open Orchid dataset. Additionally, they discuss the concept of classifiers and their relation to embedding concepts, emphasizing the importance of data quality for AI systems.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app