
Data Skeptic
Q&A with Kyle
Dec 19, 2023
In this Q&A episode, the host discusses finding guests algorithmically, exploring impactful technologies and tools, data annotation as remote work, the QBasic programming language, programming experiences and hacker culture, the 'grep' command-line utility, and the importance of Git for source control.
40:23
Podcast summary created with Snipd AI
Quick takeaways
- The host finds guests algorithmically by crawling arXiv (arxiv.org) and selecting publications based on specific categories and predicted interest levels.
- Data Skeptic is a platform for engaging with diverse researchers and showcasing their work, aiming to create spin-off projects and collaborations in the future.
Deep dives
Finding Guests for the Show
Guests for the podcast are found primarily by algorithm. The host crawls arXiv (arxiv.org), a preprint platform, and selects publications in specific categories of interest. The text of each publication is extracted and indexed using both keywords and BERT embeddings. A machine learning model trained on these embeddings predicts how interesting each publication is likely to be. Keyword matches and predicted interest are then combined, and the top results are presented to the host for consideration.
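The ranking step described above can be sketched roughly as follows. This is a minimal illustration, not the host's actual code: the keyword set, the three-dimensional toy vectors standing in for BERT embeddings, the logistic weights standing in for the trained interest model, and the `alpha` blending weight are all hypothetical, since the episode summary does not say how the two signals are combined.

```python
import math

# Hypothetical keyword list; the real list is not given in the episode summary.
KEYWORDS = {"anomaly", "detection", "time", "series"}

# Toy papers. The "embedding" values are made-up stand-ins for BERT embeddings.
PAPERS = [
    {"title": "Anomaly detection in time series",
     "abstract": "anomaly detection for time series data",
     "embedding": [1.0, 0.0, 0.5]},
    {"title": "Image classification survey",
     "abstract": "a survey of image classification methods",
     "embedding": [0.0, 1.0, 0.0]},
    {"title": "Forecasting with transformers",
     "abstract": "transformers for time series forecasting",
     "embedding": [0.8, 0.1, 0.4]},
]

# Hypothetical parameters of an already-trained logistic interest model.
WEIGHTS = [2.0, -2.0, 1.0]
BIAS = 0.0


def keyword_score(text, keywords):
    """Return the fraction of keywords that appear as tokens in the text."""
    tokens = set(text.lower().split())
    return len(tokens & keywords) / len(keywords)


def predicted_interest(embedding, weights, bias):
    """Logistic model over the embedding, a stand-in for the trained model."""
    z = sum(w * x for w, x in zip(weights, embedding)) + bias
    return 1.0 / (1.0 + math.exp(-z))


def rank_papers(papers, keywords, weights, bias, alpha=0.5, top_k=2):
    """Blend keyword match and predicted interest, return the top_k titles."""
    scored = []
    for paper in papers:
        score = (alpha * keyword_score(paper["abstract"], keywords)
                 + (1 - alpha) * predicted_interest(paper["embedding"], weights, bias))
        scored.append((score, paper["title"]))
    scored.sort(reverse=True)  # highest combined score first
    return [title for _, title in scored[:top_k]]


if __name__ == "__main__":
    for title in rank_papers(PAPERS, KEYWORDS, WEIGHTS, BIAS):
        print(title)
```

A weighted sum is only one way to merge the two signals. In a real pipeline the embeddings would come from a BERT model and the weights from training on past episodes the host found interesting.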