Data Skeptic cover image

Data Skeptic

Latest episodes

undefined
Oct 2, 2023 • 38min

Agents with Theory of Mind Play Hanabi

Nieves Montes, Ph.D. student specializing in value-based reasoning and theory of mind, discusses her latest research on combining theory of mind and abductive reasoning in agent-oriented programming. The podcast explores the mechanics and challenges of Hanabi, the relationship between theory of mind and abduction, using predicate logic to represent desire and motivation in an agent, reasoning about other players in Hanabi, and future plans and online presence.
undefined
6 snips
Sep 25, 2023 • 26min

LLMs for Evil

Maximilian Mozes, PhD student at the University College, London, specializing in NLP and adversarial machine learning, discusses the potential malicious uses of Large Language Models (LLMs), challenges of detecting AI-generated harmful content, reinforcement learning with Human Feedback, limitations and safety concerns of LLMs, threats of data poisoning and jailbreaking, and approaches to avoid issues with LLMs.
undefined
Sep 11, 2023 • 31min

The Defeat of the Winograd Schema Challenge

Machine Learning Engineer, Vid Kocijan, discusses the Winograd Schema Challenge and the advancements in Natural Language Processing. They explore the different schools of thought in NLP, the difficulty and techniques in the challenge, and the resolution of the challenge including alternative metrics.
undefined
13 snips
Sep 4, 2023 • 34min

LLMs in Social Science

Petter Törnberg, an Assistant Professor in Computational Social Science, discusses findings from his research papers on the performance of Chat GPT in interpreting political tweets, the ease of using language models in social science research, and the controversy surrounding large language models in social science.
undefined
Aug 28, 2023 • 34min

LLMs in Music Composition

Carlos Hernández Oliván, a Ph.D. student at the University of Zaragoza, discusses building new models for symbolic music generation. He explores whether these models are truly creative and shares situations where AI-generated music can pass the Turing test. He also highlights essential considerations when constructing models for music composition, including the role of creativity and the comparison between language models and music modeling. The podcast also delves into the potential of collaboration between music theorists, composers, and researchers.
undefined
13 snips
Aug 21, 2023 • 27min

Cuttlefish Model Tuning

Hongyi Wang, a Senior Researcher at Carnegie Mellon University, discusses his research paper on low-rank model training. He addresses the need for optimizing ML model training and the challenges of training large models. He introduces the Cuttlefish model, its use cases, and its superiority over the Low-Rank Adaptation technique. He also offers advice on entering the machine learning field.
undefined
21 snips
Aug 15, 2023 • 39min

Which Professions Are Threatened by LLMs

On today’s episode, we have Daniel Rock, an Assistant Professor of Operations Information and Decisions at the Wharton School of the University of Pennsylvania. Daniel’s research focuses on the economics of AI and ML, specifically how digital technologies are changing the economy. Daniel discussed how AI has disrupted the job market in the past years. He also explained that it had created more winners than losers. Daniel spoke about the empirical study he and his coauthors did to quantify the threat LLMs pose to professionals. He shared how they used the O-NET dataset and the BLS occupational employment survey to measure the impact of LLMs on different professions. Using the radiology profession as an example, he listed tasks that LLMs could assume. Daniel broadly highlighted professions that are most and least exposed to LLMs proliferation. He also spoke about the risks of LLMs and his thoughts on implementing policies for regulating LLMs.
undefined
10 snips
Aug 8, 2023 • 49min

Why Prompting is Hard

We are excited to be joined by J.D. Zamfirescu-Pereira, a Ph.D. student at UC Berkeley. He focuses on the intersection of human-computer interaction (HCI) and artificial intelligence (AI). He joins us to share his work in his paper, Why Johnny can’t prompt: how non-AI experts try (and fail) to design LLM prompts.  The discussion also explores lessons learned and achievements related to BotDesigner, a tool for creating chat bots.
undefined
7 snips
Jul 31, 2023 • 36min

Automated Peer Review

In this episode, we are joined by Ryan Liu, a Computer Science graduate of Carnegie Mellon University. Ryan will begin his Ph.D. program at Princeton University this fall. His Ph.D. will focus on the intersection of large language models and how humans think. Ryan joins us to discuss his research titled "ReviewerGPT? An Exploratory Study on Using Large Language Models for Paper Reviewing"
undefined
16 snips
Jul 24, 2023 • 44min

Prompt Refusal

The creators of large language models impose restrictions on some of the types of requests one might make of them.  LLMs commonly refuse to give advice on committing crimes, producting adult content, or respond with any details about a variety of sensitive subjects.  As with any content filtering system, you have false positives and false negatives. Today's interview with Max Reuter and William Schulze discusses their paper "I'm Afraid I Can't Do That: Predicting Prompt Refusal in Black-Box Generative Language Models".  In this work, they explore what types of prompts get refused and build a machine learning classifier adept at predicting if a particular prompt will be refused or not.

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner