In episode 99 of The Gradient Podcast, Daniel Bashir speaks to Professor Martin Wattenberg.
Professor Wattenberg is a professor at Harvard and part-time member of Google Research’s People + AI Research (PAIR) initiative, which he co-founded. His work, with long-time collaborator Fernanda Viégas, focuses on making AI technology broadly accessible and reflective of human values. At Google, Professor Wattenberg, his team, and Professor Viégas have created end-user visualizations for products such as Search, YouTube, and Google Analytics. Note: Professor Wattenberg is recruiting PhD students through Harvard SEAS—info here.
Have suggestions for future podcast guests (or other feedback)? Let us know here or reach us at editor@thegradient.pub
Subscribe to The Gradient Podcast: Apple Podcasts | Spotify | Pocket Casts | RSSFollow The Gradient on Twitter
Outline:
* (00:00) Intro
* (03:30) Prof. Wattenberg’s background
* (04:40) Financial journalism at SmartMoney
* (05:35) Contact with the academic visualization world, IBM
* (07:30) Transition into visualizing ML
* (08:25) Skepticism of neural networks in the 1980s
* (09:45) Work at IBM
* (10:00) Multiple scales in information graphics, organization of information
* (13:55) How much information should a graphic display to whom?
* (17:00) Progressive disclosure of complexity in interface design
* (18:45) Visualization as a rhetorical process
* (20:45) Conversation Thumbnails for Large-Scale Discussions
* (21:35) Evolution of conversation interfaces—Slack, etc.
* (24:20) Path dependence — mutual influences between user behaviors and technology, takeaways for ML interface design
* (26:30) Baby Names and Social Data Analysis — patterns of interest in baby names
* (29:50) History Flow
* (30:05) Why investigate editing dynamics on Wikipedia?
* (32:06) Implications of editing patterns for design and governance
* (33:25) The value of visualizations in this work, issues with Wikipedia editing
* (34:45) Community moderation, bureaucracy
* (36:20) Consensus and guidelines
* (37:10) “Neutral” point of view as an organizing principle
* (38:30) Takeaways
* PAIR
* (39:15) Tools for model understanding and “understanding” ML systems
* (41:10) Intro to PAIR (at Google)
* (42:00) Unpacking the word “understanding” and use cases
* (43:00) Historical comparisons for AI development
* (44:55) The birth of TensorFlow.js
* (47:52) Democratization of ML
* (48:45) Visualizing translation — uncovering and telling a story behind the findings
* (52:10) Shared representations in LLMs and their facility at translation-like tasks
* (53:50) TCAV
* (55:30) Explainability and trust
* (59:10) Writing code with LMs and metaphors for using
* More recent research
* (1:01:05) The System Model and the User Model: Exploring AI Dashboard Design
* (1:10:05) OthelloGPT and world models, causality
* (1:14:10) Dashboards and interaction design—interfaces and core capabilities
* (1:18:07) Reactions to existing LLM interfaces
* (1:21:30) Visualizing and Measuring the Geometry of BERT
* (1:26:55) Note/Correction: The “Atlas of Meaning” Prof. Wattenberg mentions is called Context Atlas
* (1:28:20) Language model tasks and internal representations/geometry
* (1:29:30) LLMs as “next word predictors” — explaining systems to people
* (1:31:15) The Shape of Song
* (1:31:55) What does music look like?
* (1:35:00) Levels of abstraction, emergent complexity in music and language models
* (1:37:00) What Prof. Wattenberg hopes to see in ML and interaction design
* (1:41:18) Outro
Links:
* Professor Wattenberg’s homepage and Twitter
* Harvard SEAS application info — Professor Wattenberg is recruiting students!
* Research
* Earlier work
* A Fuzzy Commitment Scheme
* Stacked Graphs—Geometry & Aesthetics
* A Multi-Scale Model of Perceptual Organization in Information Graphics
* Conversation Thumbnails for Large-Scale Discussions
* Baby Names and Social Data Analysis
* History Flow (paper)
* At Harvard and Google / PAIR
* Tools for Model Understanding: Facets, SmoothGrad, Attacking discrimination with smarter ML
* TensorFlow.js
* Visualizing translation
* TCAV
* Other ML papers:
* The System Model and the User Model: Exploring AI Dashboard Design (recent speculative essay)
* Emergent World Representations: Exploring a Sequence Model Trained on a Synthetic Task
* Visualizing and Measuring the Geometry of BERT
* Artwork
* The Shape of Song
Get full access to The Gradient at
thegradientpub.substack.com/subscribe