Recent approaches are challenging the idea that we have no insight into model functioning. Through visualizations and feature manipulation, such as setting drug-related features to zero, it is possible to detect and control model behavior at runtime, making it harder to manipulate models for undesired outcomes.
Our 170th episode with a summary and discussion of last week's big AI news!
With hosts Andrey Kurenkov (https://twitter.com/andrey_kurenkov) and Jeremie Harris (https://twitter.com/jeremiecharris)
Feel free to leave us feedback here.
Read out our text newsletter and comment on the podcast at https://lastweekin.ai/
Email us your questions and feedback at contact@lastweekin.ai and/or hello@gladstone.ai
Timestamps + Links:
- Tools & Apps
- Applications & Business
- Projects & Open Source
- Research & Advancements
- Policy & Safety
- Synthetic Media & Art
- (01:46:25) Outro + AI Song