Exploring Sparse Autoencoders for Enhanced Model Interpretability

This chapter delves into the technical details of sparse autoencoders and their significance in understanding large language models. It argues that further training on these models can enhance interpretability and address challenges like deceptive alignment.

Play episode from 01:52:44

chevron_right

Transcript

chevron_right

Transcript

Episode notes

Our 180th episode with a summary and discussion of last week's big AI news!

With hosts Andrey Kurenkov (https://twitter.com/andrey_kurenkov) and Jeremie Harris (https://twitter.com/jeremiecharris)

If you would like to get a sneak peek and help test Andrey's generative AI application, go to Astrocade.com to join the waitlist and the discord.

Read out our text newsletter and comment on the podcast at https://lastweekin.ai/

If you would like to become a sponsor for the newsletter, podcast, or both, please fill out this form.

Email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai

Episode Highlights:

Ideogram AI's new features, Google's Imagine 3, Dream Machine 1.5, and Runway's Gen3 Alpha Turbo model advancements.
Perplexity's integration of Flux image generation models and code interpreter updates for enhanced search results.
Exploration of the feasibility and investment needed for scaling advanced AI models like GPT-4 and Agent Q architecture enhancements.
Analysis of California's AI regulation bill SB1047 and legal issues related to synthetic media, copyright, and online personhood credentials.

Timestamps + Links:

(00:00:00) Intro / Banter
(00:01:08) Response to Listener Comments / Corrections
Tools & Apps
- (00:03:58) Ideogram AI expands its features with v2 model and color palette options
- (00:07:48) Google Releases Powerful AI Image Generator You Can Use for Free
- (00:11:41) Perplexity adds Flux.1 model for Pro users alongside Playground v3 update
- (00:13:58) Luma drops Dream Machine 1.5 — here’s what’s new
- (00:17:49) Runway’s Gen-3 Alpha Turbo is here and can make AI videos faster than you can type
- (00:20:21) Perplexity’s latest update improves code interpreter, charts included
Applications & Business
Projects & Open Source
Research & Advancements
- (01:12:35) Can AI Scaling Continue Through 2030?
- (01:15:35) Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents
- (01:23:58) Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Models
- (01:31:18) Loss of plasticity in deep continual learning
Policy & Safety
Synthetic Media & Art
- (01:58:33) Authors sue Claude AI chatbot creator Anthropic for copyright infringement
- (01:59:32) Artists’ lawsuit against Stability AI and Midjourney gets more punch
(02:01:43) Outro

See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app

Home Top podcasts Popular guests Top books