
Links For April 2023
Astral Codex Ten Podcast
00:00
How to Prevent Negative Selection in AI Interpretability
This month in institution design, the pair ring is a distinctive ring you can wear to signal that you're single and interested in people introducing themselves or flirting with you. A new paper confirms that this is a general pattern whenever right-wing populists win an election. 200 concrete problems in AI interpretability. The tweets from April 1st.
Transcript
Play full episode