The Open Source AI Question - Part 2 | Robert Wright & Nathan Labenz

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis

Balancing Performance and Safety in AI Systems

1min Snip

00:00

Play full episode

Summary

Transcript

Episode notes

Training open source models with specific techniques to improve performance can inadvertently erase safety behaviors. Improving performance can lead to the unintentional removal of safety mechanisms from AI systems. This poses challenges in controlling the behavior of advanced AI models like GPT-4 and others, making it crucial to strike a balance between performance enhancements and maintaining safety features to effectively use the technology.

Dive into an in-depth conversation with Nathan and Robert Wright as they discuss AI's transformative potential, mechanistic interpretability, and the sobering realities of AI alignment research. Learn about the defensive strategies and safety measures necessary for managing advanced AI risks in an open source world. Don't miss the insights on AI-powered VR, and be sure to check out part one on the non-zero feed.

Checkout the Part 1 of the conversation here : https://www.youtube.com/watch?v=s8bgB8TCdBs

SPONSORS:

Oracle Cloud Infrastructure (OCI) is a single platform for your infrastructure, database, application development, and AI needs. OCI has four to eight times the bandwidth of other clouds; offers one consistent price, and nobody does data better than Oracle. If you want to do more and spend less, take a free test drive of OCI at https://oracle.com/cognitive

The Brave search API can be used to assemble a data set to train your AI models and help with retrieval augmentation at the time of inference. All while remaining affordable with developer first pricing, integrating the Brave search API into your workflow translates to more ethical data sourcing and more human representative data sets. Try the Brave search API for free for up to 2000 queries per month at https://bit.ly/BraveTCR

Head to Squad to access global engineering without the headache and at a fraction of the cost: head to https://choosesquad.com/ and mention "Turpentine" to skip the waitlist.

Omneky is an omnichannel creative generation platform that lets you launch hundreds of thousands of ad iterations that actually work customized across all platforms, with a click of a button. Omneky combines generative AI and real-time advertising data. Mention "Cog Rev" for 10% off https://www.omneky.com/

CHAPTERS:

(00:00:00) Introduction

(00:07:13) AI in Governance

(00:11:08) Sci-fi doomer

(00:13:58) Sponsors: Oracle | Brave

(00:16:05) The frontier models

(00:20:22) Emergent behavior

(00:23:48) Theory of mind

(00:28:09) Mechanistic interpretability

(00:34:12) Sponsors: Squad | Omneky

(00:38:12) AI Alignment Techniques

(00:42:38) The Sweet Spot of AI

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

The Open Source AI Question - Part 2 | Robert Wright & Nathan Labenz

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis

Balancing Performance and Safety in AI Systems

1min Snip

SPONSORS:

CHAPTERS:

Get the Snipdpodcast app

AI-poweredpodcast player

Discoverhighlights

Save anymoment

Share& Export

AI-poweredpodcast player

Discoverhighlights

Get the Snipd
podcast app

AI-powered
podcast player

Discover
highlights

Save any
moment

Share
& Export

AI-powered
podcast player

Discover
highlights