EP38: Ed Sheeran Listens to Our Podcast, Deep Fakes & Frontier Risks and AI Ears: SALMONN Model

10 snips

Oct 27, 2023

Ed Sheeran, a famous musician, makes a surprise appearance and discusses his love for the podcast. The podcast also covers topics such as deep fakes and their potential dangers, AI-generated voices becoming undetectable, challenges in web crawling, limitations of current PDF to text technology, and the idea of creating an agent as a moral conscious.

Ask episode

Chapters

Transcript

Episode notes

Introduction

00:00 • 3min

FBI Preparedness Challenge and OpenAI Restrictions

02:47 • 4min

Video Retalking: Making Deepfakes Easier

06:35 • 8min

Salmonn: AI-Generated Voices Become Undetectable

14:58 • 11min

Challenges of Web Crawling and Headless Browsing

26:23 • 4min

The Limitations of Current PDF to Text Technology and the Potential of Multimodal AI

30:37 • 4min

AI-Generated Image of a Woman's Face

34:16 • 6min

Building an Agent as a Moral Conscious

40:30 • 18min

AI Improving Prompts and Enhancing Models

58:50 • 9min

Join the Discord: https://discord.gg/2j6k7AXw

This week, juicy revelations from Ed Sheeran and Taylor Swift's secret love affair! We also discuss the latest mind-blowing AI innovations, including talking heads, vision models that can see from every angle, and intelligent agents plotting world domination. Don't miss our spicy debate on whether AI will transform humanity or destroy us all. Plus advice from Chris on picking up virtual girlfriends using neural networks - this episode has it all!

Please note the Ed Sheeran bit is a joke (please don't sue us haha) and an example of a deep fake and deep fake technology for comedy. Please Ed. We're begging you.

Please consider reviewing the podcast to support the show. We read them all and they mean a lot to us :).

CHAPTERS
=====
00:00 - Ed Sheeran Actually Listens to Our Podcast
02:17 - Frontier Risk and Preparedness, Deep Fakes & VideoReTalking
15:06 - ByteDance's SALMONN AI Audio, Music, Sound Model for AI Hearing
23:01 - Adept's fuyu 8B Vision Model: The Future of How AI Agents Navigate the Web?
34:41 - Multiple Agents in the Metaverse & Zero123++ Making Single Images into 3D Objects
46:42 - Google's Gemini Leaks & Stubbs + Our Failed Gemini Leaker Source
50:17 - Is AI Boring? Chris Roasts Jacob Browning
1:03:41 - Bing's Sydney is Still Trying to Escape & Threatening Humanity

SOURCES:
=====
https://openai.com/blog/frontier-risk-and-preparedness
https://openai.com/form/preparedness-challenge
https://github.com/OpenTalker/video-retalking
https://venturebeat.com/ai/tiktok-makers-new-ai-salmonn-understands-all-audio-not-just-music-and-voices/
https://github.com/OpenTalker/video-retalking
https://huggingface.co/adept/fuyu-8b
https://www.adept.ai/
https://arxiv.org/pdf/2310.15110.pdf
https://twitter.com/dylan522p/status/1716937534490435874?s=46
https://medium.com/@bedros-p/gemini-is-coming-to-makersuite-so-are-stubbs-32248f3924aa
https://medium.com/@bedros-p/stubbs-is-coming-form-your-own-opinions-386489a3f844
https://twitter.com/ylecun/status/1717616244600238358?s=46
https://www.jacob-browning.com/post/generative-ai-is-boring
https://twitter.com/MichaelTontchev/status/1715876157105791138/photo/1