ThursdAI - The top AI news from the past week

Every ThursdAI, Alex Volkov hosts a panel of experts, ai engineers, data scientists and prompt spellcasters on twitter spaces, as we discuss everything major and important that happened in the world of AI for the past week. Topics include LLMs, Open source, New capabilities, OpenAI, competitors in AI space, new LLM models, AI art and diffusion aspects and much more. sub.thursdai.news

Latest episodes

6 snips

Mar 1, 2024 • 1h 54min

📅 ThursdAI - Feb 29 - Leap Year Special ✨

This podcast episode covers updates on AI art, diffusion, and 3D, Alibaba's EMO face animation, Ideogram 1.0 text generation, Mistral Large and Le Chat models, GGGenie from Google for interactive games, and OpenAI collaborations with Microsoft. It also discusses Matryoshka Representation Learning, Berkeley's Function-Calling leaderboard, and Argilla's OpenHermesPreferences dataset for RLHF & DPO.

Feb 23, 2024 • 1h 48min

📅 ThursdAI Feb 22nd - Groq near instant LLM calls, SDXL Lightning near instant SDXL, Google gives us GEMMA open weights and refuses to draw white people, Stability announces SD3 & more AI news

This podcast covers the breakthroughs in AI speed with Groq and ByteDance's Lightning model, Google's release of open weights models, SD3 announcement by Stability AI, and discussions on biases in image generation models. It also touches on advancements in real-time object detection, AI training deals, and the revolutionary AI architecture of Groq.

6 snips

Feb 16, 2024 • 1h 58min

🔥 ThursdAI - Feb 15, 2024 - OpenAI changes the Video Game, Google changes the Context game, and other AI news from past week

This podcast covers two exciting breaking news announcements - OpenAI's text-to-video generation and Google's update with a 1 million tokens context window. They discuss OpenAI's Sora model and Stable Cascade, a diffusion model. They also chat with Swyx about Latent Space and AI engineering. Topics range from AI advancements to community-building and succeeding in San Francisco.

Feb 9, 2024 • 1h 54min

📅 ThursdAI - Feb 8 - Google Gemini Ultra is here, Qwen 1.5 with Junyang and deep dive into ColBERT, RAGatouille and DSPy with Connor Shorten and Benjamin Clavie

Junyang from Alibaba discusses Qwen 1.5 and QwenVL. Deep dive into DSPy, ColBERT, and RAGatouille with Connor and Benjamin from Weights & Biases. Topics include open source LLMs, Google Gemini Ultra, and fine-tuning embedding models for code retrieval.

Feb 5, 2024 • 51min

📖 ThursdAI - Sunday special on datasets classification & alternative transformer architectures

In this podcast, they discuss the importance of datasets in training AI models and mention specific individuals and techniques that have contributed to effective model training. They also talk about text wrangling and dataset visualization, improving dataset classification through clustering, and an open-source cloud service for dataset processing and computation. Additionally, they discuss the process of building datasets, introduce the latest release from RWKV called Eagle, and explore potential improvements in transformer architectures.

8 snips

Feb 2, 2024 • 1h 23min

ThursdAI - Feb 1, 2024- Code LLama, Bard is now 2nd best LLM?!, new LLaVa is great at OCR, Hermes DB is public + 2 new Embed models + Apple AI is coming 👀

Hosts discuss new developments in open AI models including the release of CodeLama 70B, function calling in Mistral and Mixtral, and the non-transformer based Eagle 7B model. They also mention hosting open AI models locally and breaking news from the Allen Institute. Challenges of evaluating language models and open source considerations are explored, along with the release of BGM3 and Nomic AI's embedding models. Copybar, DPO training, and the Capybar dataset are discussed, as well as the Technium Hermes dataset. The release of Whisper Kit and the performance of LLaVa 1.6 are mentioned.

Jan 28, 2024 • 36min

📅 ThursdAI - Sunday special on Merging with Maxime LaBonne

In this episode, Maxime Labonne, Senior Machine Learning Scientist at JPMorgan and author of Hands on GNNs book, talks about the increasingly popular technique of model merging and its impact on the AI community. He discusses the creation of LazyMergeKit, a wrapper on top of MergeKit by Charles Goddard, which has become a widely used library for model merging. The episode also covers the challenges of running benchmarks and creating leaderboards, the use of LNMA FITU for evaluations, and Maxim's involvement in training small language models for playing chess.

Jan 26, 2024 • 1h 41min

📅 ThursdAI - Jan 24 - ⌛Diffusion Transformers,🧠 fMRI multimodality, Fuyu and Moondream1 VLMs, Google video generation & more AI news

This podcast covers topics such as multi-modal transformer models, open source language models, video generation, open source medical language models, and fraud prevention. They also discuss lucid dreaming, creating open source datasets, and AI partnerships. The hosts touch on various AI-related topics including high resolution image synthesis, deepfake audio, government partnerships, and the National Artificial Intelligence Research Resource initiative.

Jan 19, 2024 • 1h 11min

📅 ThursdAI Jan 18 - Nous Mixtral, Deepmind AlphaGeometry, LMSys SGLang, Rabbit R1 + Perplexity, LLama 3 is training & more AI news this week

The podcast discusses topics such as OpenAI's new election guidelines, DeepMind's AlphaGeometry neurosymbolic model for solving geometry, Samsung's AI capabilities in the S24 flagship phone, and the merging of models in AI. It also covers reinforcement learning technique DPO, optimization techniques for Asian frameworks, technology detection, Microsoft's copilot announcement and dispute, hackathons, GPU training, and the progress of Llama 3 as a multi-model system.

Jan 15, 2024 • 42min

🔥 ThursdAI Sunday special - Deep dives into Crew AI with Joao then a tasty Bagel discussion with Jon Durbin

Creator of CrewAI, João Moura, discusses the inspiration, technical challenges, and success of CrewAI. Jon Durbin talks about Bagel merges, including its origins and fine-tuning stage. They also explore the use of fake facts and context in training AI models.

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app