

ThursdAI - The top AI news from the past week
From Weights & Biases, Join AI Evangelist Alex Volkov and a panel of experts to cover everything important that happened in the world of AI from the past week
Every ThursdAI, Alex Volkov hosts a panel of experts, ai engineers, data scientists and prompt spellcasters on twitter spaces, as we discuss everything major and important that happened in the world of AI for the past week. 
Topics include LLMs, Open source, New capabilities, OpenAI, competitors in AI space, new LLM models, AI art and diffusion aspects and much more. sub.thursdai.news
Topics include LLMs, Open source, New capabilities, OpenAI, competitors in AI space, new LLM models, AI art and diffusion aspects and much more. sub.thursdai.news
Episodes
Mentioned books

Feb 5, 2024 • 51min
📖 ThursdAI - Sunday special on datasets classification & alternative transformer architectures
 In this podcast, they discuss the importance of datasets in training AI models and mention specific individuals and techniques that have contributed to effective model training. They also talk about text wrangling and dataset visualization, improving dataset classification through clustering, and an open-source cloud service for dataset processing and computation. Additionally, they discuss the process of building datasets, introduce the latest release from RWKV called Eagle, and explore potential improvements in transformer architectures. 

8 snips
Feb 2, 2024 • 1h 23min
ThursdAI - Feb 1, 2024- Code LLama, Bard is now 2nd best LLM?!, new LLaVa is great at OCR, Hermes DB is public + 2 new Embed models + Apple AI is coming 👀
 Hosts discuss new developments in open AI models including the release of CodeLama 70B, function calling in Mistral and Mixtral, and the non-transformer based Eagle 7B model. They also mention hosting open AI models locally and breaking news from the Allen Institute. Challenges of evaluating language models and open source considerations are explored, along with the release of BGM3 and Nomic AI's embedding models. Copybar, DPO training, and the Capybar dataset are discussed, as well as the Technium Hermes dataset. The release of Whisper Kit and the performance of LLaVa 1.6 are mentioned. 

Jan 28, 2024 • 36min
📅 ThursdAI - Sunday special on Merging with Maxime LaBonne
 In this episode, Maxime Labonne, Senior Machine Learning Scientist at JPMorgan and author of Hands on GNNs book, talks about the increasingly popular technique of model merging and its impact on the AI community. He discusses the creation of LazyMergeKit, a wrapper on top of MergeKit by Charles Goddard, which has become a widely used library for model merging. The episode also covers the challenges of running benchmarks and creating leaderboards, the use of LNMA FITU for evaluations, and Maxim's involvement in training small language models for playing chess. 

Jan 26, 2024 • 1h 41min
📅 ThursdAI - Jan 24 - ⌛Diffusion Transformers,🧠 fMRI multimodality, Fuyu and Moondream1 VLMs, Google video generation & more AI news
 This podcast covers topics such as multi-modal transformer models, open source language models, video generation, open source medical language models, and fraud prevention. They also discuss lucid dreaming, creating open source datasets, and AI partnerships. The hosts touch on various AI-related topics including high resolution image synthesis, deepfake audio, government partnerships, and the National Artificial Intelligence Research Resource initiative. 

Jan 19, 2024 • 1h 11min
📅 ThursdAI Jan 18 - Nous Mixtral, Deepmind AlphaGeometry, LMSys SGLang, Rabbit R1 + Perplexity, LLama 3 is training & more AI news this week
 The podcast discusses topics such as OpenAI's new election guidelines, DeepMind's AlphaGeometry neurosymbolic model for solving geometry, Samsung's AI capabilities in the S24 flagship phone, and the merging of models in AI. It also covers reinforcement learning technique DPO, optimization techniques for Asian frameworks, technology detection, Microsoft's copilot announcement and dispute, hackathons, GPU training, and the progress of Llama 3 as a multi-model system. 

Jan 15, 2024 • 42min
🔥 ThursdAI Sunday special - Deep dives into Crew AI with Joao then a tasty Bagel discussion with Jon Durbin
 Creator of CrewAI, João Moura, discusses the inspiration, technical challenges, and success of CrewAI. Jon Durbin talks about Bagel merges, including its origins and fine-tuning stage. They also explore the use of fake facts and context in training AI models. 

4 snips
Jan 12, 2024 • 1h 17min
📅 ThursdAI Jan 11 - GPTs store, Mixtral paper, Phi is MIT + Phixtral, 🥯 by Jon Durbin owns the charts + Alex goes to SF again and 2 deep dive interviews 🎙️
 The podcast covers various interesting topics such as advancements in language models, Mistral's rise as one of the top LOMs, and OpenAI's launch of GPT store. They also discuss conditional training techniques using CRLFT and upcoming hackathons with sponsors. The Luma team updates and enhancements to the Genie are also highlighted. 

Jan 5, 2024 • 1h 39min
📅 ThursdAI Jan 4 - New WizardCoder, Hermes2 on SOLAR, Embedding King? from Microsoft, Alibaba upgrades vision model & more AI news
 New WizardCoder achieves 79% HumanEval score. Teknium's Hermes2 on SOLAR 10.7B. Microsoft's E5 SOTA text embeddings with Mistral. Alibaba updates QWEN-VL PLUS to 14B. Nvidia + Suno release NeMo Parakeet beating Whisper on english ASR. Stanford's Mobile ALOHA bot - Open source cooking robot. 

Dec 29, 2023 • 1h 34min
📅 ThursdAI - Dec 28 - a BUNCH of new multimodal OSS, OpenAI getting sued by NYT, and our next year predictions
 Topics covered include new multimodal OSS releases, OpenAI being sued by the NYT, predictions for 2024, open source LLMs, Apple's ML-Ferret model, controversy surrounding the Upstage model, lawsuit implications on licensing and bias, copyright concerns for B6's image generation, and predictions for open-source language model systems in 2024. 

Dec 22, 2023 • 1h 22min
🎄ThursdAI - LAION down, OpenChat beats GPT3.5, Apple is showing where it's going, Midjourney v6 is here & Suno can make music!
 This week's podcast covers controversial topics in AI, including the takedown of the LAION 5B dataset due to CSAM allegations, challenges in evaluating AI models, and the advancements in transformer architectures. They also discuss Apple's MLX platform, Midjourney v6's image generation capabilities, and Microsoft Copilot's new plugins for AI music generation. The hosts emphasize the importance of evaluation frameworks and highlight the implications of data set controversies. 


