
Practical AI: Machine Learning, Data Science, LLM

Latest episodes

Jan 3, 2023 • 37min

NLP research by & for local communities

While at EMNLP 2022, Daniel got a chance to sit down with an amazing group of researchers creating NLP technology that actually works for their local language communities. Just Zwennicker (Universiteit van Amsterdam) discusses his work on a machine translation system for Sranan Tongo, a creole language spoken in Suriname. Andiswa Bukula (SADiLaR), Rooweither Mabuya (SADiLaR), and Bonaventure Dossou (Lanfrica, Mila) discuss their work with Masakhane to strengthen and spur NLP research in African languages, for Africans, by Africans. The group emphasized the need for more linguistically diverse NLP systems that work in scenarios of data scarcity, non-Latin scripts, rich morphology, etc. You don’t want to miss this one!

Featuring: Just Zwennicker, Andiswa Bukula, Rooweither Mabuya, Bonaventure Dossou, Daniel Whitenack

Show Notes:
EMNLP 2022 papers from the guests: “Towards a general purpose machine translation system for Sranantongo”; “MasakhaNER 2.0: Africa-centric Transfer Learning for Named Entity Recognition”; “AfroLM: A Self-Active Learning-based Multilingual Pretrained Language Model for 23 African Languages”
Other links relevant to the discussion: Masakhane; Lanfrica; The South African Centre for Digital Language Resources (SADiLaR)
Dec 13, 2022 • 30min

SOTA machine translation at Unbabel

José Souza and Ricardo Rei joined Daniel at EMNLP 2022 to discuss state-of-the-art machine translation, the WMT shared tasks, and quality estimation. Among other things, they talk about Unbabel’s innovations in quality estimation, including COMET, a neural framework for training multilingual machine translation (MT) evaluation models.

Featuring: Ricardo Rei, José Souza, Daniel Whitenack

Show Notes: Unbabel; COMET; The WMT workshop/conference; EMNLP
Dec 7, 2022 • 34min

AI competitions & cloud resources

In this special episode, we interview some of the sponsors and teams from a recent case competition organized by Purdue University, Microsoft, INFORMS, and SIL International. 170+ teams from across the US and Canada participated in the competition, which challenged students to create AI-driven systems to caption images in three languages (Thai, Kyrgyz, and Hausa).

Featuring: Matthew Lanham, Mark Tabladillo, Daniel Whitenack

Show Notes: Purdue University’s Krannert School of Business; Master the basics of Azure: AI Fundamentals; Azure Architecture Center; SIL International; The bloom-captioning dataset
Books: “Applied Machine Learning and AI for Engineers” by Jeff Prosise
Nov 29, 2022 • 44min

Copilot lawsuits & Galactica "science"

There are some big AI-related controversies swirling, and it’s time we talk about them. A lawsuit has been filed against GitHub, Microsoft, and OpenAI related to Copilot code suggestions, and many people have been disturbed by the output of Meta AI’s Galactica model. Does Copilot violate open source licenses? Does Galactica output dangerous science-related content? In this episode, we dive into the controversies and risks, and we discuss the benefits of these technologies.

Featuring: Chris Benson, Daniel Whitenack

Show Notes:
Related to Copilot: Article: “GitHub Copilot Isn’t Worth the Risk”; Tabnine; Big Code Project
Related to Galactica: Model website; Article: “Galactica: the AI knowledge base that makes stuff up”
Books: “Interpretable Machine Learning” by Christoph Molnar; “Modeling Mindsets” by Christoph Molnar
Nov 16, 2022 • 49min

Protecting us with the Database of Evil

Online platforms and their users are susceptible to a barrage of threats, from disinformation to extremism to terror. Daniel and Chris chat with Matar Haller, VP of Data at ActiveFence, a leader in identifying online harm. ActiveFence uses a combination of AI technology and leading subject matter experts to provide Trust & Safety teams with precise, real-time data, in-depth intelligence, and automated tools to protect users and ensure safe online experiences.

Featuring: Matar Haller, Chris Benson, Daniel Whitenack

Show Notes: ActiveFence
Nov 8, 2022 • 44min

Hybrid computing with quantum processors

It’s been a while since we’ve touched on quantum computing. It’s time for an update! This week we talk with Yonatan Cohen from Quantum Machines about real progress being made in the practical construction of hybrid computing centers with a mix of classical processors, GPUs, and quantum processors. Quantum Machines is building both hardware and software to help control, program, and integrate quantum processors within a hybrid computing environment.

Featuring: Yonatan Cohen, Chris Benson, Daniel Whitenack

Show Notes: Quantum Machines
Nov 1, 2022 • 37min

The practicalities of releasing models

Recently, Chris and Daniel briefly discussed Open RAIL-M licensing and model releases on Hugging Face. In this episode, Daniel follows up on this topic based on some recent practical experience. Also included is a discussion about graph neural networks, message passing, and tweaking synthesized voices!

Featuring: Chris Benson, Daniel Whitenack

Show Notes: Daniel’s team license from recent work; Graph Neural Network courses from Zak Jost; Coqui voice studio
Oct 26, 2022 • 33min

AI adoption in large, well-established companies

This panel discussion was recorded at a recent event hosted by Aryballe, a company that we previously featured on the podcast (#120). We got a chance to discuss the AI-driven technology transforming the odor/fragrance industries, and we went down the rabbit hole discussing how this technology is being adopted at large, well-established companies.

Featuring: Mary Fischer-Mullins, Yanis Caritu, Daniel Whitenack

Show Notes: Aryballe; Cox Automotive; Previous episode with Aryballe
Oct 18, 2022 • 50min

Data for All

People are starting to wake up to the fact that they have control and ownership over their data, and governments are moving quickly to legislate these rights. John K. Thompson has written a new book on the topic that is a must-read! We talk about the new book in this episode, along with how practitioners should be thinking about data exchanges, privacy, trust, and synthetic data.

Featuring: John K. Thompson, Chris Benson, Daniel Whitenack

Show Notes: Use the code podpracticalAI19 for 40% off Data for All, along with all Manning products in all formats!
Books: “Data for All” by John K. Thompson
John’s other books: “Building Analytics Teams” by John K. Thompson; “Analytics” by John Thompson and Shawn Rogers
Oct 12, 2022 • 42min

What's up, DocQuery?

Chris sits down with Ankur Goyal to talk about DocQuery, Impira’s new open source ML model. DocQuery lets you ask questions about semi-structured data (like invoices) and unstructured documents (like contracts) using Large Language Models (LLMs). Ankur illustrates many of the ways DocQuery can help people tame documents, and references Chris’s real-life tasks as a non-profit director to demonstrate that DocQuery is indeed practical AI.

Featuring: Ankur Goyal, Chris Benson

Show Notes: DocQuery; DocQuery Announcement; DocQuery Blog Announcement; DocQuery | GitHub; Impira
