

The Data Exchange with Ben Lorica
Ben Lorica
A series of informal conversations with thought leaders, researchers, practitioners, and writers on a wide range of topics in technology, science, and of course big data, data science, artificial intelligence, and related applications. Anchored by Ben Lorica (@BigData), the Data Exchange also features a roundup of the most important stories from the worlds of data, machine learning, and AI. Detailed show notes for each episode can be found at https://thedataexchange.media/. The Data Exchange podcast is a production of Gradient Flow [https://gradientflow.com/].
Episodes

Nov 28, 2024 • 38min
2024 Generative AI in Healthcare Survey Results
David Talby, CTO of John Snow Labs and an expert in healthcare AI, dives into the results of the 2024 Generative AI in Healthcare Survey. He shares insights on how healthcare organizations are budgeting for generative AI and the increasing use of large language models. The discussion highlights the need for real patient data in validating AI models and navigating privacy concerns. Talby emphasizes the delicate balance between general-purpose and specialized AI, reflecting on the transformative potential and ethical challenges in the healthcare sector.

Nov 21, 2024 • 49min
Monthly Roundup: BAML, Tencent’s Hunyuan Model, AI & Kubernetes, and the Future of Voice AI
Paco Nathan, a principal developer relations engineer at Senzing and founder of the boutique consultancy Derwen, dives into the latest advancements in AI. He discusses BAML, a user-friendly language for AI applications, and Boundary ML's Prompt Fiddle tool for simplifying model experimentation. The conversation also covers AI innovations in biotech, including how major firms are revolutionizing drug development, and the integration of drones in entertainment. Finally, they touch on the significance of cultural insights from Isabel Wilkerson's work and resources like the AI Incident Database.

Nov 14, 2024 • 38min
Building the Future of Finance: Inside AI Valuation Bots
Vasant Dhar, a professor at NYU's Stern School of Business, shares insights on the intersection of AI and finance. He discusses the Damodaran Bot, which emulates the valuation methods of a finance legend. The conversation explores the transformative role of AI in financial analysis, such as the integration of narratives and quantitative data. Dhar probes the complexities of valuing tech companies and the challenges of maintaining accuracy in AI assessments. Listeners also gain insights into innovative AI tools shaping the future of finance.

Nov 7, 2024 • 46min
Unleashing the Power of BAML in LLM Applications
Vaibhav Gupta, CEO and co-founder of Boundary, discusses BAML, an open-source language designed to enhance interactions with large language models. He delves into the vital role of data quality in retrieval-augmented generation and shares insights on improving model accuracy through error correction techniques. Gupta highlights BAML's practical applications for data extraction from unstructured sources, emphasizing its efficiency over traditional formats. The conversation reveals how BAML can transform various industries by streamlining workflows and boosting developer productivity.

Oct 31, 2024 • 30min
Cracking the Code: How Enterprises Are Adopting Generative AI
Tim Persons, an AI Leader at PwC specializing in next-generation audit and trust solutions, delves into the intricate world of generative AI adoption. He discusses how companies are cautiously implementing generative AI, focusing on internal applications first. The conversation highlights the increasing budgets and underestimated costs of deployment, emphasizing trust and cultural adaptation. Persons also stresses the importance of cross-functional collaboration, the necessity for workforce education, and learning by doing to navigate the evolving landscape of AI technologies.

Oct 24, 2024 • 53min
Monthly Roundup: Ray Compiled Graphs, Llama 3.2 and Multimodal AI, and Structured Data for RAG
In this insightful conversation, Paco Nathan, founder of Derwen and an expert in Data and AI, explores groundbreaking innovations from the Ray Summit, focusing on Ray Compiled Graphs for GPU efficiency. He dives into the complexities of AI regulation and the implications of recent legislative actions in California. The dialogue also highlights the integration of structured and unstructured data, the significance of user annotations, and the competitive dynamics within AI, including the advances of the Llama 3.2 model and its multimodal capabilities.

Oct 17, 2024 • 41min
Reimagining Code: The AI-Driven Transformation of Programming and Data Analytics
Matt Welsh, a technical leader at Aryn AI and former Harvard professor famous for his connection to Mark Zuckerberg, discusses how AI is transforming programming and data analytics. He highlights the shift towards natural language coding, making programming accessible to non-techies. The conversation delves into the importance of human oversight in AI-generated code and the potential of AI to refine mentorship and ETL processes. Welsh also explores the challenges of working with knowledge graphs and emphasizes the need for robust evaluation tools in AI development.

Oct 10, 2024 • 51min
The Security Debate: How Safe is Open-Source Software?
Mars Lan, Co-founder and CTO of Metaphor, sheds light on the security challenges surrounding open-source software, debunking myths of its safety in critical industries. He discusses the complexities of dependency management, revealing common vulnerabilities in popular programming languages like Python and TypeScript. The conversation also dives into the contrasting security dynamics of open-source versus proprietary software and emphasizes accountability. Additionally, Lan highlights how Metaphor enhances data understanding and trust through innovative graph technologies.

Oct 3, 2024 • 60min
Generative AI in Voice Technology
Yishay Carmiel, CEO of Meaning, delves into the innovative world of generative AI in voice technology. He shares insights on real-time voice transformation and the emotional connections users can form with AI. The discussion highlights advancements in text-to-speech systems and the implications of deepfakes. Yishay emphasizes the ethical considerations surrounding voice cloning and the debate over open vs. closed-source technologies, while showcasing how these innovations are shaping customer support and human-computer interaction.

Sep 26, 2024 • 38min
Building An Experiment Tracker for Foundation Model Training
Aurimas Griciūnas, Chief Product Officer at Neptune.AI, dives into the complexities of training large language models and the critical need for effective experiment tracking. He discusses the transition from MLOps to LLMOps and how traditional tools struggle with the data demands of foundation models. Griciūnas highlights the challenges of operating massive GPU clusters and the importance of checkpoints for fault tolerance. The episode also covers breakthroughs in AI reasoning and the fine-tuning approaches essential for enterprises navigating this evolving landscape.


