The Data Exchange with Ben Lorica

Ben Lorica

A series of informal conversations with thought leaders, researchers, practitioners, and writers on a wide range of topics in technology, science, and of course big data, data science, artificial intelligence, and related applications. Anchored by Ben Lorica (@BigData), the Data Exchange also features a roundup of the most important stories from the worlds of data, machine learning, and AI. Detailed show notes for each episode can be found at https://thedataexchange.media/. The Data Exchange podcast is a production of Gradient Flow [https://gradientflow.com/].

Episodes

Sep 19, 2024 • 46min

Monthly Roundup: AI Regulations, GenAI for Analysts, Inference Services, and Military Applications

Paco Nathan, founder of Derwen, discusses pressing topics in AI and technology. The conversation dives into significant regulatory efforts like California's Senate Bill 1047 aimed at managing AI standards while fostering innovation. They explore how AI tools empower consumers against insurance claim denials and the legal challenges surrounding AI technologies. The podcast also highlights AI's impact on military strategies in Ukraine, ethical concerns about AI in warfare, and the necessity for flexible hardware and software integration in AI systems.

Sep 12, 2024 • 38min

Unlocking the Power of LLMs with Data Prep Kit

Petros Zerfos and Hima Patel, both from IBM Research, are key developers of Data Prep Kit, an open-source toolkit that facilitates data preparation for large language models. They discuss how DPK enhances the processing of raw text and code data, emphasizing its features like data cleansing and deduplication. The duo highlights its compatibility with cloud environments and vector databases. They also explore multimodal capabilities, showcasing its potential for processing diverse data types, including documents in multiple languages.

Sep 5, 2024 • 25min

Advancing AI: Scaling, Data, Agents, Testing, and Ethical Considerations

Dr. Andrew Ng, a leading AI visionary and founder of DeepLearning.AI, shares his insights on the transformative power of AI. He discusses the evolution of GPU technology and its pivotal role in data-centric AI. The conversation highlights the game-changing impact of large language models on user interactions and enterprise applications. Ng also addresses the future of reinforcement learning and the ethical considerations tied to AI deployment, emphasizing the need for a community-driven approach to innovation in the field.

Aug 29, 2024 • 48min

Bridging the Hardware-Software Divide in AI

Jay Dawani, CEO of Lemurian Labs, dives into the challenges of bridging hardware and software in AI development. He discusses how model size influences performance and hurdles in achieving artificial general intelligence. The conversation highlights the critical need for seamless integration between training and inference, as well as the complexities of AI deployment. Dawani also explores the future of supercomputing in AI and the importance of optimizing data representation, showcasing innovative strategies to enhance computational capabilities.

Aug 22, 2024 • 44min

Monthly Roundup: The Economic Realities of Large Language Models

Paco Nathan, founder of Derwen, dives into the latest advancements in large language models, notably the launch of Llama 3.1 with its 405 billion parameters. He discusses the daunting financial challenges faced by AI developers, emphasizing the competition between startups and tech giants. The conversation also covers cutting-edge research on neural operators, the shift towards custom AI solutions, and vulnerabilities in AI software supply chains. Additionally, listeners are introduced to innovative tools like the Relic library and insights into the cultural impact of technology.

Aug 15, 2024 • 45min

From Hype to Reality: The Current State of Enterprise Generative AI Adoption

Evangelos Simoudis, Managing Director at Synapse Partners, dives into the current landscape of enterprise generative AI adoption. He discusses the cautious but optimistic investments by corporations, the hurdles in transitioning from experimentation to real-world applications, and the critical role of data quality. Simoudis highlights how generative AI enhances productivity in customer support and the complexities of integrating AI into existing processes. He also addresses the financial dynamics of AI investments and the importance of strategic differentiation for startups.

Aug 8, 2024 • 35min

Automating Unstructured Data Extraction with LLMs

Shuveb Hussain, co-founder of Unstract, discusses his innovative no-code platform that automates the extraction of structured data from unstructured documents. He highlights the rise of prompt engineers and their role in data transformation. The conversation dives into the complexities of using large language models and the critical importance of quality optical character recognition. Hussain also addresses the fine-tuning of language models for specific needs and the integration of diverse document types, showcasing how these advancements enhance data processing efficiency.

Aug 1, 2024 • 36min

Generative AI in Context: Hybrid Intelligence and Responsible Development

Alfred Spector, a distinguished expert in networked computing and former leader at IBM, Google, and Two Sigma, discusses generative AI and responsible development. He emphasizes the importance of context in data science to avoid critical pitfalls. The conversation dives into ethical AI practices, arguing for interdisciplinary education to navigate technological impacts. Spector also addresses the need for AI literacy to promote effective integration and explores the challenges of regulating advanced AI while the technology changes rapidly.

Jul 25, 2024 • 46min

Monthly Roundup: Navigating the Peaks and Valleys of Generative AI Technology

Paco Nathan, founder of Derwen, discusses the latest in generative AI technology. Topics include Anthropic's Claude 3.5 Sonnet release, managing risks in AI advancements, enhancing RAG models with graphs, accelerating protein evolution, weather model advancements, AI's role in mathematics, shady AI practices, and summer book recommendations.

Jul 18, 2024 • 35min

From Preparation to Recovery: Mastering AI Incident Response

Andrew Burt, co-founder of Luminos.Law and Luminos.ai, discusses AI incident response challenges and preparation. Topics include defining incidents in AI systems, specialized response teams, regulations like SB 1047, contrasting US and European approaches to AI regulation, and the importance of detecting and stopping AI failures.
