The Data Exchange with Ben Lorica cover image

The Data Exchange with Ben Lorica

Latest episodes

undefined
Sep 12, 2024 • 38min

Unlocking the Power of LLMs with Data Prep Kit

Petros Zerfos and Hima Patel, both from IBM Research, are key developers of Data Prep Kit, an open-source toolkit that facilitates data preparation for large language models. They discuss how DPK enhances the processing of raw text and code data, emphasizing its features like data cleansing and deduplication. The duo highlights its compatibility with cloud environments and vector databases. They also explore multimodal capabilities, showcasing its potential for processing diverse data types, including documents in multiple languages.
undefined
Sep 5, 2024 • 25min

Advancing AI: Scaling, Data, Agents, Testing, and Ethical Considerations

Dr. Andrew Ng, a leading AI visionary and founder of DeepLearning.AI, shares his insights on the transformative power of AI. He discusses the evolution of GPU technology and its pivotal role in data-centric AI. The conversation highlights the game-changing impact of large language models on user interactions and enterprise applications. Ng also addresses the future of reinforcement learning and the ethical considerations tied to AI deployment, emphasizing the need for a community-driven approach to innovation in the field.
undefined
Aug 29, 2024 • 48min

Bridging the Hardware-Software Divide in AI

Jay Dawani, CEO of Lemurian Labs, dives into the challenges of bridging hardware and software in AI development. He discusses how model size influences performance and hurdles in achieving artificial general intelligence. The conversation highlights the critical need for seamless integration between training and inference, as well as the complexities of AI deployment. Dawani also explores the future of supercomputing in AI and the importance of optimizing data representation, showcasing innovative strategies to enhance computational capabilities.
undefined
Aug 22, 2024 • 44min

Monthly Roundup: The Economic Realities of Large Language Models

Paco Nathan, founder of Derwen, dives into the latest advancements in large language models, notably the launch of LAMA 3.1 with its groundbreaking 400 billion parameters. He discusses the daunting financial challenges faced by AI developers, emphasizing the competition between startups and tech giants. The conversation also covers cutting-edge research on neural operators, the shift towards custom AI solutions, and vulnerabilities in AI software supply chains. Additionally, listeners are introduced to innovative tools like the Relic library and insights into the cultural impact of technology.
undefined
Aug 15, 2024 • 45min

From Hype to Reality: The Current State of Enterprise Generative AI Adoption

Evangelos Simoudis, Managing Director at Synapse Partners, dives into the current landscape of enterprise generative AI adoption. He discusses the cautious but optimistic investments by corporations, the hurdles in transitioning from experimentation to real-world applications, and the critical role of data quality. Simoudis highlights how generative AI enhances productivity in customer support and the complexities of integrating AI into existing processes. He also addresses the financial dynamics of AI investments and the importance of strategic differentiation for startups.
undefined
Aug 8, 2024 • 35min

Automating Unstructured Data Extraction with LLMs

Shuveb Hussain, co-founder of Unstract, discusses his innovative no-code platform that automates the extraction of structured data from unstructured documents. He highlights the rise of prompt engineers and their role in data transformation. The conversation dives into the complexities of using large language models and the critical importance of quality optical character recognition. Hussain also addresses the fine-tuning of language models for specific needs and the integration of diverse document types, showcasing how these advancements enhance data processing efficiency.
undefined
Aug 1, 2024 • 36min

Generative AI in Context: Hybrid Intelligence and Responsible Development

Alfred Spector, a distinguished expert in networked computing and former leader at IBM, Google, and Two Sigma, discusses pressing topics around generative AI and responsible development. He emphasizes the importance of context in data science to avoid critical pitfalls. The conversation dives into ethical AI practices, arguing for interdisciplinary education to navigate technological impacts. Spector also addresses the pressing need for AI literacy to promote effective integration and explores the challenges of regulating advanced AI amid rapid advancements.
undefined
Jul 25, 2024 • 46min

Monthly Roundup: Navigating the Peaks and Valleys of Generative AI Technology

Paco Nathan, founder of Derwen, discusses the latest in generative AI technology. Topics include Entronfic's Sonnet 3.5 release, managing risks in AI advancements, enhancing RAG models with graphs, accelerating protein evolution, weather model advancements, AI's role in mathematics, and shady AI practices with summer book recommendations.
undefined
Jul 18, 2024 • 35min

From Preparation to Recovery: Mastering AI Incident Response

Andrew Burt, co-founder of Luminos.Law and Luminos.ai, discusses AI incident response challenges and preparation. Topics include defining incidents in AI systems, specialized response teams, regulations like SB 1047, contrasting US and European approaches to AI regulation, and the importance of detecting and stopping AI failures.
undefined
Jul 11, 2024 • 50min

Unlocking the Power of Unstructured Data

CEO Chang She of LanceDB discusses the challenges and innovations in managing unstructured data for AI, including developing new data formats, optimizing AI training workloads, and enhancing applications with multimodal embeddings and vector search.

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode