
Infinite Curiosity Pod with Prateek Joshi
The best place to find out how AI builders build. The host Prateek Joshi interviews world-class AI founders and VCs on this podcast. You can visit prateekj.com to learn more about the host.
Latest episodes

Oct 9, 2023 • 23min
Prateek talks about LLMs learning to represent space and time
In this episode, the host Prateek Joshi talks about LLMs learning to represent space and time. Here's the paper where the authors have discussed it in detail: https://browse.arxiv.org/pdf/2310.02207.pdf--------Where to find Prateek Joshi: Newsletter: https://prateekjoshi.substack.com Website: https://prateekj.com LinkedIn: https://www.linkedin.com/in/prateek-joshi-91047b19 Twitter: https://twitter.com/prateekvjoshi

Oct 2, 2023 • 18min
Prateek talks about the Reversal Curse of LLMs
In this episode, the host Prateek Joshi talks about the Reversal Curse phenomenon in LLMs. Here's the paper where the authors have discussed it in detail: https://owainevans.github.io/reversal_curse.pdf--------Where to find Prateek Joshi: Newsletter: https://prateekjoshi.substack.com Website: https://prateekj.com LinkedIn: https://www.linkedin.com/in/prateek-joshi-91047b19 Twitter: https://twitter.com/prateekvjoshi

Sep 25, 2023 • 43min
Generative AI x Synthetic data | Ali Golshan, cofounder and CEO of Gretel AI
Ali Golshan is the cofounder and CEO of Gretel AI, a synthetic data platform for ML developers. They have raised $65M in funding so far from investors such as Greylock and Anthos. He was previously the cofounder of StackRox, which was acquired by Red Hat for about $450M. Prior to that, he was the cofounder of Cyphort, which was acquired by Juniper Networks. In this episode, we cover a range of topics including: - The need for synthetic data - Techniques to generate synthetic data - How can AI enhance the synthetic data generation process - Computational irreducibility - Differential privacy - Measuring the performance of the engine that generates synthetic data Ali's favorite books: - The Order of Time (Author: Carlo Rovelli) - The Coddling of the American Mind (Authors: Greg Lukianoff and Jonathan Haidt) --------Where to find Prateek Joshi: Newsletter: https://prateekjoshi.substack.com Website: https://prateekj.com LinkedIn: https://www.linkedin.com/in/prateek-joshi-91047b19 Twitter: https://twitter.com/prateekvjoshi

Sep 18, 2023 • 51min
Programmatic AI data development, Multimodal AI, False dichotomy of fine-tuning vs RAG, Compute-optimal LLMs | Alex Ratner, CEO of Snorkel AI
Alex Ratner is the CEO of Snorkel AI, a platform that provides programmatic data labeling and foundation models to enable companies to build AI applications. They've raised $135M so far from amazing investors such as Addition, Greylock, Google Ventures, and Lightspeed. He was previously the cofounder and CEO of SiftPage. He has a bachelors degree from Harvard and a PhD from Stanford. In this episode, we cover a range of topics including: - Making AI data development first-class and programmatic - The data-centric step for every model-centric step - False dichotomy of fine tuning vs RAG - Foundation model dynamics: winner take all vs diverse models - Training compute-optimal LLMs - Designing multimodal datasets (DataComp) - Distilling Step-by-Step - 'GPT-You' for every enterprise Alex's favorite books: Foundation series books (Author: Isaac Asimov)--------Where to find Prateek Joshi: Newsletter: https://prateekjoshi.substack.com Website: https://prateekj.com LinkedIn: https://www.linkedin.com/in/prateek-joshi-91047b19 Twitter: https://twitter.com/prateekvjoshi

Sep 11, 2023 • 39min
Generative AI x Privacy | Riddhiman Das, CEO of TripleBlind
Riddhiman Das is the CEO of TripleBlind, a privacy platform for AI. They have raised $32M in funding, with their most recent round led by General Catalyst. He was previously the Head of International Technology Investments at Ant Financial, which is Alibaba's financial services arm. He was the Product Architect at Zoloz, Chairman at Laplacian, Chief Data Offier at mySideWalk, and CTO of Galleon Labs. He has received the 2013 White House Champions of Change from President Barack Obama. In this episode, we cover a range of topics including: - Attack surface of an AI application - What are the ways in which privacy can be compromised during training and deployment of AI models - Role based access control for Generative AI applications - Data leakage in Generative AI applications - Characteristics of a good privacy product - How is TripleBlind used in healthcare and financial sectors Riddhiman's favorite book: Twenty Thousand Leagues Under the Sea (Author: Jules Verne)--------Where to find Prateek Joshi: Newsletter: https://prateekjoshi.substack.com Website: https://prateekj.com LinkedIn: https://www.linkedin.com/in/prateek-joshi-91047b19 Twitter: https://twitter.com/prateekvjoshi

Sep 7, 2023 • 31min
Prateek talks about AI infrastructure ideas and categories
In this episode, the host Prateek Joshi talks about AI infrastructure ideas and categories including: Model infrastructure: - Compute hardware - ML frameworks - Foundation model providers - Distributed model training - Model deployment - Building and serving verticalized models - Streaming ML models - Monitoring and logging - Experimentation framework - On-device applications - Orchestration platform - Latency - User feedback loopData infrastructure: - Storage hardware - Data acquisition - Databases - Data labeling - Cataloging product usage data - Privacy and security - Backup and redundancy systems --------Where to find Prateek Joshi: Newsletter: https://prateekjoshi.substack.com Website: https://prateekj.com LinkedIn: https://www.linkedin.com/in/prateek-joshi-91047b19 Twitter: https://twitter.com/prateekvjoshi

Sep 5, 2023 • 44min
AI coprocessor for data, Relational knowledge graphs | Molham Aref, CEO of RelationalAI
Molham Aref is the CEO of RelationalAI, an AI coprocessor for the data cloud. They have raised $122M in funding from the likes of Tiger Global, Madrona, Addition, and Menlo Ventures. He is a serial enterpreneur and has been the CEO of LogicBlox, Predictix, and Optimi.In this episode, we cover a range of topics including: - Relational knowledge graphs - Knowledge graphs for AI-driven applications - What is an AI coprocessor - Graph analytics - Interaction between ML infrastructure and knowledge graph infrastructure - Data infrastructure for AI compute Molham's favorite book: The Datapreneurs (Author: Bob Muglia)--------Where to find Prateek Joshi: Newsletter: https://prateekjoshi.substack.com Website: https://prateekj.com LinkedIn: https://www.linkedin.com/in/prateek-joshi-91047b19 Twitter: https://twitter.com/prateekvjoshi

Aug 31, 2023 • 18min
Prateek talks about 8 strategies to speed up ML on different hardware platforms
In this episode, the host Prateek Joshi talks about:- why do we care about this problem- who needs it- how does it work- list of 8 strategies to speed up ML--------Where to find Prateek Joshi: Newsletter: https://prateekjoshi.substack.com Website: https://prateekj.com LinkedIn: https://www.linkedin.com/in/prateek-joshi-91047b19 Twitter: https://twitter.com/prateekvjoshi

Aug 28, 2023 • 40min
Object store for AI workloads | Anand Babu Periasamy, cofounder and CEO of MinIO
Anand Babu "AB" Periasamy is the cofounder and CEO of MinIO, a high performance object storage for AI that's built for large scale workloads. They have raised $126M in funding from the likes of General Catalyst, Softbank, Intel Capital, and Nexus Venture Partners. It's the world's fastest growing object storage company with more than 1 billion Docker pulls and more than 35K stars on GitHub. He's also an angel investor with investments in companies like H2O.ai, Isovalent, Starburst, Postman, and many more. He was previously the cofounder and CTO of Gluster, which got acquired by Red Hat. In this episode, we cover a range of topics including: - Why is storage important for AI workflows - What are the characteristics of a good data storage product - Repatriation of data from public cloud to on-prem - Running ML experiments in parallel - AI compute offerings from data infrastructure providers - Making data infrastructure faster and cheaper AB's favorite book: An Awesome Book! (Author: Dallas Clayton)--------Where to find Prateek Joshi: Newsletter: https://prateekjoshi.substack.com Website: https://prateekj.com LinkedIn: https://www.linkedin.com/in/prateek-joshi-91047b19 Twitter: https://twitter.com/prateekvjoshi

Aug 21, 2023 • 40min
AI assistant for software development | Eran Yahav, cofounder and CTO of Tabnine
Eran Yahav is the cofounder and CTO of Tabnine, an AI assistant that developers can use to build software faster. He's a professor at Technion - Israel Institute of Technology and was previously a researcher at IBM. He has a PhD in Computer Science from Tel Aviv University. In this episode, we cover a range of topics including: - Tasks in software development - What tasks are likely to benefit from LLMs - The launch of Tabnine Chat - Characteristics of a good AI coding assistant - Making AI coding assistants context-aware - Generic LLMs vs domain specific LLMs - AI copilot for devops work Eran's favorite book: Catch-22 (Author: Joseph Heller)--------Where to find Prateek Joshi: Newsletter: https://prateekjoshi.substack.com Website: https://prateekj.com LinkedIn: https://www.linkedin.com/in/prateek-joshi-91047b19 Twitter: https://twitter.com/prateekvjoshi