undefined

Ram Sriharsha

VP of Engineering at Pinecone, a vector database company. Expert in large-scale data processing, machine learning, and the application of vector databases to RAG.

Top 3 podcasts with Ram Sriharsha

Ranked by the Snipd community
undefined
65 snips
Jan 29, 2024 • 35min

Building and Deploying Real-World RAG Applications with Ram Sriharsha - #669

Ram Sriharsha, VP of Engineering at Pinecone and an expert in large-scale data processing, explores the transformative power of vector databases and retrieval augmented generation (RAG). He discusses the trade-offs between LLMs and vector databases for effective data retrieval. The conversation sheds light on the evolution of RAG applications, the complexities of maintaining fresh enterprise data, and the exciting new features of Pinecone's serverless offering, which enhances scalability and cost efficiency. Ram also shares insights on the future of vector databases in AI.
undefined
21 snips
Aug 12, 2024 • 44min

#234 High Performance Generative AI Applications with Ram Sriharsha, CTO at Pinecone

Ram Sriharsha, CTO at Pinecone and a veteran in software engineering, dives into the fascinating world of generative AI applications. He discusses the problem of hallucinations in AI and how retrieval augmented generation can help. Ram explores practical uses for vector databases in chatbots, optimizing performance, and the importance of structured data. He also highlights the future of large language models and the crucial role of data engineering in enhancing AI efficiency. Get ready for a tech-packed conversation that uncovers the secrets of high-performance AI!
undefined
Jan 22, 2024 • 28min

Generation AI Podcast Episode #6 Ram Sriharsha

Ram Sriharsha, VP of Engineering at Pinecone, discusses vector databases and LLMs, including Hybrid Search and both sparse and dense retrieval. He introduces the concept of retrieval augmented generation (RAG) to enhance language models with knowledge from vector databases. The podcast explores the importance of search relevancy in AI chatbot development and the impact of longer sequence windows in language models. Challenges of cloud native platforms and the future possibilities of machine learning models are also discussed.