Get the app
Xiaoqiang Lin
Ph.D. student at the National University of Singapore and former Meta researcher who led the REFRAG work; expert on retrieval-augmented generation, chunk-embedding compression, and latency optimizations for LLM-based systems.
Best podcasts with Xiaoqiang Lin
Ranked by the Snipd community
10 snips
Nov 3, 2025
• 60min
REFRAG with Xiaoqiang Lin - Weaviate Podcast #130!
chevron_right
Xiaoqiang Lin, a Ph.D. student at the National University of Singapore and former Meta researcher, dives into the innovative REFRAG method for enhancing retrieval-augmented generation. He explains how REFRAG improves LLM inference speeds, making Time-To-First-Token 31x faster. The discussion also covers multi-granular chunk embeddings, performance trade-offs in compression, and the exciting future of agentic AI. Listeners will learn about the balance between data and architecture for long-context capabilities and the practical compute requirements for training.
The AI-powered Podcast Player
Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
Get the app