Xiaoqiang Lin

Ph.D. student at the National University of Singapore and former Meta researcher who led the REFRAG work; expert on retrieval-augmented generation, chunk-embedding compression, and latency optimizations for LLM-based systems.

Best podcasts with Xiaoqiang Lin

Ranked by the Snipd community

10 snips

Nov 3, 2025 • 60min

REFRAG with Xiaoqiang Lin - Weaviate Podcast #130!

Xiaoqiang Lin, a Ph.D. student at the National University of Singapore and former Meta researcher, discusses REFRAG, a method for improving retrieval-augmented generation. He explains how REFRAG speeds up LLM inference, making Time-To-First-Token 31x faster. The discussion also covers multi-granular chunk embeddings, performance trade-offs in compression, and the future of agentic AI. Listeners will learn about the balance between data and architecture for long-context capabilities and the practical compute requirements for training.
