2min snip

High Agency: The Podcast for AI Builders cover image

Building the first LLM-based search engine for developers with Michael Royzen

High Agency: The Podcast for AI Builders

NOTE

Optimize for Speed and Relevance in Technical Search

The architecture of the search system embraces a straightforward yet effective retrieval step that leverages intelligent query rewriting to enhance relevance, particularly for technical searches. It employs a compact, high-speed language model that reformulates user queries and combines this with on-the-fly pre-computation to determine whether a single search is sufficient or if multiple searches are necessary. With the help of a classifier, the system can execute up to eight parallel searches, optimizing speed and throughput to ensure that the embedding process completes within 100 milliseconds, even when processing substantial amounts of data. The context formed from multiple sources is then sent to both GPT models and custom-developed models, showcasing a continuous evolution in their technical capabilities due to initial limitations in existing model APIs. The integration of GPT-4 significantly boosted user engagement and popularity, culminating in a notable Hacker News milestone.

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode