Latent Space: The AI Engineer Podcast cover image

Mapping the future of *truly* Open Models and Training Dolly for $30 — with Mike Conover of Databricks

Latent Space: The AI Engineer Podcast

00:00

The Evolution of Langchain Models

The models used in Langchain are excellent because they can provide a gradient for learning without any consequences. However, one limitation is that these models cannot die or break an arm, which means they don't have any consequences for their suggestions. In the future, it would be interesting to see if models could be punished or have consequences for generating incorrect or inappropriate content. One advantage of Langchain models is that they can recognize when they are wrong. By asking the model to evaluate its own generated utterance, it can determine if it is correct or not. This is possible due to the attention weights, which allow the model to attend to the entire passage. In contrast, for next token prediction, the model can only see the prefix and make stochastic choices with no preference for factuality. When the model is evaluated on the complete generated passage, it can attend to all tokens simultaneously, making it easier to assess correctness. This insight highlights the benefit of attention weights and how they improve the model's ability to recognize its own mistakes.

Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner