

Leveraging Documents and Data to Create a Custom LLM Chatbot
Apr 5, 2024
Calvin Hendryx-Parker, Co-founder and CTO of Six Feet Up, talks about customizing an LLM chatbot for accessing farm research data stored as PDFs spanning 50 years. He discusses tools like LangChain and ChromaDB for vectorizing data, as well as creating a chatbot from a conference website using Django and Python prompt-toolkit.
AI Snips
Beck's Hybrids Research
- Beck's Hybrids, a family-owned seed company, publishes "Practical Farm Research" books.
- These books contain research data to help farmers, covering topics like planting and harvesting.
PDF Data Challenges
- The challenge lies in converting unstructured PDF data into a format suitable for AI models.
- Visual elements like highlights and hidden text in PDFs pose problems for AI parsing.
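One common way to get text extracted from a PDF into a shape that suits vectorizing is to split each page into overlapping word windows and keep the source file and page number with every piece. The sketch below is only an illustration of that idea, not the approach described in the episode: the `Chunk` class, the `chunk_page` function, and the `size` and `overlap` defaults are all hypothetical, and it assumes the page text has already been extracted by some other tool.

```python
from dataclasses import dataclass


@dataclass
class Chunk:
    """A piece of page text plus the metadata needed to cite it later."""
    text: str
    source: str  # hypothetical: file name of the PDF
    page: int    # hypothetical: page number the text came from


def chunk_page(text, source, page, size=200, overlap=50):
    """Split already-extracted page text into overlapping word windows.

    size and overlap are counted in words; both defaults are arbitrary
    illustration values, not recommendations.
    """
    if size <= 0 or not 0 <= overlap < size:
        raise ValueError("need size > 0 and 0 <= overlap < size")

    words = text.split()
    step = size - overlap
    chunks = []
    for start in range(0, len(words), step):
        window = words[start:start + size]
        chunks.append(Chunk(" ".join(window), source, page))
        # Stop once a window has reached the end of the page text.
        if start + size >= len(words):
            break
    return chunks
```

Keeping `source` and `page` on each chunk is what later makes it possible to point a reader back to the original document.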
Building Trust and Accuracy
- Provide citations and contact information within chatbot responses for increased trust and accuracy.
- Build in observability frameworks to log and review prompts and responses for quality control.
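As a rough sketch of the two points above, the code below appends a source list and a contact line to an answer, and writes each prompt and response as one JSON line so it can be reviewed later. Everything here is an assumption made for illustration: the function names `format_answer` and `log_exchange`, the citation dictionary keys, and the record fields are hypothetical, and a real deployment might use a dedicated observability framework instead of a plain log stream.

```python
import json
from datetime import datetime, timezone


def format_answer(answer, citations, contact):
    """Append numbered sources and a contact line to a chatbot answer.

    citations is assumed to be a list of dicts with 'source' and 'page' keys.
    """
    lines = [answer, "", "Sources:"]
    for number, citation in enumerate(citations, start=1):
        lines.append(f"[{number}] {citation['source']}, page {citation['page']}")
    lines.append(f"Questions? Contact {contact}")
    return "\n".join(lines)


def log_exchange(stream, prompt, response, citations):
    """Write one prompt/response pair to stream as a single JSON line."""
    record = {
        "timestamp": datetime.now(timezone.utc).isoformat(),
        "prompt": prompt,
        "response": response,
        "citations": citations,
    }
    stream.write(json.dumps(record) + "\n")
    return record
```

A caller would pass any writable text stream, such as an open log file, as `stream`. One record per line keeps the log easy to filter and sample during quality review.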